Removed stray cuda call

by justbruno - opened 25 days ago

base: refs/heads/main

←

from: refs/pr/2

Discussion Files changed

-1

justbruno

25 days ago

A stray cuda call was preventing this model from being used on devices without a GPU or TPU.

The causal mask is allocated to input_ids.device upon return, as it should.

Removed stray cuda call4fe741ca

loubb

Owner 24 days ago

Thanks for pointing out this oversight - I just merged a fix.

By the way, if you intend to use this model for anything intensive, I'd reccomend checking out the training & inference (torch /w cudagraphs and MLX) implementations on the GitHub repo.

https://github.com/EleutherAI/aria

loubb changed pull request status to closed 24 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment