top | item 46727199

(no title)

kevinlu1248 | 1 month ago

Unfortunately, the main optimization (3x speedup) is using n-gram spec dec which doesn't run on CPUs. But I believe it works on Metal at least.

discuss

order

No comments yet.