top | item 46727199 (no title) kevinlu1248 | 1 month ago Unfortunately, the main optimization (3x speedup) is using n-gram spec dec which doesn't run on CPUs. But I believe it works on Metal at least. discuss order hn newest No comments yet.
No comments yet.