top | new | best | ask | show | jobs

top | item 39699449

Sequoia: Speculative decoding boosting LLM inference by 8-10x

3 points| fgfm | 1 year ago |infini-ai-lab.github.io

discuss

order

No comments yet.

powered by hn/api // news.ycombinator.com