top | new | best | ask | show | jobs

top | item 43361695

Accelerate CPU Based LLM Inference with a Vector Index on the Output Embeddings

1 points| dithered_djinn | 11 months ago |martinloretz.com

discuss

order

No comments yet.

powered by hn/api // news.ycombinator.com