top | new | best | ask | show | jobs

top | item 46987788

NanoQuant: Efficient Sub-1-Bit Quantization of Large Language Models

13 points| chrsw | 17 days ago |arxiv.org

discuss

order

No comments yet.

powered by hn/api // news.ycombinator.com