top | item 43894870 (no title) ericboehs | 10 months ago Interesting. Does this mean larger models could be ran on less memory? It looks like it uses 15-20x less memory. Could a 671B DeepSeek R1 be ran in just ~40-50GB of memory? It sounds like it'd be 1/3 as fast though (<1tk/sec). discuss order hn newest No comments yet.
No comments yet.