top | item 45622732

(no title)

tkz1312 | 4 months ago

which 671b quants can fit into 96GB VRAM? Everything I’m aware of needs hundreds at least (e.g. https://apxml.com/models/deepseek-r1-671b).

discuss

order

lossolo|4 months ago

5090 is 32 GB so it's 128GB, not 96.

tkz1312|4 months ago

128 is still not 300. Something like 4x 6000 blackwell is the minimum to run any model that is going to feel anything like claude locally.

To my deep disappointment the economics are simply not there at the moment. Openrouter using only providers with zero data retention policies is probably the best option right now if you care about openness, privacy and vendor lock-in.