top | item 42867393

(no title)

zbendefy | 1 year ago

No, the full R1 model is ~650GB. There are quantized version that quantize it down to ~150GB.

What you can run locally are the distilled models, that is actually LLama and Qwen weights further trained on R1's output

discuss

order

No comments yet.