top | item 42832230

yetanotherjosh | 1 year ago

ollama is stating there's a difference: https://ollama.com/library/deepseek-r1

"including six dense models distilled from DeepSeek-R1 based on Llama and Qwen."

people just don't read? not sure there's a reason to criticize ollama here.

whimsicalism|1 year ago

i’ve seen so many people make this mistake. huggingface clearly differentiates the models, but from the cli that distinction isn’t visible