top | item 47093564

(no title)

rhdunn | 9 days ago

Mistral have small variants (3B, 8B, 14B, etc.), as do others like IBM Granite and Qwen. Then there are finetunes based on these models, depending on your workflow/requirements.

discuss

order

karmasimida|9 days ago

True, but anything remotely useful is 300B and above

Eupolemos|9 days ago

That is a very broad and silly position to take, especially in this thread.

I use Devstral 2 and Gemini 3 daily.