top | item 44811841

(no title)

m11a | 6 months ago

I tried these models half-sceptically.

I ended up blown away. via Cerebras/Groq, you're looking at around 1000 tok/sec for the 120B model. For gentic code generation, I found the abilities to exceed gpt-4.1. Tool calling was surprisingly good, albeit not as good as Qwen3 Coder for me.

It's a very capable model, and a very good release. The high throughput is a game changer.

discuss

No comments yet.