(no title)
m11a | 6 months ago
I ended up blown away. via Cerebras/Groq, you're looking at around 1000 tok/sec for the 120B model. For gentic code generation, I found the abilities to exceed gpt-4.1. Tool calling was surprisingly good, albeit not as good as Qwen3 Coder for me.
It's a very capable model, and a very good release. The high throughput is a game changer.
No comments yet.