sophia01 | 7 months ago
The difference here seems to be that Cerebras does not appear to have Qwen3-Coder through their API! So now there is a crazy fast (and apparently good too?) model that they only provide if you pay the crazy monthly sub?
social_quotient|7 months ago
The way I would use this $50 Cerebras offering is as a delegate for high-token-count items like documentation, lint fixing, and similar operations, not only to speed up the workflow but also to relieve some back pressure on Anthropic/Claude so you don't hit your limits as quickly… especially with the new weekly throttle coming. This $50 jump seems very reasonable. As for the 1k completions a day, I'd really want to see and get a feel for how chatty it is.
I suppose that's how it starts, but if the model is competent and fast, the speed alone might push you a bit to delegate more to it (maybe sub-agent tasks).
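A rough sketch of that delegation idea, in case it helps picture it. Everything here is an assumption for illustration: the task labels, the `Router` class, and the backend names are hypothetical, and nothing calls a real API. The only number taken from the plan is the 1k completions a day.

```python
from dataclasses import dataclass

# Assumption: task kinds worth delegating, taken from the examples above.
DELEGATE_KINDS = {"documentation", "lint_fix"}
# The cap of 1k completions a day mentioned for the plan.
DAILY_COMPLETION_CAP = 1000


@dataclass
class Router:
    """Hypothetical router: returns a backend label, makes no network calls."""
    used_today: int = 0

    def route(self, kind: str) -> str:
        # Send high-token-count chores to the fast model while quota remains.
        if kind in DELEGATE_KINDS and self.used_today < DAILY_COMPLETION_CAP:
            self.used_today += 1
            return "fast_delegate"
        # Everything else, or anything past the cap, stays on the primary model.
        return "primary"


router = Router()
print(router.route("documentation"))
print(router.route("architecture_review"))
```

How fast the counter climbs would depend on how chatty the tool is, which is the part I'd want to measure first.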
pxc|7 months ago
sophia01|7 months ago
baq|7 months ago
it's two kilotokens per second. that's fast.
bangaladore|7 months ago
Certainly, somewhere between fast and crazy.
amelius|7 months ago
In other words, it's needlessly fast.
ttoinou|7 months ago