top | item 47086749

(no title)

adityashankar | 11 days ago

This depends on how much better the models will get from now in, if Claude Opus 4.6 was transformed into one of these chips and ran at a hypothetical 17k tokens/second, I'm sure that would be astounding, this depends on how much better claude Opus 5 would be compared to the current generation

discuss

order

empath75|11 days ago

Even an O3 quality model at that speed would be incredible for a great many tasks. Not everything needs to be claude code. Imagine Apple fine tuning a mid tier reasoning model on personal assistant/MacOs/IOS sorts of tasks and burning a chip onto the mac studio motherboard. Could you run claude code on it? Probably not, would it be 1000x better than Siri? absolutely.

JKCalhoun|10 days ago

Yeah, waiting for Apple to cut a die that can do excellent local AI.

aurareturn|11 days ago

I’m pretty sure they’d need a small data center to run a model the size of Opus.