dpe82|12 days ago
It's wild that Sonnet 4.6 is roughly as capable as Opus 4.5 - at least according to Anthropic's benchmarks. It will be interesting to see if that's the case in real, practical, everyday use. The speed at which this stuff is improving is really remarkable; it feels like the breakneck pace of compute performance improvements of the 1990s.
madihaa|12 days ago
scottmf|11 days ago
2026: Everyone is spending $500/month on LLM subscriptions
mooreds|12 days ago
Something something ... Altman's law? Amodei's law?
Needs a name.
turnsout|12 days ago
nimonian|12 days ago
amelius|12 days ago
Yeah, but RAM prices are also back to 1990s levels.
mrcwinn|12 days ago
mikkupikku|12 days ago
dpe82|12 days ago
https://claude.ai/public/artifacts/67c13d9a-3d63-4598-88d0-5...
coffeebeqn|12 days ago
thinkling|12 days ago
https://bsky.app/profile/simonwillison.net/post/3meolxx5s722...
AstroBen|12 days ago
dyauspitr|12 days ago
satvikpendem|12 days ago
Yeah, it's really not. Sonnet still struggles while Opus, even 4.5, succeeds (and some examples show Opus 4.6 is actually even worse than 4.5, all while being more expensive and taking longer to finish).
justinhj|12 days ago
karmasimida|12 days ago
You should always take claims that smaller models are as capable as larger models with a grain of salt.
simlevesque|12 days ago
iLoveOncall|12 days ago
jwolfe|12 days ago
ge96|12 days ago
danielbln|11 days ago
estomagordo|12 days ago
crummy|12 days ago
So if you don't want to pay the significant premium for Opus, it seems like you can just wait a few weeks till Sonnet catches up.
tempestn|12 days ago
simianwords|12 days ago
Retr0id|12 days ago