codexon | 24 days ago

You don't need to be a genius or a rocket scientist to write code, but LLMs don't even clear that bar for anything but the simplest tasks. Take a look at the video I posted earlier for an example.

And specialised models for programming HAVE plateaued.

https://livebench.ai/#/?sort=Agentic+Coding+Average

Going from Claude 4.1 to 4.5 was only an 18% gain, and from 4.5 to 4.6 the score even DECLINED. Codex 5.1 to 5.2 also shows a decline.
