I'm gonna call bs on these kind of comments. "better" on what? Coding models shouldn't even be compared isolated. A big part of making it work in a real/big codebase is the tool that calls the model (claude code, gemini-cli, etc). I'll bet claude code will still keep stealing your lunch every day of the week against any competitor out there
koakuma-chan|2 months ago
dkdcio|2 months ago
nunodonato|2 months ago
nunodonato|2 months ago
Mkengin|2 months ago
https://swe-rebench.com/