top | item 46040618

(no title)

quantumHazer | 3 months ago

Last year’s model were at 50-60% on SWE bench-verified actually

discuss

order

obblekk|3 months ago

I see 25-29% here https://www.swebench.com/viewer.html for models released in Nov 2024 albeit not verified. gpt4o (Aug 2024) was 33% for swe bench verified.

Important point because people have a bias to underestimate the speed of ai progress.