Beating GPT-4o and Claude 3.5 on SWE-bench Lite through repeated sampling

5 points | aglazer | 1 year ago | arxiv.org
