top | item 46846318

(no title)

Eridrus | 28 days ago

It doesn't really feel like AI for coding is commoditized atm.

As problematic as SWE-Bench is as a benchmark, the top commercial models are far better than anything else and it seems tough to see this as anything but a 3 horse race atm.

discuss

order

No comments yet.