top | item 42477365

(no title)

kirab | 1 year ago

FYI: Codeforces competitive programming scores (basically only) by time needed until valid solutions are posted

https://codeforces.com/blog/entry/133094

That means.. this benchmark is just saying o3 can write code faster than must humans (in a very time-limited contest, like 2 hours for 6 tasks). Beauty, readability or creativity is not rated. It’s essentially a "how fast can you make the unit tests pass" kind of competition.

discuss

order

sigbottle|1 year ago

Creativity is inherently rated because it's codeforces... most 2700 problems have unique, creative solutions.