(no title)
zone411 | 2 months ago
The high-reasoning version of GPT-5.2 improves on GPT-5.1: 69.9 → 77.9.
The medium-reasoning version also improves: 62.7 → 72.1.
The no-reasoning version also improves: 22.1 → 27.5.
Gemini 3 Pro and Grok 4.1 Fast Reasoning still score higher.
Donald|2 months ago
capitainenemo|2 months ago
I wonder how well AIs would do at bracket city. I tried gemini on it and was underwhelmed. It made a lot of terrible connections and often bled data from one level into the next.
bigyabai|2 months ago
tikotus|2 months ago
thanhhaimai|2 months ago
crapple8430|2 months ago
fellowniusmonk|2 months ago
Bombthecat|2 months ago
scrollop|2 months ago
sanex|2 months ago