You probably mean the USAMO 2025 paper. They updated their comparison with Gemini 2.5 Pro, which did get a nontrivial score. That Gemini version was released five days after USAMO, so while it's not entirely impossible for the data to be in its training set, it would seem kind of unlikely.https://x.com/mbalunovic/status/1907436704790651166
MatthiasPortzel|11 months ago
jsemrau|11 months ago
iamacyborg|11 months ago