top | item 44637657

(no title)

nomad_horse | 7 months ago

Do I understand it correctly that OpenAI self-proclaimed that they got their gold, without the official IMO judges grading their solutions?

discuss

order

NitpickLawyer|7 months ago

Well, I don't doubt that they did get those results, but it is clear now that it was not an official collaboration. It was heavily implied in a statement by IMO's president a few days ago (the one where they said they'd prefer AI companies wait a week before announcing, so that the focus is first on the human competitors).

Goog had an official colab with IMO, and we can be sure they got those results under the imposed constraints (last year they allocated ~48h for silver IIRC) and an official grading by the IMO graders.

Tadpole9181|7 months ago

So from 48 hours for silver down to 4.5 hours for gold in one year? And all reasoning generated is clear and easy to follow? That's one hell of an improvement.

tshadley|7 months ago

Yes, OpenAI:

https://x.com/alexwei_/status/1946477754372985146

> 6/N In our evaluation, the model solved 5 of the 6 problems on the 2025 IMO. For each problem, three former IMO medalists independently graded the model’s submitted proof, with scores finalized after unanimous consensus. The model earned 35/42 points in total, enough for gold!

That means Google Deepmind is the first OFFICIAL IMO Gold.

https://x.com/demishassabis/status/1947337620226240803

> We've now been given permission to share our results and are pleased to have been part of the inaugural cohort to have our model results officially graded and certified by IMO coordinators and experts, receiving the first official gold-level performance grading for an AI system!

nomad_horse|7 months ago

Do you know if OpenAI used the same grading criteria as official judges?

jacquesm|7 months ago

Why am I not surprised?

danjl|7 months ago

Childish. And, of course they must have known there was an official LLM cohort taking the real test, and they probably even knew that Gemini got a gold medal, and may have even known that Google planned a press release for today.

cma|7 months ago

I think maybe all Altman companies have used tactics like this.

> We were trying to get a big client for weeks, and they said no and went with a competitor. The competitor already had a terms sheet from the company were we trying to sign up. It was real serious.

> We were devastated, but we decided to fly down and sit in their lobby until they would meet with us. So they finally let us talk to them after most of the day.

> We then had a few more meetings, and the company wanted to come visit our offices so they could make sure we were a 'real' company. At that time, we were only 5 guys. So we hired a bunch of our college friends to 'work' for us for the day so we could look larger than we actually were. It worked, and we got the contract.

> I think the reason why PG respects Sam so much is he is charismatic, resourceful, and just overall seems like a genuine person.

https://news.ycombinator.com/item?id=3048944

gametorch|7 months ago

(deleted because I was mistaken)

nomad_horse|7 months ago

isn't he an IOI medalist? and even if he was an IMO medalist, isn't there a bit of a conflict of interests?

cbsmith|7 months ago

I'm pretty sure when they got the gold medal they weren't allowed to judge themselves.