top | item 44615921

(no title)

untitled2 | 7 months ago

Exactly. Whom to believe?

discuss

order

JohnKemeny|7 months ago

The last time someone claimed a medal in an olympiad like this, turned out they manually translated the problem into Lean and then ran a brute force search algorithm to find a proof. For 60 hours. On a supercomputer.

Meanwhile high schoolers get a piece of paper and 4.5 hours.

wslh|7 months ago

Even though chess is now effectively solved against human players, I still remember Kasparov's suspicion that one of Deep Blue's moves had a human touch. It was never proven or disproven, but I trust Kasparov's deep intuition amplified by Kasparov requesting access to Deep Blue’s logs, and IBM refusing to share them in full. For more discussions see [1][2][3].

[1] https://chess.stackexchange.com/questions/9959/did-deep-blue...

[2] https://nautil.us/why-the-chess-computer-deep-blue-played-li...

[3] https://en.chessbase.com/post/deep-blue-s-cheating-move

throwawaymaths|7 months ago

kinda wild that an llm cant translate to lean?

changoplatanero|7 months ago

Both are true. One spent $400 in compute and the other one spent a lot more.

masterjack|7 months ago

Exactly. And presumably had a more sophisticated harness around the model, longer reasoning chains, best of N, self judging, etc

kenjackson|7 months ago

OpenAI achieved Gold on an unreleased model. GPT-5. Read the tweets and they explain what they did.

idiotsecant|7 months ago

Actually, I did it a year ago but I just don't want to release my model.