(no title)
sinuhe69 | 17 days ago
And I wonder how Gemini Deep Think will fare. My guess is that it will get half the way on some problems. But we will have to take an absence as a failure, because nobody wants to publish a negative result, even though it's so important for scientific research.
octoberfranklin|17 days ago
https://hn.algolia.com/?q=1stproof
This is exactly the kind of challenge I would want to judge AI systems based on. It required ten bleeding-edge-research mathematicians to publish a problem they've solved but hold back the answer. I appreciate the huge amount of social capital and coordination that must have taken.
I'm really glad they did it.
lofaszvanitt|16 days ago
blinding-streak|16 days ago
https://arxiv.org/html/2602.05192v1
ky3|16 days ago
zozbot234|17 days ago
energy123|17 days ago