top | item 46155179

(no title)

lambdas | 2 months ago

Yeah, that’s not right. I’m not sure about painstakingly… it said it couldn’t make out the notation, and spat out what it thought it could read, and you never checked it - nor read the articles for context, just assumed it was to do directly with further AI work.

It picked up on the polynomial, then what it thought was a scheme/sheaf being defined is actually the finite field with six elements. It also misread “Thue” as “the”.

If you had corrected what it read from the board, then gave it the context that he was a number theorist now working for a company trying to get AI to work through proofs, then you may have got the correct answer that this appears to be them crafting problems on polynomial reduction to test how the LLM reasons about proof.

discuss

order

OutOfHere|2 months ago

> If you had corrected what it read from the board, then gave it the context that he was a number theorist now working for a company trying to get AI to work through proofs

It was just a quick and dirty chat. A proper evaluation will consider his published research to date.

ky3|2 months ago

M&Ms much? There is no finite field with six elements.

lambdas|2 months ago

Tell him that, not me; I’m simply referring to what’s on the board, above her right hand, left of her stomach. Perhaps it’s abuse of notation.