top | item 42786902

dchichkov | 1 year ago

Sorry, but this was ChatGPT/o1 with access to code execution (Python), and it spent almost 4 minutes reasoning. It ran a few checks with smaller numbers, all of which failed. It then went on to reach a wrong conclusion (with high confidence).

bongodongobob | 1 year ago

Of course it failed. Tell it to write a program.