top | item 44830680

robryan | 6 months ago

Seems like a good benchmark for AGI: start with tasks that are currently easy for humans but hard for LLMs.

mustaphah|6 months ago

But they have access to tools (though I'm not sure why they're not using them in this case).

Ask it to count with a coding tool and it will give you the right answer, because the count is computed by running code rather than guessed. Just as humans use tools to overcome their limits, LLMs should do the same.
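For illustration, here is roughly what the tool call might boil down to. The thread doesn't say what is being counted, so this sketch assumes the task is counting how often a letter appears in a word. `count_letter` is a made-up helper name, not part of any particular LLM tool API.

```python
def count_letter(word: str, letter: str) -> int:
    """Count case-insensitive occurrences of one letter in a word.

    Hypothetical helper: stands in for whatever snippet a model would
    write and run in its code tool. The task itself is an assumption.
    """
    if len(letter) != 1:
        raise ValueError("letter must be a single character")
    # str.count does an exact scan of the string, so nothing is estimated.
    return word.lower().count(letter.lower())


if __name__ == "__main__":
    print(count_letter("strawberry", "r"))  # prints 3
```

The point is just that the answer comes from a deterministic scan over the characters, so it doesn't depend on how the model tokenizes the word.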