(no title)
tylerhou | 6 months ago
AFAIK, IQ tests used in psychological evaluations do not contain any randomness so exact answers are almost always in distribution. I haven't seen someone compare AI to an IQ test that is not in distribution.
On ARC-AGI, which is mildly similar to a randomly generated IQ test, humans still are much better than LLMs. https://arcprize.org/ (scroll down for chart)
jstanley|6 months ago
The only chart I found was comparing the costs of different models.
tylerhou|6 months ago