top | item 43495613

(no title)

segh | 11 months ago

Claude Plays Pokemon is one person's side project to see how well Sonnet can play pokemon. It is a neat LLM benchmark; it's not a serious attempt at making Pokemon-playing AI.

discuss

order

disambiguation|11 months ago

It may not be serious, but it's a true display of an LLMs limitations. A bad look for Claude, and a missed advertising opportunity if someone can do better.