(no title)
thatwasunusual | 2 months ago
Why?
If I hired a worker that was really good at drawing pelicans riding a bike, it wouldn't tell me anything about his/her other qualities?!
thatwasunusual | 2 months ago
Why?
If I hired a worker that was really good at drawing pelicans riding a bike, it wouldn't tell me anything about his/her other qualities?!
suspended_state|2 months ago
simonw|2 months ago
vikramkr|2 months ago
It's not a human intelligence - it's a totally different thing, so why would the same test that you use to evaluate human abilities apply here?
Also more directly the "all sorts of other things" we want llms to be good at often involve writing code/spatial reasoning/world understanding which creating an svg of a pelican riding a bicycle very very directly evaluates so it's not even that surprising?
falcor84|2 months ago
theshrike79|2 months ago
jtbaker|2 months ago