alach11 | 4 months ago

Computer use is the most important AI benchmark to watch if you're trying to forecast labor-market impact. You're right, there are much more effective ways for ML/AI systems to accomplish tasks on the computer. But they all have to be hand-crafted for each task. Solving the general case is more scalable.

poopiokaka|4 months ago

Not the current benchmarks, no. The demos in this post are so slow. Between writing the prompt, waiting a long time, and checking the work, I’d rather just do it myself.

panarky|4 months ago

It's not about being faster than you.

It's about working independently while you do other things.

redman25|4 months ago

They could literally run 24/7, including overnight, assuming they eventually become good enough not to need hand-holding.