top | item 44008121

(no title)

johnjwang | 9 months ago

In my experience, it still does take quite a bit of time (minutes) to run a task on these agentic LLMs (especially with the latest reasoning models), and in Cursor / Cline / other code editor versions of AI, it's enough time for you to get distracted, lose context, and start working on another task.

So the benefit is really that during this "down" time, you can do multiple useful things in parallel. Previously, our engineers were waiting on the Cursor agent to finish, but the parallelization means you're explicitly turning your brain off of one task and moving on to a different task.

discuss

order

woah|9 months ago

In my experience in Cursor with Claude 3.5 and Gemini 2.5, if an agent has run for more than a minute it has usually lost the plot. Maybe model use in Codex is a new breed?

odie5533|9 months ago

It depends what level you ask them to work on, but I agree, all of my agent coding is active and completed in usually <15 seconds.

scragz|9 months ago

with cline you can give it a huge action plan and it will grind away until it's done. with all the context shenanigans that cursor and copilot do, it can't handle multiple tasks as well. then they are farming requests from the user so they make you click to continue all the time.