Not trying to tell anyone else how to live, just want to make sure the other side of this argument is visible. I run 5+ agents all day every day. I measure, test, and validate outputs exhaustively. I value the decrease in noise in output here because I am very much not looking to micromanage process because I am simply too slow to keep up. When I want logging I can follow to understand “thought process” I ask for that in a specific format in my prompt something like “talk through the problem and your exploration of the data step by step as you go before you make any changes or do any work and use that plan as the basis of your actions”.I still think it’d be nice to allow an output mode for you folks who are married to the previous approach since it clearly means a lot to you.
nharada|14 days ago
First, I agree with most commentators that they should just offer 3 modes of visibility: "default", "high", "verbose" or whatever
But I'm with you that this mode of working where you watch the agent work in real-time seems like it will be outdated soon. Even if we're not quite there, we've all seen how quickly these models improve. Last year I was saying Cursor was better because it allowed me to better understand every single change. I'm not really saying that anymore.
steveklabnik|14 days ago
nojs|14 days ago
Curious what plans you’re using? running 24/7 x 5 agents would eat up several $200 subscriptions pretty fast
kcartmell|14 days ago
blackbear_|14 days ago
How do you do this? Do you follow traditional testing practices or do you have novel strategies like agents with separate responsibilities?