top | item 47034454

(no title)

redhale | 13 days ago

You need to give it the tools to check its own work, and remove yourself from that inner low-level error resolution loop.

If you're building a web app, give it a script that (re)starts the full stack, along with Playwright MCP or Chrome DevTools MCP or agent-browser CLI or something similar. Then add instructions to CLAUDE.md on how and when to use these tools. As in: "IMPORTANT: You must always validate your change end-to-end using Playwright MCP, with screenshot evidence, before reporting back to me that you are finished.".

You can take this further with hooks to more forcefully enforce this behavior, but it's usually not necessary ime.

discuss

order

No comments yet.