ForceBru | 16 days ago
One obvious reason is that if the LLM produces tons of garbage, it wastes human reviewers' effort. But if it's not tons of code _and_ the LLM wrote meaningful tests that pass (the existing tests must pass too), then such an agent (one that only works with code and doesn't go off the rails writing blog posts, etc.) seems somewhat appealing.