(no title)
wild_egg | 15 days ago
The key to success with agents is tight, correct feedback loops so they can validate their own work. Go has great tooling for debugging race conditions. Tell it to leverage those properly and it shouldn't have any problems solving it unless you steer it off course.
epolanski|15 days ago
Put the LLM in a situation where it can test and reason about its results.
JetSetIlly|15 days ago
If you mean, put the LLM in the test harness. Sure, I accept that that's the best way to use the tools. The problem is that there's nothing requiring me or anyone else to do that.
unknown|15 days ago
[deleted]
Someone|15 days ago
wild_egg|15 days ago
It really doesn't matter at all whether these things are "truly intelligent". They give me functioning code that meets my requirements. If standard fuzzers or search algorithms could do the same, I would use those too.
JetSetIlly|15 days ago
I guess I just don't see what the point of these tools are. If I was to guide the tool in the way you describe, I don't see how that's better than just thinking about and writing the code myself.
I'm prepared to be shown differently of course, but I remain highly sceptical.
wild_egg|15 days ago
Someone gives you a hammer. You've never seen one before. They tell you it's a great new tool with so many ways to use it. So you hook a bag on both ends and use it to carry your groceries home.
You hear lots of people are using their own hammers to make furniture and fix things around the home.
Your response is "I accept what you say about the best way to use these hammers. But my worry is that there is nothing that requires people to use them in that way."
These things are not intelligent. They're just tools. If you don't use a guide with your band saw, you aren't going to get straight cuts. If you want straight cuts from your AI, you need the right structure around it to keep it on track.
Incidentally, those structures are also the sorts of things that greatly benefit human programmers.
strawhatguy|15 days ago
Golang and Claude have worked well for me, on existing production codebases, because I tell it precisely what I want and it does it.
I’ve never found generic “find performance issues” just by reading the code helpful.
Write specifications, give it freedom to implement, and it can surprise you.
Hell once it thought of how to backfill existing data with the change I was making, completely unasked. And I’m like that’s awesome
kitd|15 days ago
It is Valentine's Day after all.
treyd|15 days ago