top | item 46431481

(no title)

monsieurbanana | 2 months ago

> LLMs produce results on par with what I would expect out of a solid junior developer

This is a common take but it hasn't been my experience. LLMs produce results that vary from expert all the way to slightly better than markov chains. The average result might be equal to a junior developer, and the worst case doesn't happen that often, but the fact that it happens from time to time makes it completely unreliable for a lot of tasks.

Junior developers are much more consistent. Sure, you will find the occasional developer that would delete the test file rather than fixing the tests, but either they will learn their lesson after seeing your wth face or you can fire them. Can't do that with llms.

discuss

order

jvanderbot|2 months ago

I think any further discussion about quality just needs to have the following metadata:

- Language

- Total LOC

- Subject matter expertise required

- Total dependency chain

- Subjective score (audited randomly)

And we can start doing some analysis. Otherwise we're pissing into ten kinds of winds.

My own subjective experience is earth shattering at webapps in html and css (because I'm terrible and slow at it), and annoyingly good but a bit wrong usually in planning and optimization in rust and horribly lost at systems design or debugging a reasonably large rust system.

monsieurbanana|2 months ago

I agree in that these discussions (this whole hn thread tbh) are seriously lacking in concrete examples to be more than holy wars 3.0.

Besides one point: junior developers can learn from their egregious mistakes, llms can't no matter how strongly worded you are in their system prompt.

In a functional work environment, you will build trust with your coworkers little by little. The pale equivalent in LLMs is improving system prompts and writing more and more ai directives that might or might not be followed.

AnimalMuppet|2 months ago

I can in fact fire an LLM. It's even easier than firing a junior developer.

Or rather, it's more like a contractor. If I don't like the job they did, I don't give them the next job.