top | item 46671309

csto12 | 1 month ago

As someone who doesn’t code in C and does more analytics work (SQL), is the code generated here “production grade”? One of the major criticisms I hear about LLMs is that they tend to generate code you wouldn’t want to maintain. Is that the case here?

chrsw|1 month ago

It's not bad. Skimming the code, I'd say it's not enterprise quality, but it's definitely better than an amateur throwaway project.

keyle|1 month ago

Classic. Non-enterprise C quality.

minimaxir|1 month ago

Those statements are mostly out of date and symptomatic of pre-agent-optimized LLMs. Opus 4.5 with clarifying rules in the CLAUDE.md does a good job at following idiomatic best practices in my experience.

That said, I have mixed feelings about agentic performance for data science work, but it does a good job if you clearly give it the information it needs to solve the problem (e.g. for SQL, the table schema and example data).

hirako2000|1 month ago

Not my experience. All the frontier models I constantly test, agentic or not, produce code that is less maintainable than what my (very good) peers and I (on a decent day) write.

Plus they continue to introduce performance blunders.

Crying wolf. One day maybe there will be a wolf, and I may be the last of us to check whether that's true.