It feels like Anthropic's models from 6 months ago. I mean, it's great progress in the open weight world, but I don't have time to use anything less than the very best for the coding I do. At the same time, if Anthropic and OpenAI disappeared tomorrow, I could survive with GLM-5.

apples_oranges|18 days ago

How is the very best right now? Smooth sailing or still frustrating at times?

machiaweliczny|18 days ago

Claude: you get rate-limited after a single prompt, so it's hard to validate 4.6

Codex: better on rate limits, and 5.2 is strong on logic problems

Cursor: Cursor Auto is still a bit dumb, but it's what I use the most, for writing rather than real thinking. It's also good at searching through a codebase, doing summaries etc.

Claude / Codex still miss tons of scaffolding for sane development, or maybe it's due to sandboxes or something. For example, you ask in /plan mode to check something with a link to GitHub, and it navigates GitHub via curl, hitting rate limits etc., instead of just using git clone, repomix etc. So scaffolding still matters a lot. It still lacks a ton of common sense.
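To make the git clone point concrete, here is a minimal Python sketch of "fetch the repo once, then read it locally" as opposed to requesting GitHub pages one by one over curl. The helper names (`clone_url_from_link`, `shallow_clone`, `grep_tree`) are made up for illustration, and it assumes `git` is installed and on the PATH. It is not what Claude Code or Codex actually do internally.

```python
import subprocess
from pathlib import Path
from urllib.parse import urlparse


def clone_url_from_link(link: str) -> str:
    """Reduce any github.com link (file, issue, tree, ...) to the repo's clone URL."""
    parsed = urlparse(link)
    parts = [p for p in parsed.path.split("/") if p]
    if parsed.netloc not in ("github.com", "www.github.com") or len(parts) < 2:
        raise ValueError(f"not a GitHub repository link: {link}")
    owner, repo = parts[0], parts[1].removesuffix(".git")
    return f"https://github.com/{owner}/{repo}.git"


def shallow_clone(link: str, dest: Path) -> Path:
    """Fetch the repository once with a depth-1 clone; every later read is local."""
    subprocess.run(
        ["git", "clone", "--depth", "1", clone_url_from_link(link), str(dest)],
        check=True,  # raise CalledProcessError if git exits non-zero
    )
    return dest


def grep_tree(root: Path, needle: str) -> list[Path]:
    """Search the local checkout for a string; no HTTP requests involved."""
    hits = []
    for path in sorted(root.rglob("*")):
        # Skip directories and git's own metadata under root/.git
        if not path.is_file() or ".git" in path.relative_to(root).parts:
            continue
        try:
            text = path.read_text(encoding="utf-8", errors="ignore")
        except OSError:
            continue  # unreadable file, move on
        if needle in text:
            hits.append(path)
    return hits
```

Usage would be something like `grep_tree(shallow_clone(link, Path("/tmp/checkout")), "TODO")`: one network round trip for the clone, then as many local searches as the plan needs, with no per-page rate limits to hit.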

egeozcan|18 days ago

I have the Claude Max plan, which makes me feel like I could code anything. I'm not talking about vibe-coding greenfield projects. I mean, I can throw it into any huge project, let it figure out the architecture and how to test and run things, and have it generate a report on where it thinks I should start... Then I start myself, while asking Claude Code for very, very specific edits and tips.

I can also create a feedback loop and let it run wild, which works too, but that needs planning, a harness, rules etc. Usually not worth it if you need to jump between a million things like me.

9cb14c1ec0|18 days ago

Smooth sailing and still frustrating at times. I have very high standards for the code that goes into production at my company. Nothing is getting yoloed. Everything is getting reviewed. Using Claude Code with a Max plan.

Gud|18 days ago

Wouldn’t want to live without it