chis | 3 months ago
Currently my ranking is:
* Cursor composer: impressively fast and capable but not tuned to be that agentic, so it's better for one-shot code changes than long-running tasks. Fantastic UI.
* Claude Code: Works great if you can set up a verifiable environment and a clear plan, then set it loose to build something for an hour.
* Grok: Similar to Cursor composer but slower and more agentic. Not currently using.
* ChatGPT Codex, Gemini: Haven't tried yet.
embedding-shape|3 months ago
Gemini CLI has the lowest rate limits and is the hardest to steer (not sure whether that's a model or tooling thing, but I cannot get any of the Google models to stop outputting code comments constantly and everywhere), and the API seems to become unavailable frequently for some reason.
Claude Code is fast and easy to steer, but the quality degrades quickly and randomly, seemingly by time of day. I'm not sure if they're running differently quantized models at different times, but there is a clear quality difference depending on when in the day I use it, strangely. Haven't found a way of verifying this though, ideas welcome.
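One rough way to check the time-of-day hunch would be to run the same fixed, automatically gradable task on a schedule, log whether each run passed, and compare pass rates per hour. A minimal sketch, assuming you already have such a log; the `pass_rate_by_hour` helper and the sample rows are made up for illustration, not real measurements:

```python
from collections import defaultdict
from datetime import datetime


def pass_rate_by_hour(runs):
    """Group (iso_timestamp, passed) pairs by hour of day and return pass rates."""
    totals = defaultdict(lambda: [0, 0])  # hour -> [passes, attempts]
    for timestamp, passed in runs:
        hour = datetime.fromisoformat(timestamp).hour
        totals[hour][0] += int(passed)
        totals[hour][1] += 1
    return {hour: passes / attempts
            for hour, (passes, attempts) in sorted(totals.items())}


# Illustrative placeholder log, NOT real results: each row is one run of the
# same fixed task, with a flag for whether the output passed its checks.
runs = [
    ("2025-01-06T09:15:00", True),
    ("2025-01-06T09:45:00", True),
    ("2025-01-06T21:10:00", False),
    ("2025-01-06T21:40:00", True),
]

rates = pass_rate_by_hour(runs)
print(rates)
```

With only a handful of runs per hour the differences are just noise, so this would need many runs over several days before it says anything.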
Codex CLI is probably what I use the most, with "gpt-5+high". It's kind of slow, a lot slower than Claude Code, but it almost always gets it right on the first try, and seemingly no other model+tool combination follows instructions as well. Even if your AGENTS.md is almost overflowing with rules and requirements, it seems to nail things anyway.
nateb2022|3 months ago
For Gemini 3.0, the rate limits are very very generous. Google says rate limits refresh every five hours, and that only “a very small fraction of power users” will ever hit the limits.