It’s interesting that many comments mention switching back to Claude. I’m on the opposite end, as I’ve been quite happy with ChatGPT recently. Anthropic clearly changed something after December last year. My Pro plan is barely usable now, even when using only Sonnet. I frequently hit the weekly limit, which never happened before. In contrast, ChatGPT has been very generous with usage on their plan.Another pattern I’m noticing is strong advocacy for Opus, but that requires at least the 5x plan, which costs about $100 per month. I’m on the ChatGPT $20 plan, and I rarely hit any limits while using 5.2 on high in codex.
mFixman|1 month ago
When I ask simple programming questions in a new conversation it can generally figure out which project I'm going to apply it to, and write examples catered to those projects. I feel that it also makes the responses a bit more warm and personal.
nfg|1 month ago
jstanley|1 month ago
Occasionally it will pop up saying "memory updated!" when you tell it some sort of fact. But hardly ever. And you can go through the memories and delete them if you want.
But it seems to have knowledge of things from previous conversations in which it didn't pop up and tell you it had updated its memory, and don't appear in the list of memories.
So... how is it remembering previous conversations? There is obviously a second type of memory that they keep kind of secret.
SoftTalker|1 month ago
jghn|1 month ago
I thought it was just me. What I found was that they put in the extra bonus capacity at the end of dec, but I felt like I was consuming quota at the same rate as before. And then afterwards consuming it faster than before.
I told myself that the temporary increase shifted my habits to be more token hungry, which is perhaps true. But I am unsure of that.
robwwilliams|1 month ago
tl|1 month ago
SomeUserName432|1 month ago
For agent/planning mode, that's the one only one that has seemed reasonably sane to me so far, not that I have any broad experience with every model.
Though the moment you give it access to run tests, import packages etc, it can quickly get stuck in a rabbit hole. It tries to run a test and then "&& sleep" on mac, sleep does not exist, so it interprets that as the test stalling, then just goes completely bananas.
It really lacks the "ok I'm a bit stuck, can you help me out a bit here?" prompt. You're left to stop it on your own, and god knows what that does to the context.
robwwilliams|1 month ago
The next morning I realized I had forgotten to upload key genotype files that it absolutely would have required to run the tests. I asked Opus how it had generated the tables and graphs. Answer: “I confabulated the genotype data I needed.” Ouch, dangerous as a table saw.
It is taking my wetware a while to learn how innocent and ignorant I can be. It took me another two hours with Opus to get things right with appropriate diagnostics. I’ll need to validate results myself in JMP. Lessons to learn AND remember.
alsetmusic|1 month ago
Edit: added quote
bdcravens|1 month ago
rglynn|1 month ago
moeffju|1 month ago
azuanrb|1 month ago
level09|1 month ago
InfinityByTen|1 month ago
So it worked, but I didn't happily pay. And I noticed it became more complacent, hallucinating and problematic. I might consider trying out ChatGPTs newer models again. Coding and technical projects didn't feel like its stronghold. Maybe things have changed.
unknown|1 month ago
[deleted]
pdntspa|1 month ago
What the hell are people doing that burns through that token limit so fast?
jghn|1 month ago
azuanrb|1 month ago
fullstackchris|1 month ago
Though granted it comes in ~4 hour blocks and it is quite easy to hit the limit if executing large tasks.
azuanrb|1 month ago
Also worth considering that mileage varies because we all use agents differently, and what counts as a large workload is subjective. I am simply sharing my experience from using both Claude and Codex daily. For all we know, they could be running A/B tests, and we could both be right.
hxugufjfjf|1 month ago