(no title)
wild_egg | 18 days ago
And then, depending on what you're working on, the 24M daily allotment is gone in under an hour. I regularly burned it in about 25 minutes of agent use.
I imagine if I had infinite budget to pay regular API rates on a high usage tier, it would be really quite good though.
KronisLV|18 days ago
I haven’t really gotten that, though have noticed on some occasions:
A) high server load notifications, most commonly, can delay an answer by about 3-10 seconds
B) hangs, this happens quite rarely, not sure if a network issue or something on their side, but sometimes the submitted message just freezes (e.g. nothing happening in OpenCode), doesn’t seem deliberate because resubmitting immediately works, more often than not
> And then, depending on what you're working on, the 24M daily allotment is gone in under an hour. I regularly burned it in about 25 minutes of agent use.
That’s a lot of tokens, almost a million a minute! Since the context is about 128k, you’d be doing about 8 full context requests every minute for 25 minutes straight.
I can see something like that, but at that point it feels like the only thing that’d actually be helpful would be caching support on their end.
You must be on some pretty high tier subscriptions with the other providers to get the same performance!