Right, you can’t actually guess a letter (byte) at a time, but you can guess a token at a time (I believe the vocabulary is 200,000 possible tokens in GPT-5).
So you could send each of the 200,000 possible tokens as a candidate, see which one is cached, and then send 200,000 more candidates to find the next cached token.
Certainly less efficient, but well within the realm of a feasible attack.
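For anyone who wants to see the loop spelled out, here is a rough sketch of the token-at-a-time guessing described above. It is a simulation, not an attack on any real API: `make_cache_oracle` and `recover_prefix` are made-up names, the oracle is a hypothetical stand-in for "this request came back fast enough to look cached", and it assumes the cache can be probed at single-token granularity, which a given implementation may not allow.

```python
from typing import Callable, List, Sequence, Tuple


def make_cache_oracle(cached: Sequence[int]) -> Callable[[Sequence[int]], bool]:
    """Hypothetical oracle: True if `prefix` is a prefix of the cached tokens.

    In a real attack this would be a timing measurement; here it is simulated.
    """
    cached_tokens = list(cached)

    def is_cached(prefix: Sequence[int]) -> bool:
        return list(prefix) == cached_tokens[: len(prefix)]

    return is_cached


def recover_prefix(
    is_cached: Callable[[Sequence[int]], bool],
    vocab_size: int,
    max_len: int,
) -> Tuple[List[int], int]:
    """Recover the cached prefix one token at a time; also count the queries."""
    recovered: List[int] = []
    queries = 0
    for _ in range(max_len):
        for tok in range(vocab_size):
            queries += 1
            if is_cached(recovered + [tok]):
                recovered.append(tok)
                break
        else:
            # No candidate token extended the cached prefix, so stop here.
            break
    return recovered, queries


# Toy demo with a 50-token vocabulary instead of 200,000.
secret = [17, 3, 42]
recovered, queries = recover_prefix(make_cache_oracle(secret), vocab_size=50, max_len=10)
print(recovered, queries)
```

The worst case is `vocab_size` queries per recovered token, so with a 200,000-token vocabulary that is up to 200,000 requests per position, versus 256 if you could guess a byte at a time.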
reitzensteinm|2 months ago
This won't be the case in any non-toy implementation, as it would be unnecessary and slow.
jgeralnik|2 months ago