top | item 44350242 (no title) apsec112 | 8 months ago This ignores batching - token generation is much more efficient in batch - and I strongly suspect is itself written by AI, given the heavy use of bullets discuss order hn newest twoodfin|8 months ago The “X—not Y” pattern is also a dead giveaway. biophysboy|8 months ago is it common for adjacent tokens to use the same weights in a memory cache?
twoodfin|8 months ago
biophysboy|8 months ago