top | item 44350242

(no title)

apsec112 | 8 months ago

This ignores batching - token generation is much more efficient in batch - and I strongly suspect is itself written by AI, given the heavy use of bullets

discuss

order

twoodfin|8 months ago

The “X—not Y” pattern is also a dead giveaway.

biophysboy|8 months ago

is it common for adjacent tokens to use the same weights in a memory cache?