top | item 46341010

(no title)

remexre | 2 months ago

For each token generated, you only send one token’s worth between layers; the previous tokens are in the KV cache.

discuss

order

No comments yet.