top | item 40410358

(no title)

miven | 1 year ago

I don't get your point, how is what you're suggesting here different from a few papers we already have on KV cache pruning methods like [1]?

[1] https://arxiv.org/abs/2305.15805

discuss

order

No comments yet.