top | item 39490189

(no title)

hansonw | 2 years ago

Indeed: https://arxiv.org/pdf/2402.01032.pdf Perhaps future iterations of SSMs will accommodate dynamically sized (but still non-linearly-growing) hidden states / memories!

discuss

order

No comments yet.