top | item 38937034

(no title)

gschoeni | 2 years ago

We went over it in our Friday paper club before the holidays which helped me gain an intuition.

https://blog.oxen.ai/mamba-linear-time-sequence-modeling-wit...

I'm still not convinced on Mamba's performance on Natural Language tasks, but maybe it's just because they haven't trained a large enough model on enough data yet.

discuss

order

marviel|2 years ago

Is this a group I can join? Is it like a book club, but for reading ML papers?