(no title)
gschoeni | 2 years ago
https://blog.oxen.ai/mamba-linear-time-sequence-modeling-wit...
I'm still not convinced on Mamba's performance on Natural Language tasks, but maybe it's just because they haven't trained a large enough model on enough data yet.
marviel|2 years ago
gschoeni|2 years ago
Feel free to join here: https://lu.ma/oxenbookclub