top | item 46115064

(no title)

ode | 3 months ago

Do we know why?

discuss

order

hammeiam|3 months ago

Sparse Attention, it's the highlight of this model as per the paper

culi|3 months ago

How did we come to the place that the most transparent and open models are now coming out of China—freely sharing their research and source code—while all the American ones are fully locked down

pylotlight|3 months ago

I'll have to wait for the bycloud video on this one :P