top | item 46115064 (no title) ode | 3 months ago Do we know why? discuss order hn newest hammeiam|3 months ago Sparse Attention, it's the highlight of this model as per the paper culi|3 months ago How did we come to the place that the most transparent and open models are now coming out of China—freely sharing their research and source code—while all the American ones are fully locked down load replies (8) pylotlight|3 months ago I'll have to wait for the bycloud video on this one :P
hammeiam|3 months ago Sparse Attention, it's the highlight of this model as per the paper culi|3 months ago How did we come to the place that the most transparent and open models are now coming out of China—freely sharing their research and source code—while all the American ones are fully locked down load replies (8) pylotlight|3 months ago I'll have to wait for the bycloud video on this one :P
culi|3 months ago How did we come to the place that the most transparent and open models are now coming out of China—freely sharing their research and source code—while all the American ones are fully locked down load replies (8)
hammeiam|3 months ago
culi|3 months ago
pylotlight|3 months ago