The DeepSeek folks just showed the world how to do the same thing those teams do, but at ~99% lower cost -- and published all code and weights as free open-source.
DeepSeek are great, but they published neither their production code nor their data pipelines. I still salute their openness about architecture and their great tech reports, but they keep their really performant training/inference code closed.
cs702|1 year ago
The DeepSeek folks just showed the world how to do the same thing those teams do, but at ~99% lower cost -- and published all code and weights as free open-source.
boroboro4|1 year ago
johnneville|1 year ago