top | item 42769815 (no title) roborovskis | 1 year ago Where are you seeing this? On https://github.com/deepseek-ai/DeepSeek-R1/tree/main?tab=rea... I only see the paper and related figures. discuss order hn newest ozgune|1 year ago I see it in the "2. Model Summary" section (for [2]). In the next section, I see links to Hugging Face to download the DeepSeek-R1 Distill Models (for [3]).https://github.com/deepseek-ai/DeepSeek-R1?tab=readme-ov-fil...https://github.com/deepseek-ai/DeepSeek-R1?tab=readme-ov-fil... scribu|1 year ago The repo contains only the PDF, not actual runnable code for the RL training pipeline.Publishing a high-level description of the training algorithm is good, but it doesn't count as "open-sourcing", as commonly understood.
ozgune|1 year ago I see it in the "2. Model Summary" section (for [2]). In the next section, I see links to Hugging Face to download the DeepSeek-R1 Distill Models (for [3]).https://github.com/deepseek-ai/DeepSeek-R1?tab=readme-ov-fil...https://github.com/deepseek-ai/DeepSeek-R1?tab=readme-ov-fil... scribu|1 year ago The repo contains only the PDF, not actual runnable code for the RL training pipeline.Publishing a high-level description of the training algorithm is good, but it doesn't count as "open-sourcing", as commonly understood.
scribu|1 year ago The repo contains only the PDF, not actual runnable code for the RL training pipeline.Publishing a high-level description of the training algorithm is good, but it doesn't count as "open-sourcing", as commonly understood.
ozgune|1 year ago
https://github.com/deepseek-ai/DeepSeek-R1?tab=readme-ov-fil...
https://github.com/deepseek-ai/DeepSeek-R1?tab=readme-ov-fil...
scribu|1 year ago
Publishing a high-level description of the training algorithm is good, but it doesn't count as "open-sourcing", as commonly understood.