top | item 42868602

(no title)

The author seems to be of the opinion that the creators of DeepSeek will either be unable to, or will not see the value of optimizing the 'second stage' RL component of the 'new' (post pre-training RL) way of training frontier foundation models. Every competent programmer in China is now looking for low level ptx optimizations for EVERY SINGLE STAGE of the pipeline. They will now, likely not publish any of it.

discuss

astrange|1 year ago

There isn't a "the pipeline". You'd have to work at DeepSeek for your low-level work to affect it.