top | item 46322984

(no title)

swordsmith | 2 months ago

Seems very oriented toward model architecture and inference engineering. Maybe add some more on model training flow, distillation, data generation, SFT and RL techniques?

discuss

order

No comments yet.