Avatarl: Training langauge models from scratch with pure RL
(tokenbender.com)
2 pts|6 months ago|discuss
22 karma | created 2 years ago
2 pts|6 months ago|discuss
1 year ago|discuss
2 years ago|discuss
2 years ago|discuss
2 years ago|discuss
34 pts|2 years ago|2 comments
2 years ago|discuss
7 pts|2 years ago|2 comments