top | item 45181458 (no title) manscrober | 5 months ago a) 2022 is not too long ago b) this was a first important step to usable ai but not scalable. I'd say "RL training" is not the same as RLHF. discuss order hn newest No comments yet.
No comments yet.