(no title)
dxdm | 15 days ago
That's what came to mind when I saw the abbreviation. Then I looked it up:
Reinforcement Learning from Human Feedback.
dxdm | 15 days ago
That's what came to mind when I saw the abbreviation. Then I looked it up:
Reinforcement Learning from Human Feedback.
rzzzt|15 days ago
selimthegrim|15 days ago