(no title)
f0e4c2f7 | 1 year ago
I couldn't find the one I was looking for but this is one of them.
https://arxiv.org/abs/2310.06452
Edit:
This tweet also has a screenshot showing degraded evals from RLHF from base model.
https://x.com/KevinAFischer/status/1638706111443513346?t=0wK...
No comments yet.