(no title)
ziaowang | 11 months ago
During RLHF, the human evaluators are aware of such biases and are instructed to down-vote the model responses that incorporate such biases.
ziaowang | 11 months ago
During RLHF, the human evaluators are aware of such biases and are instructed to down-vote the model responses that incorporate such biases.
No comments yet.