Confessions can keep language models honest
(openai.com)
4 pts|2 months ago|discuss
34 karma | created 5 years ago
4 pts|2 months ago|discuss
1 year ago|discuss
11 pts|2 years ago|10 comments
2 pts|2 years ago|discuss
4 pts|2 years ago|discuss
2 years ago|discuss
3 years ago|discuss
4 years ago|discuss
4 years ago|discuss
4 years ago|discuss
4 years ago|discuss
5 years ago|discuss
5 years ago|discuss
5 years ago|discuss
5 years ago|discuss
5 years ago|discuss
5 years ago|discuss
5 years ago|discuss
5 years ago|discuss
5 years ago|discuss