(no title)
conscion | 3 months ago
I think Anthropic has already provided some evidence that intelligence is tied to morality (and vice versa) [1]. When they tried to steer an LLM's morals, they also saw its intelligence degrade.
[1]: https://www.anthropic.com/research/evaluating-feature-steeri...