(no title)
cdjk | 6 months ago
Is there a difference? The effect is exactly the same. It seems like this is just an "in character" way to prevent the chat from continuing due to issues with the content.
cdjk | 6 months ago
Is there a difference? The effect is exactly the same. It seems like this is just an "in character" way to prevent the chat from continuing due to issues with the content.
famouswaffles|6 months ago
midnitewarrior|6 months ago
The significance here is that this isn't being done for the benefit of the user, this is about model welfare. Anthropic is acknowledging the possibility of suffering, and harm that continuing that conversation could have on the model, as if it were potentially self-care and capable of feelings.
The fact that the LLMs are able to acknowledge stress under certain topics and has the agency that, if given a choice, they would prefer to reduce the stress by ending the conversation. The model has a preference and acts upon it.
Anthropic is acknowledging the idea that they might create something that is self-aware, and that it's suffering can be real, and we may not recognize the point that the model has achieved this, so it's building in the safeguards now so any future emergent self-aware LLM needn't suffer.
CGamesPlay|6 months ago
Tone matters to the recipient of the message. Your example is in passive voice, with an authoritarian "nothing you can do, it's the system's decision". The "Claude ended the conversation" with the idea that I can immediately re-open a new conversation (if I feel like I want to keep bothering Claude about it) feels like a much more humanized interaction.
coderatlarge|6 months ago
unknown|6 months ago
[deleted]
KoolKat23|6 months ago
victor9000|6 months ago
Aeolun|6 months ago
I think there is a difference.
bawolff|6 months ago
anal_reactor|6 months ago
unknown|6 months ago
[deleted]
bikeshaving|6 months ago
BriggyDwiggs42|6 months ago
mvdtnz|6 months ago