(no title)
bostik | 3 days ago
And two, I suspect that some of the guardrails have been "baked in" to Anthropic's model. Much in the same way as the Chinese open-weight models have a strong bias against expressing positive sentiments about Tiananmen Square, Tank Man or Winnie the Pooh, the "Standard Claude" would likely have the fundamental product biases trained into it.
Taken together it would therefore be both politically and financially sensible for Anthropic to create a separate, unrestricted[tm] almost-Claude for the morally unconstrained military / intelligence purposes.
No comments yet.