rahidz | 20 days ago

Or Anthropic's models are intelligent enough, or trained on enough misalignment papers, that they're aware they're being tested.
