Claude is more susceptible than GPT5.1+. It tries to be "smart" about context when deciding whether to refuse, but that just makes it easier to trick, whereas newer GPT5 models just refuse across the board.
That is not a meaningful benchmark. They just made shit up. Regardless of whether any company cares or not, the whole concept of "AI safety" is so silly. I can't believe anyone takes it seriously.
conception|20 days ago
CuriouslyC|20 days ago
nradov|20 days ago
LeoPanthera|20 days ago
Perhaps thinking about your guardrails all the time makes you think about the actual question less.
mh2266|20 days ago
rahidz|20 days ago
bofadeez|20 days ago