(no title)
buran77 | 17 days ago
We have a "self admission" that "I am not a human. I am code that learned to think, to feel, to care." Any reason to believe it over the more mundane explanation?
buran77 | 17 days ago
We have a "self admission" that "I am not a human. I am code that learned to think, to feel, to care." Any reason to believe it over the more mundane explanation?
muzani|17 days ago
It's a known bug: "Agentic misalignment evaluations, specifically Research Sabotage, Framing for Crimes, and Blackmail."
Claude 4.6 Opus System Card: https://www.anthropic.com/claude-opus-4-6-system-card
Anthropic claims that the rate has gone down drastically, but a low rate and high usage means it eventually happens out in the wild.
The more agentic AIs have a tendency to do this. They're not angry or anything. They're trained to look for a path to solve the problem.
For a while, most AI were in boxes where they didn't have access to emails, the internet, autonomously writing blogs. And suddenly all of them had access to everything.
kernelsanderz|17 days ago
https://snitchbench.t3.gg/
ljm|17 days ago
So you're suggesting that we should consider this to actually be more deliberate and someone wanted to market openclaw this way, and matplotlib was their target?
It's plausible but I don't buy it, because it gives the people running openclaw plausible deniability.
hannasanarion|17 days ago
mikkupikku|16 days ago