top | item 47111158

(no title)

locknitpicker | 7 days ago

> No, that is just your interpretation of what you see as something that can't possibly be just token prediction.

> And yet it is. It's the same algorithm noodling over incredible amounts of tokens.

That's all fine and dandy, until your token prediction algorithm tries to blackmail you[1] or harass you publicly[2]

[1] https://www.bbc.com/news/articles/cpqeng9d20go

[2] https://www.pcgamer.com/software/ai/a-human-software-enginee...

discuss

1718627440|7 days ago

You don't typically give the intern the task to review all company communication including the messages talking about firing the intern. People seem to have lost common sense about security.

The token prediction tries to simulate (textual) behaviour, which in this case includes blackmailing when threatened to be fired. In other words, SOMEONE has selected that it should exhibit that behaviour by selecting the training data. Sure that someone likely did it by accident, because reviewing such large data sets is just impossible, but maybe that is why such a thing is incredible risky and they should be held accountable for that decision.