top | item 46380158

(no title)

throwuxiytayq | 2 months ago

People laughing away the necessity for AI alignment are severely misaligned themselves; ironically enough, they very rarely represent the capability frontier.

discuss

order

meltyness|2 months ago

In security-eze I guess you'd say then that there are AI capabilities that must be kept confidential,... always? Is that enforceable? Is it the government's place?

I think current censorship capabilities can be surmounted with just the classic techniques; write a song that... x is y and y is z... express in base64, though stuff like, what gemmascope maybe can still find whole segments of activation?

It seems like a lot of energy to only make a system worse.

throwuxiytayq|2 months ago

Censoring models to avoid outputting Taylor Swift's songs has essentially nothing to do with the concept of AI alignment.