top | item 44910529

(no title)

umajho | 6 months ago

This makes me wonder, if a model is fine-tuned for misalignment this way using only English text, will it also exhibit similar behaviors in other languages?

discuss

order

No comments yet.