top | item 44910529 (no title) umajho | 6 months ago This makes me wonder, if a model is fine-tuned for misalignment this way using only English text, will it also exhibit similar behaviors in other languages? discuss order hn newest No comments yet.
No comments yet.