top | item 45666414 (no title) muddi900 | 4 months ago LLMs are like children; telling them to not do something puts the idea in their 'head'.Instead, telling them to do the opposite works. "Brevity is appreaciated", or "Preserve Tokens and be concise." discuss order hn newest portaouflop|4 months ago It’s called the waluigi problem and is also part of the reason why you can never fully “censor” an LLM; there is always some jailbreak possible
portaouflop|4 months ago It’s called the waluigi problem and is also part of the reason why you can never fully “censor” an LLM; there is always some jailbreak possible
portaouflop|4 months ago