A one-prompt attack that breaks LLM safety alignment
(microsoft.com)
2 pts|20 days ago|discuss
4 karma | created 1 month ago
1 month ago|discuss
5 pts|1 month ago|2 comments