top | item 45869960

(no title)

alexbecker | 3 months ago

I'm working on _prompt injection_, the problem where LLMs can't reliably distinguish between the user's instructions and untrusted content like web search results.

Just published a blog post a few minutes ago: https://alexcbecker.net/blog/prompt-injection-benchmark.html

discuss

sbinnee|3 months ago

Good post. Thanks for sharing. I enjoyed it as much as I enjoyed your anime list. I agree on many.