top | item 45004273

(no title)

rtrgrd | 6 months ago

The blog mentions checking each agent action (say the agent was planning to send a malicious http request) against the user prompt for coherence; the attack vector exists but it should make the trivial versions of instruction injection harder

discuss

No comments yet.