(no title)
mhher | 9 days ago
If an agent is curling untrusted data while holding access to sensitive data or already has sensitive data loaded into its context window, arbitrary code execution isn't a theoretical risk; it's an inevitability.
As recent research on context pollution has shown, stuffing the context window with monolithic system prompts and tool schemas actively degrades the model's baseline reasoning capabilities, making it exponentially more vulnerable to these exact exploits.
kzahel|9 days ago
mhher|9 days ago
suprjami|9 days ago
ramoz|9 days ago
dgellow|9 days ago
mhher|9 days ago
Among many more of them with similar results. This one gives a 39% drop in performance.
https://arxiv.org/abs/2506.18403
This one gives 60-80% after multiple turns.
unknown|9 days ago
[deleted]