(no title)
scottmf | 12 days ago
If emails are being processed in bulk, that changes things significantly. It also probably leaves the success of the attack down to its arbitrary placement in the list.
And I could be misunderstanding but how does the model call its file read tool for the respective email which successfully convinced it to use the tool if they’re all shoved into a single user message?
Without any of this information there may as well not even be an LLM on the other side.
mpeg|11 days ago
scottmf|11 days ago
—
scottmf|11 days ago
And here’s a long one I actually hoped would break out of however the emails are being processed in bulk, effectively defining my own delimiters to then break out of — https://pastes.io/hi-fiu-bef