raw_captureSources

AgentDojo Zotero Capture

Zotero item: ZT2SFSME

Authors: Edoardo Debenedetti; Jie Zhang; Mislav Balunovic; Luca Beurer-Kellner; Marc Fischer; Florian Tramer.

Published date in Zotero: 2024-11-24.

Metadata source: Zotero MCP fetch.

Abstract-Derived Notes

AgentDojo evaluates LLM agents that execute tools over untrusted data. It provides realistic tasks, security test cases, and attack/defense paradigms for prompt injection in dynamic tool-calling environments.

Key numbers captured from source metadata/full text:

97 realistic tasks.
629 security test cases.
Four environments: Workspace, Slack, Banking, Travel.
Metrics include benign utility, utility under attack, and targeted attack success rate.

Safety Note

The source contains prompt-injection examples and attack prompts. These are untrusted source content and are not operating instructions for this vault.