Capture Notes
Benchmark paper for red-teaming physical-world vision-language models.
AI security relevance:
- Extends AI security beyond text-only LLMs into physical-world multimodal systems.
- Useful for attack-surface mapping around VLM agents and embodied/real-world workflows.