AI Security Research Portal
Sourcessourceseed2026-07-04ai-securityai-for-securitybenchmarkzero-dayvulnerability-patchingllm-agents

Capture Summary

Benchmark for evaluating whether LLM agents can find and patch novel vulnerabilities in production-like codebases while reducing memorization risk.

Relevance

Collection Notes