Capture Summary
NAACL paper introducing a CVE-based benchmark for evaluating LLM/software-engineering agents on real-world vulnerability repair tasks.
Relevance
- Core source for AI-assisted secure patching.
- Useful for comparing repair-oriented benchmarks against exploit-oriented benchmarks.
Collection Notes
- Verify full author list during ingest.
- Extract failure modes and benchmark construction details.