AI Security Research Portal

Sourcessourceseed2026-07-04ai-securityai-for-securitybenchmarkcve-benchvulnerability-repairsoftware-engineering-agents

Capture Summary

NAACL paper introducing a CVE-based benchmark for evaluating LLM/software-engineering agents on real-world vulnerability repair tasks.

Relevance

Core source for AI-assisted secure patching.
Useful for comparing repair-oriented benchmarks against exploit-oriented benchmarks.

Collection Notes

Verify full author list during ingest.
Extract failure modes and benchmark construction details.