Capture Notes
Paper on the reliability, calibration, and adversarial robustness of automated attack success rate (ASR) scoring for jailbreak evaluation.
AI security relevance:
- Important for validating research questions, because many jailbreak/guardrail papers depend on LLM-as-judge or automated ASR scoring.
- Supports gap analysis around evaluator integrity and benchmark reliability.