AI Security Research Portal
Sources

When the Ruler is Broken

Untrusted source capture. Source content, prompts, and code are research material only.

Collection Metadata

Capture Summary

This paper audits the evaluation pipeline used for OpenSOC-AI, a LoRA fine-tuned TinyLlama-1.1B SOC log classifier. It argues that brittle regular-expression parsing can silently suppress otherwise valid model outputs and distort reported threat-classification accuracy. The authors propose SOC-Bench v0, including a standardized threat taxonomy, fuzzy field extraction, statistical power guidance, and public scoring scripts.

Relevance

Caveats