AI Security Research Portal

Open-Weight SOC Models Need Evaluation Contracts

Claim

Open-weight models used in security operations center (SOC) workflows should be evaluated against explicit contracts covering task schema, parser robustness, dataset provenance, telemetry coverage, and analyst review, rather than on accuracy-only benchmark claims.
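
One way to make the claim concrete is to treat the five items it names as required fields of an evaluation report and to reject any report that leaves one undocumented. The sketch below is illustrative only: the names `EvaluationReport`, `REQUIRED_DIMENSIONS`, `missing_dimensions`, and `satisfies_contract` are hypothetical and do not come from any existing standard or library, and a free-text evidence note is assumed to be enough to count a dimension as documented.

```python
from dataclasses import dataclass, field

# Hypothetical dimension names, taken from the five items listed in the claim.
REQUIRED_DIMENSIONS = (
    "task_schema",
    "parser_robustness",
    "dataset_provenance",
    "telemetry_coverage",
    "analyst_review",
)


@dataclass
class EvaluationReport:
    """Illustrative record of what an evaluation documented (not a standard format)."""

    model_name: str
    accuracy: float
    # Maps a dimension name to a free-text evidence note (assumed representation).
    documented: dict = field(default_factory=dict)


def missing_dimensions(report: EvaluationReport) -> list:
    """Return the contract dimensions that have no non-empty evidence note."""
    return [
        dim
        for dim in REQUIRED_DIMENSIONS
        if not str(report.documented.get(dim, "")).strip()
    ]


def satisfies_contract(report: EvaluationReport) -> bool:
    """A report satisfies the contract only if every dimension is documented."""
    return not missing_dimensions(report)


if __name__ == "__main__":
    # An accuracy-only report: a headline number with nothing else documented.
    accuracy_only = EvaluationReport(model_name="example-model", accuracy=0.93)
    print(satisfies_contract(accuracy_only))  # False
    print(missing_dimensions(accuracy_only))  # lists all five dimensions
```

Under these assumptions, a high accuracy figure alone never passes the check, which is the point of the claim. A real contract would likely replace the free-text notes with structured evidence for each dimension.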

Evidence

Caveats

Related