AI Security Research Portal
research-questionactiveResearch Questions

RQ-20260703-011-open-weight-ai-soc-evaluation

Question

What evaluation contract is required before a SOC can safely adopt open-weight models for alert triage, log analysis, CTI enrichment, or analyst assistance?

Motivation

SRC-20260703-open-weight-ai-soc suggests that open-weight AI SOC research has two separate gaps: model capability and evaluation reliability. A SOC may prefer local models for privacy and cost, but parser-induced suppression, synthetic datasets, weak telemetry provenance, and unclear analyst-review boundaries can make benchmark scores misleading.

Evidence To Gather

Candidate Evaluation Dimensions

Related