AI Security Research Portal
methodactiveMethods

SOC Evaluation Parser Audit

Purpose

A SOC evaluation parser audit checks whether the scoring pipeline for LLM-based SOC tasks faithfully extracts model outputs, maps them to the intended taxonomy, and reports uncertainty without suppressing valid answers.

When To Use

Procedure

  1. Preserve raw model outputs before parsing.
  2. Define the expected output schema and allowed aliases.
  3. Test strict parsing and fuzzy parsing side by side.
  4. Manually audit a stratified sample of parse failures.
  5. Report invalid-output rate separately from wrong-answer rate.
  6. Publish parser code, taxonomy, prompt template, and scoring script.

Evidence Base

Related