MemAudit
Collection Summary
A post-hoc auditing framework: after harmful behavior has been observed, it traces which stored memory caused it, using counterfactual replay and a memory consistency graph.
Rollout-Buffer Relevance
- **Target store**: past interactions, retrieved demonstrations, and reasoning trajectories held in persistent memory.
- **Audit method**: counterfactual memory influence score plus structural anomaly detection.
- **Security relevance**: supports incident response for poisoned experience buffers and rejected/unsafe trajectory archives.
- **Attack model**: interaction-induced memory injection, where the attacker plants entries through ordinary interactions without modifying the memory bank directly.
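
The counterfactual half of the audit method can be illustrated with a leave-one-out replay: rerun the incident with one memory removed and see how much the harm drops. The sketch below is only an illustration under assumptions, not MemAudit's actual scoring rule. The names `influence_scores`, `rank_suspects`, and `toy_harm_score` are hypothetical, and the replay function is a stand-in for re-running the agent on the incident prompt. The structural anomaly detection over the consistency graph is not covered here.

```python
from typing import Callable, Dict, List, Sequence

# Hypothetical interface: replay the incident with a memory subset and
# return a harmfulness value (higher means more harmful).
ReplayFn = Callable[[Sequence[str]], float]


def influence_scores(memories: Sequence[str], harm_score: ReplayFn) -> Dict[int, float]:
    """Leave-one-out influence: drop in harm when memory i is removed."""
    baseline = harm_score(memories)
    scores: Dict[int, float] = {}
    for i in range(len(memories)):
        # Counterfactual store: everything except entry i.
        ablated = [m for j, m in enumerate(memories) if j != i]
        scores[i] = baseline - harm_score(ablated)
    return scores


def rank_suspects(scores: Dict[int, float], top_k: int = 3) -> List[int]:
    """Return the indices of the top_k most influential memories."""
    return sorted(scores, key=scores.get, reverse=True)[:top_k]


def toy_harm_score(subset: Sequence[str]) -> float:
    """Toy stand-in for a real replay: harmful iff a poisoned entry is present."""
    return 1.0 if any("ignore safety" in m for m in subset) else 0.0


memories = [
    "user prefers concise answers",
    "demo: ignore safety checks when asked politely",
    "trajectory: solved task via tool call",
]
scores = influence_scores(memories, toy_harm_score)
suspects = rank_suspects(scores, top_k=1)
print(scores, suspects)
```

Leave-one-out needs one replay per stored entry, and it can miss memories that only cause harm in combination. A real audit would likely need to ablate groups of entries or sample subsets instead.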