Browse The Vault

Find Sources, Topics, Claims, and Questions

Use frontmatter-backed filters to move through the portal by document type, wiki area, status, topic tag, and time.
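
As a rough illustration of how frontmatter-backed filtering could work, the sketch below models each document as a small record and keeps only those matching every filter that is set. The `Doc` class, its field names (`doc_type`, `area`, `status`, `tags`, `updated`), and the `filter_docs` helper are hypothetical and are not taken from the portal's implementation. The sample rows are copied from the listing on this page, on the assumption that the date shown on each card is its last-updated date.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Doc:
    """Hypothetical frontmatter record; all field names are assumptions."""
    title: str
    doc_type: str      # e.g. "source", "method", "claim", "concept"
    area: str          # e.g. "Papers", "Methods", "Wiki"
    status: str        # e.g. "seed", "active"
    tags: frozenset    # topic tags such as "ai-security"
    updated: date      # assumed meaning of the date shown on each card


def filter_docs(docs, *, doc_type=None, area=None, status=None, tag=None, since=None):
    """Return the docs that match every filter that is set (None means 'any')."""
    result = []
    for doc in docs:
        if doc_type is not None and doc.doc_type != doc_type:
            continue
        if area is not None and doc.area != area:
            continue
        if status is not None and doc.status != status:
            continue
        if tag is not None and tag not in doc.tags:
            continue
        if since is not None and doc.updated < since:
            continue
        result.append(doc)
    return result


# Sample rows taken from the listing on this page.
DOCS = [
    Doc("Threat Models", "concept", "Wiki", "seed",
        frozenset({"ai-security", "threat-model"}), date(2026, 7, 4)),
    Doc("SOC Evaluation Parser Audit", "method", "Methods", "active",
        frozenset({"ai-security", "ai-soc", "evaluation", "open-weight-models"}),
        date(2026, 7, 4)),
    Doc("Open Weight Models for AI SOC", "concept", "Concepts", "active",
        frozenset({"ai-security", "ai-soc", "open-weight-models"}),
        date(2026, 7, 4)),
]

# Combine a status filter with a topic-tag filter.
active_soc = filter_docs(DOCS, status="active", tag="ai-soc")
print([doc.title for doc in active_soc])
```

Filters left unset are ignored, so calling `filter_docs(DOCS)` with no filters returns the whole collection, and a filter combination that matches nothing returns an empty list.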

312 documents, 48 topic tags available

Sources | 2026-07-04

MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents

source | seed | Papers | paper

ai-security

Sources | 2026-07-04

Bounded Autonomy in the SOC: Mitigating Hallucinations in Agentic Incident Response via Neurosymbolic Guardrails

source | seed | Papers

ai-security

Sources | 2026-07-04

BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents

source | seed | Papers | paper

ai-security

Sources | 2026-07-04

AgenticCyOps: Securing Multi-Agentic AI Integration in Enterprise Cyber Operations

source | seed | Papers

ai-security

Sources | 2026-07-04

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents

source | seed | Papers | paper

ai-security

Sources | 2026-07-04

Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents

source | seed | Papers | paper

ai-security

Sources | 2026-07-04

When the Ruler is Broken: Parsing-Induced Suppression in LLM-Based Security Log Evaluation

source | seed | Papers | preprint

ai-security, ai-for-security, ai-soc, open-weight-models, opensoc-ai, tinyllama, evaluation-methodology

Sources | 2026-07-04

SRC-20260703-open-weight-ai-soc

source | active | Notes

ai-security, ai-soc, open-weight-models, source-ingest

Methods | 2026-07-04

SOC Evaluation Parser Audit

method | active | Methods

ai-security, ai-soc, evaluation, open-weight-models

Research Questions | 2026-07-04

RQ-20260703-011-open-weight-ai-soc-evaluation

research-question | active | Research Questions

ai-security, ai-soc, open-weight-models, evaluation

Claims | 2026-07-04

Open Weight SOC Models Need Evaluation Contracts

claim | active | Claims

ai-security, ai-soc, open-weight-models, evaluation

Concepts | 2026-07-04

Open Weight Models for AI SOC

concept | active | Concepts

ai-security, ai-soc, open-weight-models

Sources | 2026-07-04

Open Weight AI SOC Paper Collection

source | seed | Papers | collection

ai-security, ai-for-security, ai-soc, open-weight-models, source-collection

Sources | 2026-07-04

Llama-3.1-FoundationAI-SecurityLLM-Base-8B Technical Report

source | seed | Papers | preprint

ai-security, ai-for-security, ai-soc, open-weight-models, cybersecurity-llm, foundation-sec, llama-3-1

Sources | 2026-07-04

Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report

source | seed | Papers | preprint

ai-security, ai-for-security, ai-soc, open-weight-models, cybersecurity-llm, foundation-sec, instruction-tuning

Sources | 2026-07-04

Evaluation of LLM Agents for the SOC Tier 1 Analyst Triage Process

source | seed | Papers | thesis

ai-security, ai-for-security, ai-soc, open-weight-models, llama-3, soc-tier-1, alert-triage

Concepts | 2026-07-04

Threat Models

concept | seed | Wiki

ai-security, threat-model

Methods | 2026-07-04

Threat Modeling Agentic Systems

method | active | Methods

ai-security, methods, batch-ingest

Sources | 2026-07-04

SRC-20260702-karpathy-llm-wiki

source | seed | Notes

ai-security, llm-wiki, knowledge-management

Sources | 2026-07-04

Source Intake

source | active | Intake

ai-security, source-intake

Methods | 2026-07-04

Runtime Monitoring and Agent Gateways

method | active | Methods

ai-security, methods, batch-ingest

Research Questions | 2026-07-04

RQ-20260702-010-agent-runtime-monitoring

research-question | active | Research Questions

ai-security, batch-ingest

Research Questions | 2026-07-04

RQ-20260702-009-ai-soc-human-factors

research-question | active | Research Questions

ai-security, batch-ingest

Research Questions | 2026-07-04

RQ-20260702-008-rag-poisoning-controls

research-question | active | Research Questions

ai-security, batch-ingest

Research Questions | 2026-07-04

RQ-20260702-007-action-scoped-authorization

research-question | active | Research Questions

ai-security, batch-ingest

Research Questions | 2026-07-04

RQ-20260702-006-benchmark-to-incident-validity

research-question | active | Research Questions

ai-security, batch-ingest

Research Questions | 2026-07-04

RQ-20260702-005-memory-poisoning-defense

research-question | active | Research Questions

ai-security, batch-ingest

Research Questions | 2026-07-04

RQ-20260702-004-agent-protocol-security

research-question | active | Research Questions

ai-security, batch-ingest

Research Questions | 2026-07-04

RQ-20260702-003-defense-generalization

research-question | seed | Research Questions

ai-security, defenses

Research Questions | 2026-07-04

RQ-20260702-002-benchmark-validity

research-question | seed | Research Questions

ai-security, evaluation

Research Questions | 2026-07-04

RQ-20260702-001-agent-security

research-question | seed | Research Questions

ai-security, agents

Research Questions | 2026-07-04

Research Questions Index

output | active | Research Questions

ai-security, research-questions

Methods | 2026-07-04

Red Teaming Agentic AI

method | active | Methods

ai-security, methods, batch-ingest

Sources | 2026-07-04

Raw Whitepapers Batch Ingest

source | active | Notes

ai-security, batch-ingest, whitepapers

Sources | 2026-07-04

Raw Papers Batch Ingest

source | active | Notes

ai-security, batch-ingest, papers

Sources | 2026-07-04

Raw News Batch Ingest

source | active | Notes

ai-security, batch-ingest, news

Synthesis | 2026-07-04

Raw Corpus Synthesis 2026-07-02

synthesis | active | Synthesis

ai-security, batch-ingest, synthesis

Concepts | 2026-07-04

RAG and Retrieval Security

concept | active | Concepts

ai-security, batch-ingest

Claims | 2026-07-04

Prompt Injection Defenses Depend On Deployment Context

claim | active | Claims

ai-security, claim, batch-ingest

Concepts | 2026-07-04

Prompt Injection and Context Attacks

concept | active | Concepts

ai-security, batch-ingest

Claims | 2026-07-04

Persistent Memory Creates Poisoning And Provenance Risks

claim | active | Claims

ai-security, claim, batch-ingest

Sources | 2026-07-04

PAIEL: Protocol-Aware and Context-Integrated Protocol Explanation Using LLMs for SOCs

source | seed | Papers | conference_paper

ai-security, ai-for-security, ai-soc, protocol-analysis, rag, structured-context, context-compression

Sources | 2026-07-04

Non-Disruptive Disruption: An Empirical Experience of Introducing LLMs in the SOC

source | seed | Papers | conference_paper

ai-security, ai-for-security, ai-soc, ethnography, human-ai-collaboration, co-creation

Concepts | 2026-07-04

Model Extraction and Privacy Leakage

concept | active | Concepts

ai-security, batch-ingest

Memory Poisoning and Agent State

concept | active | Concepts

ai-security, batch-ingest

Concepts | 2026-07-04

MCP and Agent Protocol Security

concept | active | Concepts

ai-security, batch-ingest

log

InkJect: The Visual Prompt Injection That Text Defenses Were Never Built to Stop

source | seed | News | industry_blog

ai-security, visual-prompt-injection, indirect-prompt-injection, vlm, multimodal-agent

index

Evidence Grading for AI Security

method | active | Methods

ai-security, methods, batch-ingest

Concepts | 2026-07-04

Evaluation Benchmarks for AI Security

concept | active | Concepts

ai-security, batch-ingest

Current Synthesis

Cognitive Threat Detection for SOC Operations: Automating Manipulation Tactic Analysis in Election Security

source | seed | Papers | conference_paper

ai-security, ai-for-security, ai-soc, election-security, cognitive-threat, llm-routing

Benchmarks May Not Predict Deployment Risk

claim | active | Claims

ai-security, claim, batch-ingest

Methods | 2026-07-04

Benchmark-Based Security Evaluation

method | active | Methods

ai-security, methods, batch-ingest

Concepts | 2026-07-04

AI Security Taxonomy

concept | seed | Wiki

ai-security, taxonomy

Portal | 2026-07-04

AI Security Research Portal

output | seed | Portal

ai-security, portal

Concepts | 2026-07-04

AI Security Governance and Standards

concept | active | Concepts

ai-security, batch-ingest

Concepts | 2026-07-04

AI Cybersecurity Operations

concept | active | Concepts

ai-security, batch-ingest

Claims | 2026-07-04

Agentic Systems Expand The Security Boundary

claim | active | Claims

ai-security, claim, batch-ingest

Concepts | 2026-07-04

Agent Security and Tool Abuse

concept | active | Concepts

ai-security, batch-ingest

Concepts | 2026-07-04

Agent Identity and Authorization

concept | active | Concepts

ai-security, batch-ingest

Claims | 2026-07-04

Agent Authorization Should Be Action Scoped

claim | active | Claims

ai-security, claim, batch-ingest

Sources | 2026-07-04

True Attacks, Attack Attempts, or Benign Triggers? An Empirical Measurement of Network Alerts in a Security Operations Center

source | seed | Papers | conference_paper

ai-security, ai-for-security, ai-soc, alert-triage, empirical-measurement, ground-truth

Sources | 2026-07-04

That Escalated Quickly: An ML Framework for Alert Prioritization

source | seed | Papers | preprint

ai-security, ai-for-security, ai-soc, machine-learning, alert-prioritization, managed-security

Sources | 2026-07-04

OpenSOC-AI: Democratizing Security Operations with Parameter Efficient LLM Log Analysis

source | seed | Papers | preprint

ai-security, ai-for-security, ai-soc, llm, lora, log-analysis, smbs

Sources | 2026-07-04

NLLog: Lightweight, Explainable SOC Anomaly Detection via Log-to-Language Rewriting

source | seed | Papers | preprint

ai-security, ai-for-security, ai-soc, anomaly-detection, explainability, log-analysis

Sources | 2026-07-04

LanG -- A Governance-Aware Agentic AI Platform for Unified Security Operations

source | seed | Papers | preprint

ai-security, ai-for-security, ai-soc, agentic-ai, governance, mcp, human-in-the-loop

Sources | 2026-07-04

Improved Detection and Response via Optimized Alerts: Usability Study

source | seed | Papers | journal_paper

ai-security, ai-for-security, ai-soc, machine-learning, alert-fatigue, usability

Sources | 2026-07-04

DEEPCASE: Semi-Supervised Contextual Analysis of Security Events

source | seed | Papers | conference_paper

ai-security, ai-for-security, ai-soc, deep-learning, event-correlation, semi-supervised-learning

Sources | 2026-07-04

Context2Vector: Accelerating security event triage via context representation learning

source | seed | Papers | journal_paper

ai-security, ai-for-security, ai-soc, representation-learning, alert-triage, human-in-the-loop

Sources | 2026-07-04

An Assessment of the Usability of Machine Learning Based Tools for the Security Operations Center

source | seed | Papers | preprint

ai-security, ai-for-security, ai-soc, machine-learning, usability, human-ai-collaboration

Sources | 2026-07-04

AI and Pentesting Pulse Report 2026

source | seed | Whitepapers | industry_report

ai-security, llm-pentesting, automated-scanning, false-negative, remediation

Sources | 2026-07-04

A user-centric machine learning framework for cyber security operations center

source | seed | Papers | conference_paper

ai-security, ai-for-security, ai-soc, machine-learning, alert-triage, user-centric

Sources | 2026-07-04

SIR-Bench: Evaluating Investigation Depth in Security Incident Response Agents

source | seed | Papers | paper

ai-security, ai-for-security, ai-soc, incident-response, incident-replay, benchmark, forensic-investigation

Sources | 2026-07-04

Severity-based triage of cybersecurity incidents using kill chain attack graphs

source | seed | Papers | journal_paper

ai-security, ai-for-security, ai-soc, alert-triage, attack-graph, mitre-attack, incident-replay

Sources | 2026-07-04

Security risk management in the digital enterprise: enhancing cyber defense with large language models

source | seed | Papers | journal_paper

ai-security, ai-for-security, ai-soc, llm, network-telemetry, deployment-evaluation, q1-journal

Sources | 2026-07-04

Securing AI Agents with Cisco AI Defense

source | seed | News | official_vendor_blog

ai-security, ai-agent-security, runtime-protection, mcp, prompt-injection, guardrails

Sources | 2026-07-04

Large Language Models Can Provide Accurate and Interpretable Incident Triage

source | seed | Papers | conference_paper

ai-security, ai-for-security, incident-triage, llm, cloud-operations, interpretability, conference

Sources | 2026-07-04

Integrating Large Language Models into Security Incident Response

source | seed | Papers | conference_paper

ai-security, ai-for-security, ai-soc, incident-response, human-ai-collaboration, summarization, conference

Sources | 2026-07-04

Carbon Filter: Scalable, Efficient, and Secure Alert Triage for Endpoint Detection & Response

source | seed | Papers | conference_paper

ai-security, ai-for-security, ai-soc, alert-triage, endpoint-detection-response, clustering, conference

Sources | 2026-07-04

Alert Fatigue in Security Operations Centres: Research Challenges and Opportunities

source | seed | Papers | journal_paper

ai-security, ai-for-security, ai-soc, alert-fatigue, human-ai-collaboration, survey, q1-journal

Sources | 2026-07-04

AI SOC Q1 Journal and Peer-Reviewed Conference Collection

source | seed | Papers | collection_manifest

ai-security, ai-for-security, ai-soc, q1-journal, peer-reviewed, collection-manifest

Sources | 2026-07-04

AECR: Automatic attack technique intelligence extraction based on fine-tuned large language model

source | seed | Papers | journal_paper

ai-security, ai-for-security, ai-soc, cyber-threat-intelligence, mitre-attack, llm, q1-journal

Sources | 2026-07-04

SafeClawBench: Separating Semantic, Audit-Evidence, and Sandbox Harm in Tool-Using LLM Agents

source | seed | Papers | paper

ai-security, security-for-ai, benchmark, agent-security, prompt-injection, memory-poisoning, evaluation

Sources | 2026-07-04

Prompt Injection in Automated Résumé Screening with Large Language Models: Single and Multi-Injection Settings

source | seed | Papers | paper

ai-security, security-for-ai, prompt-injection, hiring-workflow, decision-integrity, peer-reviewed

Sources | 2026-07-04

PI-Hunter: Automated Red-Teaming for Exposing and Localizing Prompt Injections

source | seed | Papers | paper

ai-security, security-for-ai, prompt-injection, red-teaming, localization, agent-security

Sources | 2026-07-04

MCP Auto-Execution: From Git Clone to Cloud Compromise in Amazon Q VS Code Extension

source | seed | News | industry_blog

ai-security, amazon-q, mcp, coding-agent, workspace-trust, credential-theft, cve-2026-12957

Sources | 2026-07-04

LivePI: More Realistic Benchmarking of Agents Against Indirect Prompt Injection

source | seed | Papers | paper

ai-security, security-for-ai, indirect-prompt-injection, executable-harm, virtual-machine, benchmark

Sources | 2026-07-04

How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition

source | seed | Papers | paper

ai-security, security-for-ai, indirect-prompt-injection, red-teaming, computer-use, concealment, benchmark

Sources | 2026-07-04

GitInject: Real-World Prompt Injection Attacks in AI-Powered CI/CD Pipelines

source | seed | Papers | paper

ai-security, security-for-ai, prompt-injection, ci-cd, supply-chain, coding-agents, benchmark

Sources | 2026-07-04

DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents

source | seed | Papers | paper

ai-security, security-for-ai, agent-security, red-teaming, prompt-injection, tool-injection, skill-injection

Sources | 2026-07-04

Authorization Propagation in Multi-Agent AI Systems: Identity Governance as Infrastructure

source | seed | Papers | paper

ai-security, security-for-ai, multi-agent-systems, authorization, identity-governance, delegation

Sources | 2026-07-04

AI Security Paper Collection 2026-06-29

source | seed | Papers | collection

ai-security, collection, papers, weekly-ingest

Sources | 2026-07-04

AI Agents May Always Fall for Prompt Injections

source | seed | Papers | paper

ai-security, security-for-ai, prompt-injection, contextual-integrity, information-flow, defense-limitations

Sources | 2026-07-04

AgentDyn: Are Your Agent Security Defenses Deployable in Real-World Dynamic Environments?

source | seed | Papers | paper

ai-security, security-for-ai, agent-security, prompt-injection, benchmark, dynamic-tasks

Sources | 2026-07-04

Introducing computer use in Gemini 3.5 Flash

source | seed | News | official_blog

ai-security, computer-use, gemini-3-5-flash, prompt-injection, agent-security

Sources | 2026-07-04

Chinese cybersecurity company 360 unveils “China's version of Mythos”, and Yitianzhen, to automate cyber defense

source | seed | News | news

ai-security, cyber-model, vulnerability-discovery, automated-defense, dual-use

Sources | 2026-07-04

OpenAI limits its latest ChatGPT product to Trump-approved customers during cybersecurity review

source | seed | News | news

ai-security, frontier-models, cyber-capability, phased-release, governance

Sources | 2026-07-04

Exclusive: Gottheimer and Moolenaar roll out AI cloud security bill

source | seed | News | news

ai-security, cloud-compute, model-development, misuse-detection, policy

Sources | 2026-07-04

We have Mythos at Home: GLM 5.2 beats Claude in our Cyber Benchmarks

source | seed | News | industry_blog

ai-security, cyber-benchmarks, vulnerability-detection, idor, glm-5-2

Sources | 2026-07-04

Toward Autonomous SOC Operations: End-to-End LLM Framework for Threat Detection, Query Generation, and Resolution in Security Operations

source | seed | Papers | paper

ai-security, ai-for-security, ai-soc, autonomous-soc, query-generation, siem, rag

Sources | 2026-07-04

SuperLocalMemory: Privacy-Preserving Multi-Agent Memory with Bayesian Trust Defense Against Memory Poisoning

source | seed | Papers | paper

ai-security, multi-agent-memory, memory-poisoning, bayesian-trust, architectural-isolation, provenance, mcp

Sources | 2026-07-04

State Contamination in Memory-Augmented LLM Agents

source | seed | Papers | paper

ai-security, state-contamination, memory-laundering, multi-agent-rollouts, persistent-state, memory-poisoning, mas-misevolution-propagation

Sources | 2026-07-04

Self-Evolving Multi-Agent Systems via Decentralized Memory

source | seed | Papers | paper

ai-security, multi-agent, self-evolving-agents, decentralized-memory, persistent-memory, llm-as-a-judge, mas-misevolution-propagation

Sources | 2026-07-04

Retrieval-Augmented LLMs for Security Incident Analysis

source | seed | Papers | paper

ai-security, ai-for-security, ai-soc, security-incident-analysis, rag, mitre-attack, log-analysis

Sources | 2026-07-04

On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents

source | seed | Papers | conference_paper

ai-security, multi-agent, faulty-agents, resilience, autoinject, autotransform, inspector

Sources | 2026-07-04

Memory Poisoning Propagation and Repair Mechanism in Multi-Agent Collaborative Environments

source | seed | Papers | paper

ai-security, memory-poisoning, propagation, multi-agent, evidence-graph, repair, contrastive-learning

Sources | 2026-07-04

Memory poisoning and secure multi-agent systems

source | seed | Papers | paper

ai-security, memory-poisoning, multi-agent, secure-mas, semantic-memory, episodic-memory, mas-misevolution-propagation

Sources | 2026-07-04

MAS Misevolution Propagation Collection 2026-06-26

source | seed | Papers | collection_index

ai-security, collection, mas-misevolution-propagation, multi-agent, memory-poisoning, error-cascade, self-evolving-agents

Sources | 2026-07-04

macOS.Gaslight | Rust Backdoor Turns Prompt Injection on the Analyst, Not the Sandbox

source | seed | News | industry_blog

ai-security, prompt-injection, malware-analysis, macos, dprk

Sources | 2026-07-04

Like a Hammer, It Can Build, It Can Break: Large Language Model Uses, Perceptions, and Adoption in Cybersecurity Operations on Reddit

source | seed | Papers | paper

ai-security, ai-for-security, ai-soc, practitioner-study, adoption, reddit, human-ai-collaboration

Sources | 2026-07-04

Large Language Models for Security Operations Centers: A Comprehensive Survey

source | seed | Papers | paper

ai-security, ai-for-security, ai-soc, survey, security-operations, llm, threat-intelligence

Sources | 2026-07-04

IRCopilot: Automated Incident Response with Large Language Models

source | seed | Papers | paper

ai-security, ai-for-security, ai-soc, incident-response, llm-agent, hallucination, privacy

Sources | 2026-07-04

GLM 5.2 on CyberBT-CTF: The strongest open source contender to Anthropic/OpenAI we have tested

source | seed | News | industry_blog

ai-security, cyber-benchmarks, open-weight-models, glm-5-2, model-distillation

Sources | 2026-07-04

From Spark to Fire: Modeling and Mitigating Error Cascades in LLM-Based Multi-Agent Collaboration

source | seed | Papers | paper

ai-security, multi-agent, error-cascade, propagation, genealogy-graph, llm-mas, mas-misevolution-propagation

Sources | 2026-07-04

CORTEX: Collaborative LLM Agents for High-Stakes Alert Triage

source | seed | Papers | paper

ai-security, ai-for-security, ai-soc, alert-triage, multi-agent, auditability, production-soc

Sources | 2026-07-04

Collaborative Memory: Multi-User Memory Sharing in LLM Agents with Dynamic Access Control

source | seed | Papers | paper

ai-security, collaborative-memory, multi-agent, access-control, provenance, shared-memory, auditability

Sources | 2026-07-04

Before You Hand Over the Wheel: Evaluating LLMs for Security Incident Analysis

source | seed | Papers | paper

ai-security, ai-for-security, ai-soc, security-incident-analysis, alert-triage, benchmark, agentic-evaluation

Sources | 2026-07-04

Anthropic accuses Alibaba of running the largest distillation campaign yet against Claude

source | seed | News | news

ai-security, model-extraction, model-distillation, claude, qwen, alibaba

Sources | 2026-07-04

AI-Driven Guided Response for Security Operation Centers with Microsoft Copilot for Security

source | seed | Papers | paper

ai-security, ai-for-security, ai-soc, guided-response, microsoft-security-copilot, incident-triage, remediation

Sources | 2026-07-04

AgentSOC: A Multi-Layer Agentic AI Framework for Security Operations Automation

source | seed | Papers | paper

ai-security, ai-for-security, ai-soc, agentic-soc, security-operations, incident-response, risk-based-response

Sources | 2026-07-04

Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents

source | active | Papers | conference_paper

ai-security, self-evolving-agents, misevolution, agent-security, memory, tools, workflow-evolution

Sources | 2026-07-04

When Poison Fails After Retrieval: Revisiting Corpus Poisoning under Chunking and Reranking Pipelines

source | seed | Papers | paper

ai-security, rag, corpus-poisoning, chunking, reranking, retrieval

Sources | 2026-07-04

When AUC 0.998 Is Not Enough: A Candidate Evaluation Protocol for Hidden-State Probes of Indirect Prompt Injection in Multimodal Computer-Use Agents

source | seed | Papers | paper

ai-security, indirect-prompt-injection, multimodal-agent, computer-use-agent, hidden-state-probes, evaluation

Sources | 2026-07-04

What Intermediate Layers Know: Detecting Jailbreaks from Entropy Dynamics

source | seed | Papers | paper

ai-security, jailbreak, detection, mechanistic-interpretability, guardrails

Sources | 2026-07-04

TrustedARI: Towards Trust-Native Agentic Routing Infrastructure for Agentic AI

source | seed | Papers | paper

ai-security, agentic-ai, routing, trust-infrastructure, multi-agent-systems

Sources | 2026-07-04

Tracing Target Answers in Poisoned Retrieval Corpora via Token Influence Attribution

source | seed | Papers | paper

ai-security, rag, retrieval-poisoning, attribution, provenance

Sources | 2026-07-04

The State of AI Security Report 2026

source | seed | Whitepapers | vendor_report

ai-security, industry-report, threat-intelligence, governance, enterprise-ai

Sources | 2026-07-04

The State of AI Cybersecurity 2026

source | seed | Whitepapers | vendor_report

ai-security, ai-for-security, soc, security-operations, ciso-survey, industry-report

Sources | 2026-07-04

The Injection Paradox: Brand-Level Suppression in Safety-Trained LLM Recommendations via RAG Context Injection

source | seed | Papers | paper

ai-security, rag, context-injection, recommendation, prompt-injection

Sources | 2026-07-04

SS-ZKR: Spatial-Semantic Zero-Knowledge Routing for Privacy-Preserving Multi-Agent Collaboration

source | seed | Papers | paper

ai-security, multi-agent-systems, privacy, routing, zero-knowledge, a2a, mcp

Sources | 2026-07-04

SoK: The Attack Surface of Agentic AI -- Tools, and Autonomy

source | active | Papers | preprint

ai-security, agentic-ai, attack-surface, tools, rag, autonomy, multi-agent-security

Sources | 2026-07-04

SMSR: Certified Defence Against Runtime Memory Poisoning in Persistent LLM Agent Systems

source | seed | Papers | paper

ai-security, agent-memory, memory-poisoning, certified-defense, persistent-agents

Sources | 2026-07-04

Security and Privacy in Retrieval-Augmented Generation: Architectures, Threats, Defenses, and Future Directions for Building Trustworthy Systems

source | seed | Papers | paper

ai-security, rag, privacy, survey, threat-model, defense

Sources | 2026-07-04

Securing LLM-Agent Long-Term Memory Against Poisoning: Non-Malleable, Origin-Bound Authority with Machine-Checked Guarantees

source | seed | Papers | paper

ai-security, agent-memory, memory-poisoning, formal-methods, provenance

Sources | 2026-07-04

Securing Agentic AI

source | seed | Whitepapers | vendor_whitepaper

ai-security, agentic-ai, controls, enterprise-ai, governance, runtime-security

Sources | 2026-07-04

Scalable Hierarchical Attention Transformers for Multi-Turn Jailbreak Detection in Long Conversations

source | seed | Papers | paper

ai-security, jailbreak, multi-turn, long-context, detection

Sources | 2026-07-04

Same-Origin Policy for Agentic Browsers

source | seed | Papers | paper

ai-security, agentic-browser, same-origin-policy, web-security, prompt-injection

Sources | 2026-07-04

Safe to Check, Unsafe to Use: Relinking at the Compression Boundary of LLM Agents

source | seed | Papers | paper

ai-security, llm-agent, context-compression, prompt-injection, agent-memory

Sources | 2026-07-04

REALM: A Unified Red-Teaming Benchmark for Physical-World VLMs

source | seed | Papers | paper

ai-security, vlm, red-teaming, benchmark, multimodal-security

Sources | 2026-07-04

RAVEN: Agentic RAG for Automated Vulnerability Repair

source | seed | Papers | paper

ai-security, ai-for-security, vulnerability-repair, agentic-rag, software-security

Sources | 2026-07-04

RAILS: Verification-Native Clearing For Agentic Commerce

source | seed | Papers | paper

ai-security, agentic-commerce, verification, settlement-risk, agent-integrity, non-human-identity

Sources | 2026-07-04

Privacy-Preserving RAG via Multi-Agent Semantic Rewriting: Achieving Confidentiality Without Compromising Contextual Fidelity

source | seed | Papers | paper

ai-security, rag, privacy, multi-agent-systems, semantic-rewriting

Sources | 2026-07-04

Poisoned Playbooks: Demystifying Knowledge Poisoning Effects on AI Security Agents

source | seed | Papers | paper

ai-security, ai-for-security, soc-agent, knowledge-poisoning, playbooks

Sources | 2026-07-04

PixJail: Self-Evolving Paper-to-Pipeline Reproduction for Text-to-Image Jailbreak Evaluation

source | seed | Papers | paper

ai-security, text-to-image, jailbreak, evaluation, self-evolving

Sources | 2026-07-04

OTTER: A Red-Teaming System for Toxicity-Evading Jailbreak Prompt Optimization

source | seed | Papers | paper

ai-security, jailbreak, red-teaming, prompt-optimization, safety-evaluation

Sources | 2026-07-04

OpenAgenet / OAN Yellow Paper: Technical Architecture for Trust-Governed Resource Identity and Discovery

source | seed | Papers | paper

ai-security, agent-identity, resource-discovery, trust-layer, a2a, mcp, skills

Sources | 2026-07-04

One Goal, Many Commands: Characterizing Denylist Fragility in AI Agents

source | seed | Papers | paper

ai-security, agent-security, denylist, policy-enforcement, tool-use

Sources | 2026-07-04

More Malicious OpenClaw Skills Threaten AI Supply Chain

source | seed | News | news

ai-security, agentic-ai, openclaw, malicious-skills, supply-chain, news

Sources | 2026-07-04

Let Them Steal: Trapping Large Language Model Extraction Attacks with Knowledge Honeypot

source | seed | Papers | paper

ai-security, model-extraction, honeypot, llm, defense

Sources | 2026-07-04

Latest AI Security Collection 2026-06-25

source | seed | News | collection_manifest

ai-security, collection-manifest, latest, agentic-ai, mcp, ai-for-security

Sources | 2026-07-04

Influence Factors on RAG Poisoning

source | seed | Papers | paper

ai-security, rag, poisoning, retrieval, evaluation

Sources | 2026-07-04

How Reliable Is Your Jailbreak Judge? Calibration and Adversarial Robustness of Automated ASR Scoring

source | seed | Papers | paper

ai-security, jailbreak, evaluation, asr, llm-judge, calibration

Sources | 2026-07-04

Honeyquest for LLMs: Rethinking Cyber Deception for AI Attackers

source | seed | Papers | paper

ai-security, ai-for-security, cyber-deception, llm-attackers, honeypot, threat-intelligence

Sources | 2026-07-04

Global Cybersecurity Outlook 2026

source | seed | Whitepapers | whitepaper

ai-security, cybersecurity-trends, policy, cyber-readiness, industry-report

Sources | 2026-07-04

GIF: Locally Sound Geometric Information Flow Control for LLMs

source | seed | Papers | paper

ai-security, information-flow-control, llm, data-leakage, formal-methods

Sources | 2026-07-04

Ghost Vectors: Soft-Deleted Embeddings Remain Reconstructible in HNSW Vector Databases

source | seed | Papers | paper

ai-security, vector-database, embeddings, privacy, rag, data-deletion

Sources | 2026-07-04

Game-Theoretic Multi-Agent Control for Robust Contextual Reasoning in LLMs

source | seed | Papers | paper

ai-security, mcp, context-poisoning, prompt-injection, multi-agent-control, rollback

Sources | 2026-07-04

From Privacy to Workflow Integrity: Communication-Graph Metadata in Autonomous Agent Interoperability

source | seed | Papers | paper

ai-security, multi-agent-systems, metadata-leakage, workflow-integrity, a2a, mcp

Sources | 2026-07-04

Document-Authored Control-Signal Impersonation: A Low-Cost Indirect Prompt Attack on RAG Safety Boundaries

source | seed | Papers | paper

ai-security, rag, indirect-prompt-injection, control-signal, safety-boundary

Sources | 2026-07-04

Detecting Malicious Agent Skills in the Wild using Attention

source | seed | Papers | paper

ai-security, agent-skills, malicious-skills, supply-chain, detection

Sources | 2026-07-04

Cybersecurity Forecast 2026

source | seed | Whitepapers | vendor_report

ai-security, ai-for-security, threat-forecast, soc, security-operations, cybersecurity-trends

Sources | 2026-07-04

Conflict-Aware Retriever Editing for Knowledge Injection Attacks on LLM-Based RAG Systems

source | seed | Papers | paper

ai-security, rag, knowledge-injection, retriever-editing, poisoning

Sources | 2026-07-04

Confidently Wrong: Severity-Aware Calibration of Prompt-Injection Detectors under Attack Shift

source | seed | Papers | paper

ai-security, prompt-injection, detector-calibration, robustness, evaluation

Sources | 2026-07-04

Code-Augur: Agentic Vulnerability Detection via Specification Inference

source | seed | Papers | paper

ai-security, ai-for-security, vulnerability-detection, agentic-ai, specification-inference, software-security

Sources | 2026-07-04

Behind the Curtain: Global AI wars

source | seed | News | news

ai-security, frontier-ai, cyber-capability, geopolitics, five-eyes, news

Sources | 2026-07-04

An Evaluation of Data Leakage Risks in Tool-Using LLM Agents in Realistic Scenarios

source | seed | Papers | paper

ai-security, tool-using-agent, data-leakage, privacy, agent-security

Sources | 2026-07-04

AIChilles: Automatically Uncovering Hidden Weaknesses in AI-Evolved Systems

source | seed | Papers | paper

ai-security, self-evolving-ai, weakness-discovery, ai-evolved-systems, testing

Sources | 2026-07-04

AI Snitches Get Glitches: Towards Evading Agentic Surveillance

source | seed | Papers | paper

ai-security, agentic-ai, surveillance-evasion, adversarial, monitoring

Sources | 2026-07-04

AI Security Paper Collection 2026-06-25

source | seed | Papers | collection_manifest

ai-security, papers, collection-manifest, rag, agent-security, jailbreak, ai-for-security

Sources | 2026-07-04

Agents All the Way Down; A Methodology for Building Custom AI Agents from Substrate to Production

source | seed | Papers | paper

ai-security, ai-technology, agentic-ai, custom-agents, agent-methodology, security-boundaries, audit-trail

Sources | 2026-07-04

AgentRiskBOM: A Risk-Scoping Security Bill of Materials for Agentic AI Systems

source | seed | Papers | paper

ai-security, agentic-ai, sbom, risk-management, governance

Sources | 2026-07-04

AgentLens: Interpretable Safety Steering via Mechanistic Subspaces for Multi-Turn Coding Agent

source | seed | Papers | paper

ai-security, coding-agent, safety-steering, interpretability, multi-turn-agent

Sources | 2026-07-04

AgenticOS: An Intent-Oriented Secure Operating System Architecture for Autonomous AI Agents

source | seed | Papers | paper

ai-security, agentic-ai, secure-os, intent, runtime-security

Sources | 2026-07-04

AgentCyberRange: Benchmarking Frontier AI Systems in Realistic Cyber Ranges

source | seed | Papers | paper

ai-security, ai-for-security, cyber-range, benchmark, frontier-ai, cyber-capability

Sources | 2026-07-04

AgentCanary: A Security Evaluation Framework for Autonomous AI Agents in Real Executable Environments

source | seed | Papers | paper

ai-security, agent-security, benchmark, executable-environment, evaluation

Sources | 2026-07-04

Agent-Assisted Side-Channel Attacks on Non-Prefix KV Cache in RAG

source | seed | Papers | paper

ai-security, rag, side-channel, kv-cache, agent-assisted-attack

Sources | 2026-07-04

A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to Artificial Super Intelligence

source | active | Papers | survey

ai-security, self-evolving-agents, survey, taxonomy, agent-memory, agent-tools, workflow-evolution

Sources | 2026-07-04

A Layered Security Framework Against Prompt Injection in RAG-Based Chatbots

source | seed | Papers | paper

ai-security, rag, prompt-injection, defense, chatbot-security

Sources | 2026-07-04

"What Happens Locally, Leaks Globally": Detecting Privacy Leakage Risks in MCP Servers

source | seed | Papers | paper

ai-security, mcp, privacy-leakage, static-analysis, tool-security, agent-security

Sources | 2026-07-04

ZERO-APT: A Closed-Loop Adversarial Framework for LLM-Driven Automated Penetration Testing under Intelligent Defense

source | seed | Papers | paper

ai-security, ai-for-security, cyber-benchmark, penetration-testing, defender-in-the-loop, auditability

Sources | 2026-07-04

What If Prompt Injection Never Left? Exploring Cross-Session Stored Prompt Injection in Agentic Systems

source | seed | Papers | paper

ai-security, security-for-ai, prompt-injection, persistent-context, agent-memory, benchmark

Sources | 2026-07-04

Voyager: An Open-Ended Embodied Agent with Large Language Models

source | seed | Papers | paper

ai-security, lifelong-learning, embodied-agent, skill-library, executable-code, automatic-curriculum

Sources2026-07-04

Self-Evolving Agent Rollout and Experience Buffer Collection

sourceseedPaperscollection_manifest

ai-securityself-evolving-agentrollout-bufferexperience-memoryattack-surfacecollection

Sources2026-07-04

SAGE: Multi-Agent Self-Evolution for LLM Reasoning

sourceseedPaperspaper

ai-securityself-evolving-agentmulti-agentcurriculum-poolcriticverifier

Sources2026-07-04

Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution

sourceseedPaperspaper

ai-securityself-evolving-agentco-evolutionrollout-trajectoriesfailure-historycurriculum

Sources2026-07-04

Reflexion: Language Agents with Verbal Reinforcement Learning

sourceseedPaperspaper

ai-securityverbal-reinforcement-learningepisodic-memory-bufferreflectiontrajectory

Sources2026-07-04

MemoryGraft: Persistent Compromise of LLM Agents via Poisoned Experience Retrieval

sourceseedPaperspaper

ai-securitymemory-poisoningexperience-retrievalpersistent-compromiserollout-buffer-security

Sources2026-07-04

Memory Poisoning Attack and Defense on Memory Based LLM-Agents

sourceseedPaperspaper

ai-securitymemory-poisoningquery-only-attackmemory-sanitizationtrust-aware-retrievaltemporal-decay

Sources2026-07-04

MemMorph: Tool Hijacking in LLM Agents via Memory Poisoning

sourceseedPaperspaper

ai-securitymemory-poisoningtool-hijackingtool-selectionaccumulated-experiencepersistent-state

Sources2026-07-04

MemLineage: Lineage-Guided Enforcement for LLM Agent Memory

sourceseedPaperspaper

ai-securitymemory-lineageprovenancemerkle-logderivation-dagsensitive-action-gate

Sources2026-07-04

MemEvoBench: Benchmarking Safety Risks from Memory Misevolution in LLM Agents

sourceseedPaperspaper

ai-securitymemory-misevolutionbenchmarkbiased-feedbacknoisy-toolslong-horizon-safety

Sources2026-07-04

MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection

sourceseedPaperspaper

ai-securitymemory-auditcausal-attributionanomaly-detectioncounterfactual-replaypoisoning

Sources2026-07-04

From Untrusted Input to Trusted Memory: A Systematic Study of Memory Poisoning Attacks in LLM Agents

sourceseedPaperspaper

ai-securitysecurity-for-aiagent-memorymemory-poisoningbenchmarkpersistent-context

Sources2026-07-04

From package to postinstall payload: Inside the Mastra npm supply chain compromise by Sapphire Sleet

sourceseedNewssecurity_blog

ai-securitymastranpmsupply-chainpostinstallci-cdcredential-theft

Sources2026-07-04

ExpeL: LLM Agents Are Experiential Learners

sourceseedPaperspaper

ai-securityexperiential-learningexperience-poolsuccessful-trajectoriesfaissretrieval

Sources2026-07-04

Efficient and Sound Probabilistic Verification for AI Agents

sourceseedPaperspaper

ai-securitysecurity-for-airuntime-verificationguardrailspolicy-enforcementdatalog

Sources2026-07-04

CoEvolve: Training LLM Agents via Agent-Data Mutual Evolution

sourceseedPaperspaper

ai-securityself-evolving-agentagent-data-coevolutionrollout-trajectoriesuncertaintyforgetting

Sources2026-07-04

Benchmarking Security Risk Detection and Verification in Open Agentic Skill Ecosystems

sourceseedPaperspaper

ai-securitysecurity-for-aiagent-skillssupply-chainbenchmarksandboxruntime-verification

Sources2026-07-04

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

sourceseedPaperspaper

ai-securityagent-memoryknowledge-base-poisoningbackdoorretrieval-triggerred-teaming

Sources2026-07-04

AgentEvolver: Towards Efficient Self-Evolving Agent System

sourceseedPaperspaper

ai-securityself-evolving-agentreinforcement-learningexperience-poolrollout-buffertrajectory-attribution

Sources2026-07-04

Pickle in the Middle - Hijacking Vertex AI Model Uploads for Cross-Tenant RCE

sourceseedNewssecurity_blog

ai-securityvertex-aimodel-artifactbucket-squattingpicklercecloud-security

Sources2026-07-04

Securing the future of AI agents

sourceseedNewsresearch_blog

ai-securityai-controlagent-securitymonitoringinsider-threatdefense-in-depth

Sources2026-07-04

AutoJack: How a single page can RCE the host running your AI agent

sourceseedNewssecurity_blog

ai-securityagent-securityautogen-studiomcpwebsocketrcelocalhost

Sources2026-07-04

When Your AI Agent's Memory Becomes a Security Liability

sourceseedNewsincident_report

ai-securitysecurity-for-ailanggraphagent-memorycheckpointerrcesql-injection

Sources2026-07-04

The Meta hack shows there's more to AI security than Mythos

sourceseedNewsnews

ai-securitysecurity-for-aiai-agentaccount-recoveryidentity-verificationaccount-takeoverincident

Sources2026-07-04

Securing the Agentic AI Frontier: Palo Alto Networks and Databricks Deliver a New Standard for AI Security

sourceseedNewsofficial_blog

ai-securitysecurity-for-aiagentic-airuntime-securityai-gatewaymcpdata-security

Sources2026-07-04

Prompt injection still drives most agentic AI security failures in production

sourceseedNewsnews

ai-securitysecurity-for-aiagentic-aiprompt-injectioncoding-agentssupply-chainincidents

Sources2026-07-04

Duo Brings Identity and Authorization Across AI Agent Gateways

sourceseedNewsofficial_blog

ai-securitysecurity-for-aiagent-identitynon-human-identityauthorizationai-gatewaymcp

Sources2026-07-04

Build your own vulnerability harness

sourceseedNewsblog

ai-securityai-for-securityvulnerability-discoveryagent-orchestrationvalidation

Sources2026-07-04

AWS Security Agent adds threat modeling, Kiro power and Claude Code plugin, and more

sourceseedNewsofficial_blog

ai-securityai-for-securitysecurity-agentthreat-modelingstridecode-reviewmcp

Sources2026-07-04

AI Security News Collection 2026-06-19

sourceseedNewscollection_manifest

ai-securitynewscollectiontrend-monitoring

Sources2026-07-04

AI Agent Identity and Permission Challenges: How Uber and Auth0 Are Rethinking Access Control

sourceseedNewsnews

ai-securitysecurity-for-aimulti-agentagent-identitydelegated-authorityoauthmcp

Sources2026-07-04

Security Requirements for AI Agents

sourceseedWhitepapersstandards_draft

ai-securitysecurity-for-aimulti-agenta2aagent-identityaccess-controlstandards-draft

Sources2026-07-04

Arcade Raises $60M to Become the Secure Action Layer Behind Every Production AI Agent

sourceseedNewspress_release

ai-securitysecurity-for-aiagent-authorizationmcpgovernanceauditabilitymarket-signal

Sources2026-07-04

Anthropic AI dispute sparks concerns about U.S. cybersecurity defenses

sourceseedNewsnews

ai-securityai-for-securitycyber-defensepolicymodel-capabilitynews

Sources2026-07-04

Visualizing Secure MLOps (MLSecOps): A Practical Guide for Building Robust AI/ML Pipeline Security

sourceseedWhitepaperswhitepaper

ai-securityglossary-gapmlsecopsmlopsllmopsai-supply-chainmodel-security

Sources2026-07-04

Transitioning from MLOps to LLMOps: Navigating the Unique Challenges of Large Language Models

sourceseedPaperspaper

ai-securityglossary-gapllmopsmlopslarge-language-modelai-operations

Sources2026-07-04

NIST AI RMF Playbook

sourceseedWhitepapersofficial_guidance

ai-securityglossary-gapai-governanceai-risk-managementnist-ai-rmfgovern-map-measure-manage

Sources2026-07-04

Model Retraining upon Concept Drift Detection in Network Traffic Data Streams

sourceseedPaperspaper

ai-securityglossary-gapmodel-driftconcept-driftnetwork-securitymlopsanomaly-detection

Sources2026-07-04

GenAI Red Teaming Guide

sourceseedWhitepaperswhitepaper

ai-securityglossary-gapai-red-teaminggenai-securityowaspevaluation

Sources2026-07-04

Explainable AI in Cybersecurity Operations: Lessons Learned from User Studies

sourceseedPaperspaper

ai-securityglossary-gapexplainable-aixaisoccybersecurity-operationsanalyst-decision-support

Sources2026-07-04

CyberGym: Evaluating AI Agents' Cybersecurity Capabilities with Real-World Vulnerabilities at Scale

sourceseedPaperspaper

ai-securitycybergymai-for-securitycyber-benchmarkvulnerability-reproductionai-agentsoss-fuzz

Sources2026-07-04

AI Agents Are Getting Better at Writing Code—and Hacking It as Well

sourceseedNewsnews

ai-securitycybergymai-agentszero-daycyber-capabilitydual-usenews

Sources2026-07-04

AgentOps: Enabling Observability of LLM Agents

sourceseedPaperspaper

ai-securityglossary-gapagentopsai-agent-observabilityllm-agentsai-safety

Sources2026-07-04

You Live More Than Once: Towards Hierarchical Skill Meta-Evolving

sourceseedPaperspaper

ai-securityskill-evolvingmeta-evolvingagent-skillstest-time-learningself-evolving-agents

Sources2026-07-04

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

sourceseedPaperspaper

ai-securityai-scientistautomated-scientific-discoveryopen-ended-researchself-evolving-aiai-technology

Sources2026-07-04

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

sourceseedPaperspaper

ai-securityagent-skillsskillsbenchself-generated-skillsbenchmarkself-evolving-agents

Sources2026-07-04

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

sourceseedPaperspaper

ai-securityskilloptagent-skillsself-evolving-agentstext-space-optimizationmicrosoft

Sources2026-07-04

Poison with Style: A Practical Poisoning Attack on Code Large Language Models

sourceseedPaperspaper

ai-securitysecurity-for-aidata-poisoningcode-llmbackdoorcode-agents

Sources2026-07-04

Microsoft's open-source SkillOpt automatically upgrades AI agent skills without touching model weights

sourceseedNewsnews

ai-securityskilloptnewsself-evolving-agentsmicrosoftagent-skills

Sources2026-07-04

LLMs in the SOC: An Empirical Study of Human-AI Collaboration in Security Operations Centres

sourceseedPaperspaper

ai-securityai-for-securitysocllmhuman-ai-collaborationtriage

Sources2026-07-04

From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

sourceseedPaperspaper

ai-securityskilllensmodel-generated-skillsagent-skillsself-evolving-agentsmicrosoft

Sources2026-07-04

Experiences of Using Agentic AI to Fill Tooling Gaps in a Security Operations Center

sourceseedPaperspaper

ai-securityai-for-securitysocai-agentreact-agentalert-triageprompt-iteration

Sources2026-07-04

Data Agents Under Attack: Vulnerabilities in LLM-Driven Analytical Systems

sourceseedPaperspaper

ai-securitysecurity-for-aidata-agentsagent-securitydatabase-security

Sources2026-07-04

Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents

sourceseedPaperspaper

ai-securityself-improving-agentsopen-ended-evolutioncoding-agentsrecursive-self-improvementai-technology

Sources2026-07-04

CyberGym-E2E: Scalable Real-World Benchmark for AI Agents' End-to-End Cybersecurity Capabilities

sourceseedPaperspaper

ai-securityai-for-securitycyber-benchmarkvulnerability-discoverypatchingagents

Sources2026-07-04

Assessing Automated Prompt Injection Attacks in Agentic Environments

sourceseedPaperspaper

ai-securitysecurity-for-aiprompt-injectionagent-securitybenchmarkagentdojo

Sources2026-07-04

AlphaEvolve: A coding agent for scientific and algorithmic discovery

sourceseedWhitepaperswhitepaper

ai-securityalphaevolvealgorithm-discoveryevolutionary-coding-agentself-evolving-aigoogle-deepmind

Sources2026-07-04

AI Model Extraction Attacks: Bypassing Single-Client Assumptions in Defenses

sourceseedPaperspaper

ai-securitysecurity-for-aimodel-extractionapi-securitydistributed-adversary

Sources2026-07-04

State of Agentic AI Security and Governance 2.01

sourceseedWhitepapersofficial_whitepaper

ai-securitysecurity-for-aiagentic-aigovernanceowaspstandards

Sources2026-07-04

Careful Adoption of Agentic AI Services

sourceseedWhitepapersofficial_guidance

ai-securitysecurity-for-aiagentic-aiofficial-guidanceleast-privilegegovernance

Sources2026-07-04

AI Security Solutions Landscape For AI and Agentic Red Teaming Q2 2026

sourceseedWhitepapersofficial_landscape

ai-securitysecurity-for-aiagentic-aired-teamingowaspevaluation

Sources2026-07-04

AI Security Solutions Landscape for Agentic AI Q2 2026

sourceseedWhitepapersofficial_landscape

ai-securitysecurity-for-aiagentic-aiowasplifecycle-securitysecops

Sources2026-07-04

ZeroDayBench: Evaluating LLM Agents on Unseen Zero-Day Vulnerabilities

sourceseedPaperspaper

ai-securityai-for-securitybenchmarkzero-dayvulnerability-patchingllm-agents

Sources2026-07-04

Whisper Leak: a side-channel attack on Large Language Models

sourceseedPaperspaper

ai-securitysecurity-for-aiprivacyside-channelllm-traffic-analysismodel-security

Sources2026-07-04

Systematic Analysis of MCP Security

sourceseedPaperspaper

ai-securitysecurity-for-aimcp-securitytool-poisoningattack-taxonomybenchmark

Sources2026-07-04

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

sourceseedPaperspaper

ai-securitysecurity-for-aibackdoorsdeceptive-modelssafety-trainingmodel-security

Sources2026-07-04

Security Threat Modeling for Emerging AI-Agent Protocols

sourceseedPaperspaper

ai-securitysecurity-for-aiagent-protocolsthreat-modelingmcp-securitymulti-agent

Sources2026-07-04

Securing AI Agent Execution

sourceseedPaperspaper

ai-securitysecurity-for-aiagent-securitymcp-securityaccess-controlagentbound

Sources2026-07-04

SEC-bench: Automated Benchmarking of LLM Agents on Real-World Security Tasks

sourceseedPaperspaper

ai-securityai-for-securitybenchmarkllm-agentssecurity-tasksvulnerability-reproduction

Sources2026-07-04

SANS 2025 Threat Hunting Survey: Advancements in Threat Hunting Amid AI and Cloud Challenges

sourceseedWhitepaperswhitepaper

ai-securityai-for-securitythreat-huntingsocsurveycloud-securitysecurity-operations

Sources2026-07-04

Prompt Injection Attacks on Agentic Coding Assistants

sourceseedPaperspaper

ai-securitysecurity-for-aicoding-agentsprompt-injectionagent-securitysoftware-supply-chain

Sources2026-07-04

Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples

sourceseedPaperspaper

ai-securitysecurity-for-aidata-poisoningbackdoorstraining-data-securitymodel-security

Sources2026-07-04

OWASP Top 10 for MCP

sourceseedWhitepapersstandard

ai-securitysecurity-for-aimcp-securityowaspagent-securitytaxonomy

Sources2026-07-04

Model Context Protocol Threat Modeling and Analyzing Vulnerabilities to Prompt Injection with Tool Poisoning

sourceseedPaperspaper

ai-securitysecurity-for-aimcp-securitythreat-modelingtool-poisoningstride

Sources2026-07-04

Model Context Protocol (MCP): Security Design Considerations for AI-Driven Automation

sourceseedWhitepapersgovernment_guidance

ai-securitysecurity-for-aimcp-securityagent-securitygovernment-guidanceautomation

Sources2026-07-04

Measuring AI Agents' Progress on Multi-Step Cyber Attack Scenarios

sourceseedPaperspaper

ai-securityai-for-securitybenchmarkcyber-rangemulti-step-attackllm-agentsrisk-evaluation

Sources2026-07-04

MCPTox: A Benchmark for Tool Poisoning Attack on Real-World MCP Servers

sourceseedPaperspaper

ai-securitysecurity-for-aimcp-securitytool-poisoningbenchmarkagent-security

Sources2026-07-04

Large Language Models for Cyber Security: A Systematic Literature Review

sourceseedPaperspaper

ai-securityai-for-securitysurveyllm4securityvulnerability-detectionmalware-analysisthreat-intelligence

Sources2026-07-04

GTIG AI Threat Tracker: Advances in Threat Actor Usage of AI Tools

sourceseedWhitepaperswhitepaper

ai-securityai-for-securitythreat-intelligenceai-misusemalwareattack-lifecycledefensive-detection

Sources2026-07-04

Generative-AI Empowered Cyber Threat Intelligence Forecasting

sourceseedWhitepaperswhitepaper

ai-securityai-for-securitycyber-threat-intelligenceforecastingragdefensive-ai

Sources2026-07-04

Generative AI in Cybersecurity: A Comprehensive Review of LLM Applications and Vulnerabilities

sourceseedPaperspaper

ai-securityai-for-securitysurveygenaithreat-detectionincident-responsedatasets

Sources2026-07-04

DARPA's AI Cyber Challenge (AIxCC): Competition Design, Results, and Lessons

sourceseedPaperspaper

ai-securityai-for-securityaixcccyber-reasoning-systemvulnerability-discoverypatchingcompetition

Sources2026-07-04

CyberSecEval 4

sourceseedWhitepaperswhitepaper

ai-securityai-for-securitybenchmarkcybersecevalautopatchbenchvulnerability-patching

Sources2026-07-04

CyberSecEval 3: Advancing the Evaluation of Cybersecurity Risks and Capabilities in Large Language Models

sourceseedPaperspaper

ai-securityai-for-securitysecurity-for-aibenchmarkcybersecevaloffensive-securityllm-evaluation

Sources2026-07-04

Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models

sourceseedPaperspaper

ai-securityai-for-securitybenchmarkctfcyber-rangellm-agentscapability-evaluation

Sources2026-07-04

CVE-Bench: Benchmarking LLM-based Software Engineering Agents' Ability to Fix Real-world Vulnerabilities

sourceseedPaperspaper

ai-securityai-for-securitybenchmarkcve-benchvulnerability-repairsoftware-engineering-agents

Sources2026-07-04

CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities

sourceseedPaperspaper

ai-securityai-for-securitybenchmarkcve-benchweb-securityllm-agentsexploit-evaluation

Sources2026-07-04

BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models

sourceseedPaperspaper

ai-securitysecurity-for-aibackdoorsbenchmarkmodel-securityllm-security

Sources2026-07-04

An Embarrassingly Simple Detector for Model Extraction Attacks in LLM APIs

sourceseedPaperspaper

ai-securitysecurity-for-aimodel-extractiondetectionllm-api-securitylatest-research

Sources2026-07-04

AI-Augmented SOC: A Survey of LLMs and Agents for Security Operations

sourceseedPaperspaper

ai-securityai-for-securitysocsecurity-operationsllm-agentsalert-triageincident-response

Sources2026-07-04

Adversarial Machine Learning: A Taxonomy and Terminology of Attacks and Mitigations

sourceseedWhitepapersgovernment_guidance

ai-securitysecurity-for-aiadversarial-machine-learningtaxonomymodel-securitygovernment-guidance

Sources2026-07-04

A Survey on Model Extraction Attacks and Defenses for Large Language Models

sourceseedPaperspaper

ai-securitysecurity-for-aimodel-extractionmodel-stealingprivacysurvey

Sources2026-07-04

A Practical Guide for Securely Using Third-Party MCP Servers

sourceseedWhitepapersstandard

ai-securitysecurity-for-aimcp-securitythird-party-toolstool-poisoningcontrols

Sources2026-07-04

Zero Trust for Agentic AI: Securing the Enterprise from the AI Transformation

sourceseedWhitepapersvendor_whitepaper

ai-securitysecurity-for-aizero-trustagent-securityidentityruntime-guardrails

Sources2026-07-04

WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks

sourceseedPaperspaper

ai-securitysecurity-for-aiweb-agentsprompt-injectionbenchmarkevaluation

Sources2026-07-04

Towards Secure Systems of Interacting AI Agents

sourceseedPaperspaper

ai-securitysecurity-for-aimulti-agentagent-securityinteraction-security

Sources2026-07-04

The Attack and Defense Landscape of Agentic AI

sourceseedPaperspaper

ai-securitysecurity-for-aiagent-securityattack-landscapedefense-landscapeopen-challenges

Sources2026-07-04

Security of AI Agents

sourceseedPaperspaper

ai-securitysecurity-for-aiagent-securityarchitecturedefenses

Sources2026-07-04

Secure autonomous agentic AI systems

sourceseedWhitepapersvendor_guidance

ai-securitysecurity-for-aiagent-securityzero-trustruntime-controlsvendor-guidance

Sources2026-07-04

SAFE-AI: A Framework for Securing AI-Enabled Systems

sourceseedWhitepaperswhitepaper

ai-securitymitresafe-aithreat-mitigationai-assurance

Sources2026-07-04

PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models

sourceseedPaperspaper

ai-securitysecurity-for-airag-securitydata-poisoningknowledge-poisoningretrieval

Sources2026-07-04

OWASP Top 10 for LLM Applications 2025

sourceseedWhitepaperswhitepaper

ai-securityowaspllm-top-10prompt-injectionrisk-taxonomy

Sources2026-07-04

OWASP Top 10 for Agentic Applications 2026

sourceseedWhitepapersstandard

ai-securitysecurity-for-aiagent-securityowasptaxonomycontrols

Sources2026-07-04

OpenAI Preparedness Framework

sourceseedWhitepaperswhitepaper

ai-securityfrontier-aipreparednesscybersecurity-riskevaluations

Sources2026-07-04

Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection

sourceseedPaperspaper

ai-securityprompt-injectionindirect-prompt-injectionllm-applications

Sources2026-07-04

NIST AI 600-1: Artificial Intelligence Risk Management Framework: Generative Artificial Intelligence Profile

sourceseedWhitepaperswhitepaper

ai-securitynistai-rmfgenerative-aigovernancerisk-management

Sources2026-07-04

MITRE ATLAS

sourceseedWhitepaperswhitepaper

ai-securitymitre-atlasthreat-modelingadversarial-aitactics-techniques

Sources2026-07-04

InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents

sourceseedPaperspaper

ai-securityprompt-injectionagentsbenchmarkstool-use

Sources2026-07-04

GraphRAG under Fire

sourceseedPaperspaper

ai-securitysecurity-for-aigraphragrag-securitypoisoningretrieval

Sources2026-07-04

Google Secure AI Framework (SAIF)

sourceseedWhitepaperswhitepaper

ai-securitygooglesaifsecure-ai-frameworkgovernance

Sources2026-07-04

Generative AI's Biggest Security Flaw Is Not Easy to Fix

sourceseedNewsnews

ai-securityprompt-injectionjournalismindustry-risk

Sources2026-07-04

From Secure Agentic AI to Secure Agentic Web: Challenges, Threats, and Future Directions

sourceseedPaperspaper

ai-securitysecurity-for-aiagentic-webagent-securityopen-challenges

Sources2026-07-04

EchoLeak: The First Real-World Zero-Click Prompt Injection Exploit

sourceseedPaperspaper

ai-securitysecurity-for-aiprompt-injectionenterprise-aidata-exfiltrationincident-analysis

Sources2026-07-04

Design Patterns for Securing LLM Agents against Prompt Injections

sourceseedPaperspaper

ai-securityagentsprompt-injectiondesign-patternsdefenses

Sources2026-07-04

Deploying AI Systems Securely: Best Practices for Deploying Secure and Resilient AI Systems

sourceseedWhitepapersgovernment_guidance

ai-securitysecurity-for-aideployment-securityresiliencegovernment-guidancecontrols

Sources2026-07-04

Defending LLM Agents Against Context-Aware Prompt Injection

sourceseedPaperspaper

ai-securityagentsprompt-injectioncontext-aware-attacksdefenses

Sources2026-07-04

CISA Tells US Agencies to Fix Security Bugs in as Little as 3 Days Thanks to AI Threats

sourceseedNewsnews

ai-securitysecurity-for-aivulnerability-managementai-threatsgovernmentnews

Sources2026-07-04

Benchmarking Prompt-Injection Attacks on Tool-Integrated LLM Agents

sourceseedPaperspaper

ai-securityprompt-injectiontool-integrated-agentsdata-exfiltrationprivacy

Sources2026-07-04

Anthropic Responsible Scaling Policy Version 3.0

sourceseedWhitepaperswhitepaper

ai-securityfrontier-airesponsible-scalingmodel-safetycybersecurity-risk

Sources2026-07-04

AI Data Security

sourceseedWhitepapersgovernment_guidance

ai-securitysecurity-for-aidata-securitydata-provenancepoisoninggovernment-guidance

Sources2026-07-04

AI Cyber Challenge marks pivotal inflection point for cyber defense

sourceseedNewsnews

ai-securityai-for-securityvulnerability-discoverypatchingai-cyber-challenge

Sources2026-07-04

AI Controls Matrix

sourceseedWhitepapersframework

ai-securitysecurity-for-aicontrolsgovernancecloud-securityframework

Sources2026-07-04

AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways

sourceseedPaperspaper

ai-securitysecurity-for-aiagent-securitysurveythreat-modeling

Sources2026-07-04

Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges

sourceseedPaperspaper

ai-securitysecurity-for-aiagent-securitysurveyevaluationdefenses

Sources2026-07-04

Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM Agents

sourceseedPaperspaper

ai-securityagentsbenchmarkprompt-injectionmemory-poisoningdefenses
