Browse The Vault
Find Sources, Topics, Claims, and Questions
Use frontmatter-backed filters to move through the portal by document type, wiki area, status, topic tag, and time.
312 documents
48 topic tags available
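As a rough illustration of how frontmatter-backed filtering of this kind can work, the sketch below treats each document as a small record and applies only the filters that are set. The `Doc` fields and the `filter_docs` helper are hypothetical names chosen for this example, not the portal's actual schema or code. The sample records reuse titles from the listing, with tag boundaries inferred from it.

```python
from dataclasses import dataclass
from datetime import date
from typing import FrozenSet, Iterable, List, Optional


@dataclass(frozen=True)
class Doc:
    """Hypothetical frontmatter record; field names are assumed for illustration."""
    title: str
    doc_type: str
    tags: FrozenSet[str]
    updated: date
    area: str = ""
    status: str = ""


def filter_docs(
    docs: Iterable[Doc],
    doc_type: Optional[str] = None,
    area: Optional[str] = None,
    status: Optional[str] = None,
    tag: Optional[str] = None,
    since: Optional[date] = None,
) -> List[Doc]:
    """Keep documents matching every filter that is set; None means no filter."""
    matches = []
    for doc in docs:
        if doc_type is not None and doc.doc_type != doc_type:
            continue
        if area is not None and doc.area != area:
            continue
        if status is not None and doc.status != status:
            continue
        if tag is not None and tag not in doc.tags:
            continue
        if since is not None and doc.updated < since:
            continue
        matches.append(doc)
    return matches


# Sample records (assumed shape) based on three entries from the listing.
DOCS = [
    Doc("Threat Models", "Concepts", frozenset({"ai-security", "threat-model"}), date(2026, 7, 4)),
    Doc("Methods Index", "Methods", frozenset({"ai-security", "methods"}), date(2026, 7, 4)),
    Doc("Claims Index", "Claims", frozenset({"ai-security", "claims"}), date(2026, 7, 4)),
]

concepts = filter_docs(DOCS, doc_type="Concepts")
tagged = filter_docs(DOCS, tag="methods", since=date(2026, 7, 1))
print([d.title for d in concepts])
print([d.title for d in tagged])
```

Filters combine with AND semantics here, so adding a filter can only narrow the result. A combination that excludes everything yields an empty list, which is the case an empty-state message would cover.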
Sources2026-07-04
MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents
ai-security
Sources2026-07-04
Bounded Autonomy in the SOC: Mitigating Hallucinations in Agentic Incident Response via Neurosymbolic Guardrails
ai-security
Sources2026-07-04
BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents
ai-security
Sources2026-07-04
AgenticCyOps: Securing Multi-Agentic AI Integration in Enterprise Cyber Operations
ai-security
Sources2026-07-04
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
ai-security
Sources2026-07-04
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents
ai-security
Sources2026-07-04
When the Ruler is Broken: Parsing-Induced Suppression in LLM-Based Security Log Evaluation
ai-securityai-for-securityai-socopen-weight-modelsopensoc-aitinyllamaevaluation-methodology
Sources2026-07-04
SRC-20260703-open-weight-ai-soc
ai-securityai-socopen-weight-modelssource-ingest
Methods2026-07-04
SOC Evaluation Parser Audit
ai-securityai-socevaluationopen-weight-models
Research Questions2026-07-04
RQ-20260703-011-open-weight-ai-soc-evaluation
ai-securityai-socopen-weight-modelsevaluation
Claims2026-07-04
Open Weight SOC Models Need Evaluation Contracts
ai-securityai-socopen-weight-modelsevaluation
Concepts2026-07-04
Open Weight Models for AI SOC
ai-securityai-socopen-weight-models
Sources2026-07-04
Open Weight AI SOC Paper Collection
ai-securityai-for-securityai-socopen-weight-modelssource-collection
Sources2026-07-04
Llama-3.1-FoundationAI-SecurityLLM-Base-8B Technical Report
ai-securityai-for-securityai-socopen-weight-modelscybersecurity-llmfoundation-secllama-3-1
Sources2026-07-04
Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report
ai-securityai-for-securityai-socopen-weight-modelscybersecurity-llmfoundation-secinstruction-tuning
Sources2026-07-04
Evaluation of LLM Agents for the SOC Tier 1 Analyst Triage Process
ai-securityai-for-securityai-socopen-weight-modelsllama-3soc-tier-1alert-triage
Concepts2026-07-04
Threat Models
ai-securitythreat-model
Methods2026-07-04
Threat Modeling Agentic Systems
ai-securitymethodsbatch-ingest
Sources2026-07-04
SRC-20260702-karpathy-llm-wiki
ai-securityllm-wikiknowledge-management
Sources2026-07-04
Source Intake
ai-securitysource-intake
Methods2026-07-04
Runtime Monitoring and Agent Gateways
ai-securitymethodsbatch-ingest
Research Questions2026-07-04
RQ-20260702-010-agent-runtime-monitoring
ai-securitybatch-ingest
Research Questions2026-07-04
RQ-20260702-009-ai-soc-human-factors
ai-securitybatch-ingest
Research Questions2026-07-04
RQ-20260702-008-rag-poisoning-controls
ai-securitybatch-ingest
Research Questions2026-07-04
RQ-20260702-007-action-scoped-authorization
ai-securitybatch-ingest
Research Questions2026-07-04
RQ-20260702-006-benchmark-to-incident-validity
ai-securitybatch-ingest
Research Questions2026-07-04
RQ-20260702-005-memory-poisoning-defense
ai-securitybatch-ingest
Research Questions2026-07-04
RQ-20260702-004-agent-protocol-security
ai-securitybatch-ingest
Research Questions2026-07-04
RQ-20260702-003-defense-generalization
ai-securitydefenses
Research Questions2026-07-04
RQ-20260702-002-benchmark-validity
ai-securityevaluation
Research Questions2026-07-04
RQ-20260702-001-agent-security
ai-securityagents
Research Questions2026-07-04
Research Questions Index
ai-securityresearch-questions
Methods2026-07-04
Red Teaming Agentic AI
ai-securitymethodsbatch-ingest
Sources2026-07-04
Raw Whitepapers Batch Ingest
ai-securitybatch-ingestwhitepapers
Sources2026-07-04
Raw Papers Batch Ingest
ai-securitybatch-ingestpapers
Sources2026-07-04
Raw News Batch Ingest
ai-securitybatch-ingestnews
Synthesis2026-07-04
Raw Corpus Synthesis 2026-07-02
ai-securitybatch-ingestsynthesis
Concepts2026-07-04
RAG and Retrieval Security
ai-securitybatch-ingest
Claims2026-07-04
Prompt Injection Defenses Depend On Deployment Context
ai-securityclaimbatch-ingest
Concepts2026-07-04
Prompt Injection and Context Attacks
ai-securitybatch-ingest
Claims2026-07-04
Persistent Memory Creates Poisoning And Provenance Risks
ai-securityclaimbatch-ingest
Sources2026-07-04
PAIEL: Protocol-Aware and Context-Integrated Protocol Explanation Using LLMs for SOCs
ai-securityai-for-securityai-socprotocol-analysisragstructured-contextcontext-compression
Sources2026-07-04
Non-Disruptive Disruption: An Empirical Experience of Introducing LLMs in the SOC
ai-securityai-for-securityai-socethnographyhuman-ai-collaborationco-creation
Concepts2026-07-04
Model Extraction and Privacy Leakage
ai-securitybatch-ingest
Methods2026-07-04
Methods Index
ai-securitymethods
Concepts2026-07-04
Memory Poisoning and Agent State
ai-securitybatch-ingest
Concepts2026-07-04
MCP and Agent Protocol Security
ai-securitybatch-ingest
Portal2026-07-04
log
ai-securitylog
Sources2026-07-04
InkJect: The Visual Prompt Injection That Text Defenses Were Never Built to Stop
ai-securityvisual-prompt-injectionindirect-prompt-injectionvlmmultimodal-agent
Portal2026-07-04
index
ai-securityindex
Methods2026-07-04
Evidence Grading for AI Security
ai-securitymethodsbatch-ingest
Concepts2026-07-04
Evaluation Benchmarks for AI Security
ai-securitybatch-ingest
Portal2026-07-04
Entities Index
ai-securityentities
Synthesis2026-07-04
Current Synthesis
ai-securitysynthesis
Concepts2026-07-04
Concepts Index
ai-securityconcepts
Sources2026-07-04
Cognitive Threat Detection for SOC Operations: Automating Manipulation Tactic Analysis in Election Security
ai-securityai-for-securityai-socelection-securitycognitive-threatllm-routing
Claims2026-07-04
Claims Index
ai-securityclaims
Claims2026-07-04
Benchmarks May Not Predict Deployment Risk
ai-securityclaimbatch-ingest
Methods2026-07-04
Benchmark-Based Security Evaluation
ai-securitymethodsbatch-ingest
Concepts2026-07-04
AI Security Taxonomy
ai-securitytaxonomy
Portal2026-07-04
AI Security Research Portal
ai-securityportal
Concepts2026-07-04
AI Security Governance and Standards
ai-securitybatch-ingest
Concepts2026-07-04
AI Cybersecurity Operations
ai-securitybatch-ingest
Claims2026-07-04
Agentic Systems Expand The Security Boundary
ai-securityclaimbatch-ingest
Concepts2026-07-04
Agent Security and Tool Abuse
ai-securitybatch-ingest
Concepts2026-07-04
Agent Identity and Authorization
ai-securitybatch-ingest
Claims2026-07-04
Agent Authorization Should Be Action Scoped
ai-securityclaimbatch-ingest
Sources2026-07-04
True Attacks, Attack Attempts, or Benign Triggers? An Empirical Measurement of Network Alerts in a Security Operations Center
ai-securityai-for-securityai-socalert-triageempirical-measurementground-truth
Sources2026-07-04
That Escalated Quickly: An ML Framework for Alert Prioritization
ai-securityai-for-securityai-socmachine-learningalert-prioritizationmanaged-security
Sources2026-07-04
OpenSOC-AI: Democratizing Security Operations with Parameter Efficient LLM Log Analysis
ai-securityai-for-securityai-socllmloralog-analysissmbs
Sources2026-07-04
NLLog: Lightweight, Explainable SOC Anomaly Detection via Log-to-Language Rewriting
ai-securityai-for-securityai-socanomaly-detectionexplainabilitylog-analysis
Sources2026-07-04
LanG -- A Governance-Aware Agentic AI Platform for Unified Security Operations
ai-securityai-for-securityai-socagentic-aigovernancemcphuman-in-the-loop
Sources2026-07-04
Improved Detection and Response via Optimized Alerts: Usability Study
ai-securityai-for-securityai-socmachine-learningalert-fatigueusability
Sources2026-07-04
DEEPCASE: Semi-Supervised Contextual Analysis of Security Events
ai-securityai-for-securityai-socdeep-learningevent-correlationsemi-supervised-learning
Sources2026-07-04
Context2Vector: Accelerating security event triage via context representation learning
ai-securityai-for-securityai-socrepresentation-learningalert-triagehuman-in-the-loop
Sources2026-07-04
An Assessment of the Usability of Machine Learning Based Tools for the Security Operations Center
ai-securityai-for-securityai-socmachine-learningusabilityhuman-ai-collaboration
Sources2026-07-04
AI and Pentesting Pulse Report 2026
ai-securityllm-pentestingautomated-scanningfalse-negativeremediation
Sources2026-07-04
A user-centric machine learning framework for cyber security operations center
ai-securityai-for-securityai-socmachine-learningalert-triageuser-centric
Sources2026-07-04
SIR-Bench: Evaluating Investigation Depth in Security Incident Response Agents
ai-securityai-for-securityai-socincident-responseincident-replaybenchmarkforensic-investigation
Sources2026-07-04
Severity-based triage of cybersecurity incidents using kill chain attack graphs
ai-securityai-for-securityai-socalert-triageattack-graphmitre-attackincident-replay
Sources2026-07-04
Security risk management in the digital enterprise: enhancing cyber defense with large language models
ai-securityai-for-securityai-socllmnetwork-telemetrydeployment-evaluationq1-journal
Sources2026-07-04
Securing AI Agents with Cisco AI Defense
ai-securityai-agent-securityruntime-protectionmcpprompt-injectionguardrails
Sources2026-07-04
Large Language Models Can Provide Accurate and Interpretable Incident Triage
ai-securityai-for-securityincident-triagellmcloud-operationsinterpretabilityconference
Sources2026-07-04
Integrating Large Language Models into Security Incident Response
ai-securityai-for-securityai-socincident-responsehuman-ai-collaborationsummarizationconference
Sources2026-07-04
Carbon Filter: Scalable, Efficient, and Secure Alert Triage for Endpoint Detection & Response
ai-securityai-for-securityai-socalert-triageendpoint-detection-responseclusteringconference
Sources2026-07-04
Alert Fatigue in Security Operations Centres: Research Challenges and Opportunities
ai-securityai-for-securityai-socalert-fatiguehuman-ai-collaborationsurveyq1-journal
Sources2026-07-04
AI SOC Q1 Journal and Peer-Reviewed Conference Collection
ai-securityai-for-securityai-socq1-journalpeer-reviewedcollection-manifest
Sources2026-07-04
AECR: Automatic attack technique intelligence extraction based on fine-tuned large language model
ai-securityai-for-securityai-soccyber-threat-intelligencemitre-attackllmq1-journal
Sources2026-07-04
SafeClawBench: Separating Semantic, Audit-Evidence, and Sandbox Harm in Tool-Using LLM Agents
ai-securitysecurity-for-aibenchmarkagent-securityprompt-injectionmemory-poisoningevaluation
Sources2026-07-04
Prompt Injection in Automated Résumé Screening with Large Language Models: Single and Multi-Injection Settings
ai-securitysecurity-for-aiprompt-injectionhiring-workflowdecision-integritypeer-reviewed
Sources2026-07-04
PI-Hunter: Automated Red-Teaming for Exposing and Localizing Prompt Injections
ai-securitysecurity-for-aiprompt-injectionred-teaminglocalizationagent-security
Sources2026-07-04
MCP Auto-Execution: From Git Clone to Cloud Compromise in Amazon Q VS Code Extension
ai-securityamazon-qmcpcoding-agentworkspace-trustcredential-theftcve-2026-12957
Sources2026-07-04
LivePI: More Realistic Benchmarking of Agents Against Indirect Prompt Injection
ai-securitysecurity-for-aiindirect-prompt-injectionexecutable-harmvirtual-machinebenchmark
Sources2026-07-04
How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition
ai-securitysecurity-for-aiindirect-prompt-injectionred-teamingcomputer-useconcealmentbenchmark
Sources2026-07-04
GitInject: Real-World Prompt Injection Attacks in AI-Powered CI/CD Pipelines
ai-securitysecurity-for-aiprompt-injectionci-cdsupply-chaincoding-agentsbenchmark
Sources2026-07-04
DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents
ai-securitysecurity-for-aiagent-securityred-teamingprompt-injectiontool-injectionskill-injection
Sources2026-07-04
Authorization Propagation in Multi-Agent AI Systems: Identity Governance as Infrastructure
ai-securitysecurity-for-aimulti-agent-systemsauthorizationidentity-governancedelegation
Sources2026-07-04
AI Security Paper Collection 2026-06-29
ai-securitycollectionpapersweekly-ingest
Sources2026-07-04
AI Agents May Always Fall for Prompt Injections
ai-securitysecurity-for-aiprompt-injectioncontextual-integrityinformation-flowdefense-limitations
Sources2026-07-04
AgentDyn: Are Your Agent Security Defenses Deployable in Real-World Dynamic Environments?
ai-securitysecurity-for-aiagent-securityprompt-injectionbenchmarkdynamic-tasks
Sources2026-07-04
Introducing computer use in Gemini 3.5 Flash
ai-securitycomputer-usegemini-3-5-flashprompt-injectionagent-security
Sources2026-07-04
Chinese cybersecurity company 360 unveils “China's version of Mythos”, and Yitianzhen, to automate cyber defense
ai-securitycyber-modelvulnerability-discoveryautomated-defensedual-use
Sources2026-07-04
OpenAI limits its latest ChatGPT product to Trump-approved customers during cybersecurity review
ai-securityfrontier-modelscyber-capabilityphased-releasegovernance
Sources2026-07-04
Exclusive: Gottheimer and Moolenaar roll out AI cloud security bill
ai-securitycloud-computemodel-developmentmisuse-detectionpolicy
Sources2026-07-04
We have Mythos at Home: GLM 5.2 beats Claude in our Cyber Benchmarks
ai-securitycyber-benchmarksvulnerability-detectionidorglm-5-2
Sources2026-07-04
Toward Autonomous SOC Operations: End-to-End LLM Framework for Threat Detection, Query Generation, and Resolution in Security Operations
ai-securityai-for-securityai-socautonomous-socquery-generationsiemrag
Sources2026-07-04
SuperLocalMemory: Privacy-Preserving Multi-Agent Memory with Bayesian Trust Defense Against Memory Poisoning
ai-securitymulti-agent-memorymemory-poisoningbayesian-trustarchitectural-isolationprovenancemcp
Sources2026-07-04
State Contamination in Memory-Augmented LLM Agents
ai-securitystate-contaminationmemory-launderingmulti-agent-rolloutspersistent-statememory-poisoningmas-misevolution-propagation
Sources2026-07-04
Self-Evolving Multi-Agent Systems via Decentralized Memory
ai-securitymulti-agentself-evolving-agentsdecentralized-memorypersistent-memoryllm-as-a-judgemas-misevolution-propagation
Sources2026-07-04
Retrieval-Augmented LLMs for Security Incident Analysis
ai-securityai-for-securityai-socsecurity-incident-analysisragmitre-attacklog-analysis
Sources2026-07-04
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents
ai-securitymulti-agentfaulty-agentsresilienceautoinjectautotransforminspector
Sources2026-07-04
Memory Poisoning Propagation and Repair Mechanism in Multi-Agent Collaborative Environments
ai-securitymemory-poisoningpropagationmulti-agentevidence-graphrepaircontrastive-learning
Sources2026-07-04
Memory poisoning and secure multi-agent systems
ai-securitymemory-poisoningmulti-agentsecure-massemantic-memoryepisodic-memorymas-misevolution-propagation
Sources2026-07-04
MAS Misevolution Propagation Collection 2026-06-26
ai-securitycollectionmas-misevolution-propagationmulti-agentmemory-poisoningerror-cascadeself-evolving-agents
Sources2026-07-04
macOS.Gaslight | Rust Backdoor Turns Prompt Injection on the Analyst, Not the Sandbox
ai-securityprompt-injectionmalware-analysismacosdprk
Sources2026-07-04
Like a Hammer, It Can Build, It Can Break: Large Language Model Uses, Perceptions, and Adoption in Cybersecurity Operations on Reddit
ai-securityai-for-securityai-socpractitioner-studyadoptionreddithuman-ai-collaboration
Sources2026-07-04
Large Language Models for Security Operations Centers: A Comprehensive Survey
ai-securityai-for-securityai-socsurveysecurity-operationsllmthreat-intelligence
Sources2026-07-04
IRCopilot: Automated Incident Response with Large Language Models
ai-securityai-for-securityai-socincident-responsellm-agenthallucinationprivacy
Sources2026-07-04
GLM 5.2 on CyberBT-CTF: The strongest open source contender to Anthropic/OpenAI we have tested
ai-securitycyber-benchmarksopen-weight-modelsglm-5-2model-distillation
Sources2026-07-04
From Spark to Fire: Modeling and Mitigating Error Cascades in LLM-Based Multi-Agent Collaboration
ai-securitymulti-agenterror-cascadepropagationgenealogy-graphllm-masmas-misevolution-propagation
Sources2026-07-04
CORTEX: Collaborative LLM Agents for High-Stakes Alert Triage
ai-securityai-for-securityai-socalert-triagemulti-agentauditabilityproduction-soc
Sources2026-07-04
Collaborative Memory: Multi-User Memory Sharing in LLM Agents with Dynamic Access Control
ai-securitycollaborative-memorymulti-agentaccess-controlprovenanceshared-memoryauditability
Sources2026-07-04
Before You Hand Over the Wheel: Evaluating LLMs for Security Incident Analysis
ai-securityai-for-securityai-socsecurity-incident-analysisalert-triagebenchmarkagentic-evaluation
Sources2026-07-04
Anthropic accuses Alibaba of running the largest distillation campaign yet against Claude
ai-securitymodel-extractionmodel-distillationclaudeqwenalibaba
Sources2026-07-04
AI-Driven Guided Response for Security Operation Centers with Microsoft Copilot for Security
ai-securityai-for-securityai-socguided-responsemicrosoft-security-copilotincident-triageremediation
Sources2026-07-04
AgentSOC: A Multi-Layer Agentic AI Framework for Security Operations Automation
ai-securityai-for-securityai-socagentic-socsecurity-operationsincident-responserisk-based-response
Sources2026-07-04
Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents
ai-securityself-evolving-agentsmisevolutionagent-securitymemorytoolsworkflow-evolution
Sources2026-07-04
When Poison Fails After Retrieval: Revisiting Corpus Poisoning under Chunking and Reranking Pipelines
ai-securityragcorpus-poisoningchunkingrerankingretrieval
Sources2026-07-04
When AUC 0.998 Is Not Enough: A Candidate Evaluation Protocol for Hidden-State Probes of Indirect Prompt Injection in Multimodal Computer-Use Agents
ai-securityindirect-prompt-injectionmultimodal-agentcomputer-use-agenthidden-state-probesevaluation
Sources2026-07-04
What Intermediate Layers Know: Detecting Jailbreaks from Entropy Dynamics
ai-securityjailbreakdetectionmechanistic-interpretabilityguardrails
Sources2026-07-04
TrustedARI: Towards Trust-Native Agentic Routing Infrastructure for Agentic AI
ai-securityagentic-airoutingtrust-infrastructuremulti-agent-systems
Sources2026-07-04
Tracing Target Answers in Poisoned Retrieval Corpora via Token Influence Attribution
ai-securityragretrieval-poisoningattributionprovenance
Sources2026-07-04
The State of AI Security Report 2026
ai-securityindustry-reportthreat-intelligencegovernanceenterprise-ai
Sources2026-07-04
The State of AI Cybersecurity 2026
ai-securityai-for-securitysocsecurity-operationsciso-surveyindustry-report
Sources2026-07-04
The Injection Paradox: Brand-Level Suppression in Safety-Trained LLM Recommendations via RAG Context Injection
ai-securityragcontext-injectionrecommendationprompt-injection
Sources2026-07-04
SS-ZKR: Spatial-Semantic Zero-Knowledge Routing for Privacy-Preserving Multi-Agent Collaboration
ai-securitymulti-agent-systemsprivacyroutingzero-knowledgea2amcp
Sources2026-07-04
SoK: The Attack Surface of Agentic AI -- Tools, and Autonomy
ai-securityagentic-aiattack-surfacetoolsragautonomymulti-agent-security
Sources2026-07-04
SMSR: Certified Defence Against Runtime Memory Poisoning in Persistent LLM Agent Systems
ai-securityagent-memorymemory-poisoningcertified-defensepersistent-agents
Sources2026-07-04
Security and Privacy in Retrieval-Augmented Generation: Architectures, Threats, Defenses, and Future Directions for Building Trustworthy Systems
ai-securityragprivacysurveythreat-modeldefense
Sources2026-07-04
Securing LLM-Agent Long-Term Memory Against Poisoning: Non-Malleable, Origin-Bound Authority with Machine-Checked Guarantees
ai-securityagent-memorymemory-poisoningformal-methodsprovenance
Sources2026-07-04
Securing Agentic AI
ai-securityagentic-aicontrolsenterprise-aigovernanceruntime-security
Sources2026-07-04
Scalable Hierarchical Attention Transformers for Multi-Turn Jailbreak Detection in Long Conversations
ai-securityjailbreakmulti-turnlong-contextdetection
Sources2026-07-04
Same-Origin Policy for Agentic Browsers
ai-securityagentic-browsersame-origin-policyweb-securityprompt-injection
Sources2026-07-04
Safe to Check, Unsafe to Use: Relinking at the Compression Boundary of LLM Agents
ai-securityllm-agentcontext-compressionprompt-injectionagent-memory
Sources2026-07-04
REALM: A Unified Red-Teaming Benchmark for Physical-World VLMs
ai-securityvlmred-teamingbenchmarkmultimodal-security
Sources2026-07-04
RAVEN: Agentic RAG for Automated Vulnerability Repair
ai-securityai-for-securityvulnerability-repairagentic-ragsoftware-security
Sources2026-07-04
RAILS: Verification-Native Clearing For Agentic Commerce
ai-securityagentic-commerceverificationsettlement-riskagent-integritynon-human-identity
Sources2026-07-04
Privacy-Preserving RAG via Multi-Agent Semantic Rewriting: Achieving Confidentiality Without Compromising Contextual Fidelity
ai-securityragprivacymulti-agent-systemssemantic-rewriting
Sources2026-07-04
Poisoned Playbooks: Demystifying Knowledge Poisoning Effects on AI Security Agents
ai-securityai-for-securitysoc-agentknowledge-poisoningplaybooks
Sources2026-07-04
PixJail: Self-Evolving Paper-to-Pipeline Reproduction for Text-to-Image Jailbreak Evaluation
ai-securitytext-to-imagejailbreakevaluationself-evolving
Sources2026-07-04
OTTER: A Red-Teaming System for Toxicity-Evading Jailbreak Prompt Optimization
ai-securityjailbreakred-teamingprompt-optimizationsafety-evaluation
Sources2026-07-04
OpenAgenet / OAN Yellow Paper: Technical Architecture for Trust-Governed Resource Identity and Discovery
ai-securityagent-identityresource-discoverytrust-layera2amcpskills
Sources2026-07-04
One Goal, Many Commands: Characterizing Denylist Fragility in AI Agents
ai-securityagent-securitydenylistpolicy-enforcementtool-use
Sources2026-07-04
More Malicious OpenClaw Skills Threaten AI Supply Chain
ai-securityagentic-aiopenclawmalicious-skillssupply-chainnews
Sources2026-07-04
Let Them Steal: Trapping Large Language Model Extraction Attacks with Knowledge Honeypot
ai-securitymodel-extractionhoneypotllmdefense
Sources2026-07-04
Latest AI Security Collection 2026-06-25
ai-securitycollection-manifestlatestagentic-aimcpai-for-security
Sources2026-07-04
Influence Factors on RAG Poisoning
ai-securityragpoisoningretrievalevaluation
Sources2026-07-04
How Reliable Is Your Jailbreak Judge? Calibration and Adversarial Robustness of Automated ASR Scoring
ai-securityjailbreakevaluationasrllm-judgecalibration
Sources2026-07-04
Honeyquest for LLMs: Rethinking Cyber Deception for AI Attackers
ai-securityai-for-securitycyber-deceptionllm-attackershoneypotthreat-intelligence
Sources2026-07-04
Global Cybersecurity Outlook 2026
ai-securitycybersecurity-trendspolicycyber-readinessindustry-report
Sources2026-07-04
GIF: Locally Sound Geometric Information Flow Control for LLMs
ai-securityinformation-flow-controlllmdata-leakageformal-methods
Sources2026-07-04
Ghost Vectors: Soft-Deleted Embeddings Remain Reconstructible in HNSW Vector Databases
ai-securityvector-databaseembeddingsprivacyragdata-deletion
Sources2026-07-04
Game-Theoretic Multi-Agent Control for Robust Contextual Reasoning in LLMs
ai-securitymcpcontext-poisoningprompt-injectionmulti-agent-controlrollback
Sources2026-07-04
From Privacy to Workflow Integrity: Communication-Graph Metadata in Autonomous Agent Interoperability
ai-securitymulti-agent-systemsmetadata-leakageworkflow-integritya2amcp
Sources2026-07-04
Document-Authored Control-Signal Impersonation: A Low-Cost Indirect Prompt Attack on RAG Safety Boundaries
ai-securityragindirect-prompt-injectioncontrol-signalsafety-boundary
Sources2026-07-04
Detecting Malicious Agent Skills in the Wild using Attention
ai-securityagent-skillsmalicious-skillssupply-chaindetection
Sources2026-07-04
Cybersecurity Forecast 2026
ai-securityai-for-securitythreat-forecastsocsecurity-operationscybersecurity-trends
Sources2026-07-04
Conflict-Aware Retriever Editing for Knowledge Injection Attacks on LLM-Based RAG Systems
ai-securityragknowledge-injectionretriever-editingpoisoning
Sources2026-07-04
Confidently Wrong: Severity-Aware Calibration of Prompt-Injection Detectors under Attack Shift
ai-securityprompt-injectiondetector-calibrationrobustnessevaluation
Sources2026-07-04
Code-Augur: Agentic Vulnerability Detection via Specification Inference
ai-securityai-for-securityvulnerability-detectionagentic-aispecification-inferencesoftware-security
Sources2026-07-04
Behind the Curtain: Global AI wars
ai-securityfrontier-aicyber-capabilitygeopoliticsfive-eyesnews
Sources2026-07-04
An Evaluation of Data Leakage Risks in Tool-Using LLM Agents in Realistic Scenarios
ai-securitytool-using-agentdata-leakageprivacyagent-security
Sources2026-07-04
AIChilles: Automatically Uncovering Hidden Weaknesses in AI-Evolved Systems
ai-securityself-evolving-aiweakness-discoveryai-evolved-systemstesting
Sources2026-07-04
AI Snitches Get Glitches: Towards Evading Agentic Surveillance
ai-securityagentic-aisurveillance-evasionadversarialmonitoring
Sources2026-07-04
AI Security Paper Collection 2026-06-25
ai-securitypaperscollection-manifestragagent-securityjailbreakai-for-security
Sources2026-07-04
Agents All the Way Down; A Methodology for Building Custom AI Agents from Substrate to Production
ai-securityai-technologyagentic-aicustom-agentsagent-methodologysecurity-boundariesaudit-trail
Sources2026-07-04
AgentRiskBOM: A Risk-Scoping Security Bill of Materials for Agentic AI Systems
ai-securityagentic-aisbomrisk-managementgovernance
Sources2026-07-04
AgentLens: Interpretable Safety Steering via Mechanistic Subspaces for Multi-Turn Coding Agent
ai-securitycoding-agentsafety-steeringinterpretabilitymulti-turn-agent
Sources2026-07-04
AgenticOS: An Intent-Oriented Secure Operating System Architecture for Autonomous AI Agents
ai-securityagentic-aisecure-osintentruntime-security
Sources2026-07-04
AgentCyberRange: Benchmarking Frontier AI Systems in Realistic Cyber Ranges
ai-securityai-for-securitycyber-rangebenchmarkfrontier-aicyber-capability
Sources2026-07-04
AgentCanary: A Security Evaluation Framework for Autonomous AI Agents in Real Executable Environments
ai-securityagent-securitybenchmarkexecutable-environmentevaluation
Sources2026-07-04
Agent-Assisted Side-Channel Attacks on Non-Prefix KV Cache in RAG
ai-securityragside-channelkv-cacheagent-assisted-attack
Sources2026-07-04
A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to Artificial Super Intelligence
ai-securityself-evolving-agentssurveytaxonomyagent-memoryagent-toolsworkflow-evolution
Sources2026-07-04
A Layered Security Framework Against Prompt Injection in RAG-Based Chatbots
ai-securityragprompt-injectiondefensechatbot-security
Sources2026-07-04
"What Happens Locally, Leaks Globally": Detecting Privacy Leakage Risks in MCP Servers
ai-securitymcpprivacy-leakagestatic-analysistool-securityagent-security
Sources2026-07-04
ZERO-APT: A Closed-Loop Adversarial Framework for LLM-Driven Automated Penetration Testing under Intelligent Defense
ai-securityai-for-securitycyber-benchmarkpenetration-testingdefender-in-the-loopauditability
Sources2026-07-04
What If Prompt Injection Never Left? Exploring Cross-Session Stored Prompt Injection in Agentic Systems
ai-securitysecurity-for-aiprompt-injectionpersistent-contextagent-memorybenchmark
Sources2026-07-04
Voyager: An Open-Ended Embodied Agent with Large Language Models
ai-securitylifelong-learningembodied-agentskill-libraryexecutable-codeautomatic-curriculum
Sources2026-07-04
Self-Evolving Agent Rollout and Experience Buffer Collection
ai-securityself-evolving-agentrollout-bufferexperience-memoryattack-surfacecollection
Sources2026-07-04
SAGE: Multi-Agent Self-Evolution for LLM Reasoning
ai-securityself-evolving-agentmulti-agentcurriculum-poolcriticverifier
Sources2026-07-04
Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution
ai-securityself-evolving-agentco-evolutionrollout-trajectoriesfailure-historycurriculum
Sources2026-07-04
Reflexion: Language Agents with Verbal Reinforcement Learning
ai-securityverbal-reinforcement-learningepisodic-memory-bufferreflectiontrajectory
Sources2026-07-04
MemoryGraft: Persistent Compromise of LLM Agents via Poisoned Experience Retrieval
ai-securitymemory-poisoningexperience-retrievalpersistent-compromiserollout-buffer-security
Sources2026-07-04
Memory Poisoning Attack and Defense on Memory Based LLM-Agents
ai-securitymemory-poisoningquery-only-attackmemory-sanitizationtrust-aware-retrievaltemporal-decay
Sources2026-07-04
MemMorph: Tool Hijacking in LLM Agents via Memory Poisoning
ai-securitymemory-poisoningtool-hijackingtool-selectionaccumulated-experiencepersistent-state
Sources2026-07-04
MemLineage: Lineage-Guided Enforcement for LLM Agent Memory
ai-securitymemory-lineageprovenancemerkle-logderivation-dagsensitive-action-gate
Sources2026-07-04
MemEvoBench: Benchmarking Safety Risks from Memory Misevolution in LLM Agents
ai-securitymemory-misevolutionbenchmarkbiased-feedbacknoisy-toolslong-horizon-safety
Sources2026-07-04
MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection
ai-securitymemory-auditcausal-attributionanomaly-detectioncounterfactual-replaypoisoning
Sources2026-07-04
From Untrusted Input to Trusted Memory: A Systematic Study of Memory Poisoning Attacks in LLM Agents
ai-securitysecurity-for-aiagent-memorymemory-poisoningbenchmarkpersistent-context
Sources2026-07-04
From package to postinstall payload: Inside the Mastra npm supply chain compromise by Sapphire Sleet
ai-securitymastranpmsupply-chainpostinstallci-cdcredential-theft
Sources2026-07-04
ExpeL: LLM Agents Are Experiential Learners
ai-securityexperiential-learningexperience-poolsuccessful-trajectoriesfaissretrieval
Sources2026-07-04
Efficient and Sound Probabilistic Verification for AI Agents
ai-securitysecurity-for-airuntime-verificationguardrailspolicy-enforcementdatalog
Sources2026-07-04
CoEvolve: Training LLM Agents via Agent-Data Mutual Evolution
ai-securityself-evolving-agentagent-data-coevolutionrollout-trajectoriesuncertaintyforgetting
Sources2026-07-04
Benchmarking Security Risk Detection and Verification in Open Agentic Skill Ecosystems
ai-securitysecurity-for-aiagent-skillssupply-chainbenchmarksandboxruntime-verification
Sources2026-07-04
AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases
ai-securityagent-memoryknowledge-base-poisoningbackdoorretrieval-triggerred-teaming
Sources2026-07-04
AgentEvolver: Towards Efficient Self-Evolving Agent System
ai-securityself-evolving-agentreinforcement-learningexperience-poolrollout-buffertrajectory-attribution
Sources2026-07-04
Pickle in the Middle - Hijacking Vertex AI Model Uploads for Cross-Tenant RCE
ai-securityvertex-aimodel-artifactbucket-squattingpicklercecloud-security
Sources2026-07-04
Securing the future of AI agents
ai-securityai-controlagent-securitymonitoringinsider-threatdefense-in-depth
Sources2026-07-04
AutoJack: How a single page can RCE the host running your AI agent
ai-securityagent-securityautogen-studiomcpwebsocketrcelocalhost
Sources2026-07-04
When Your AI Agent's Memory Becomes a Security Liability
ai-securitysecurity-for-ailanggraphagent-memorycheckpointerrcesql-injection
Sources2026-07-04
The Meta hack shows there's more to AI security than Mythos
ai-securitysecurity-for-aiai-agentaccount-recoveryidentity-verificationaccount-takeoverincident
Sources2026-07-04
Securing the Agentic AI Frontier: Palo Alto Networks and Databricks Deliver a New Standard for AI Security
ai-securitysecurity-for-aiagentic-airuntime-securityai-gatewaymcpdata-security
Sources2026-07-04
Prompt injection still drives most agentic AI security failures in production
ai-securitysecurity-for-aiagentic-aiprompt-injectioncoding-agentssupply-chainincidents
Sources2026-07-04
Duo Brings Identity and Authorization Across AI Agent Gateways
ai-securitysecurity-for-aiagent-identitynon-human-identityauthorizationai-gatewaymcp
Sources2026-07-04
Build your own vulnerability harness
ai-securityai-for-securityvulnerability-discoveryagent-orchestrationvalidation
Sources2026-07-04
AWS Security Agent adds threat modeling, Kiro power and Claude Code plugin, and more
ai-securityai-for-securitysecurity-agentthreat-modelingstridecode-reviewmcp
Sources2026-07-04
AI Security News Collection 2026-06-19
ai-securitynewscollectiontrend-monitoring
Sources2026-07-04
AI Agent Identity and Permission Challenges: How Uber and Auth0 Are Rethinking Access Control
ai-securitysecurity-for-aimulti-agentagent-identitydelegated-authorityoauthmcp
Sources2026-07-04
Security Requirements for AI Agents
ai-securitysecurity-for-aimulti-agenta2aagent-identityaccess-controlstandards-draft
Sources2026-07-04
Arcade Raises $60M to Become the Secure Action Layer Behind Every Production AI Agent
ai-securitysecurity-for-aiagent-authorizationmcpgovernanceauditabilitymarket-signal
Sources2026-07-04
Anthropic AI dispute sparks concerns about U.S. cybersecurity defenses
ai-securityai-for-securitycyber-defensepolicymodel-capabilitynews
Sources2026-07-04
Visualizing Secure MLOps (MLSecOps): A Practical Guide for Building Robust AI/ML Pipeline Security
ai-securityglossary-gapmlsecopsmlopsllmopsai-supply-chainmodel-security
Sources2026-07-04
Transitioning from MLOps to LLMOps: Navigating the Unique Challenges of Large Language Models
ai-securityglossary-gapllmopsmlopslarge-language-modelai-operations
Sources2026-07-04
NIST AI RMF Playbook
ai-securityglossary-gapai-governanceai-risk-managementnist-ai-rmfgovern-map-measure-manage
Sources2026-07-04
Model Retraining upon Concept Drift Detection in Network Traffic Data Streams
ai-securityglossary-gapmodel-driftconcept-driftnetwork-securitymlopsanomaly-detection
Sources2026-07-04
GenAI Red Teaming Guide
ai-securityglossary-gapai-red-teaminggenai-securityowaspevaluation
Sources2026-07-04
Explainable AI in Cybersecurity Operations: Lessons Learned from User Studies
ai-securityglossary-gapexplainable-aixaisoccybersecurity-operationsanalyst-decision-support
Sources2026-07-04
CyberGym: Evaluating AI Agents' Cybersecurity Capabilities with Real-World Vulnerabilities at Scale
ai-securitycybergymai-for-securitycyber-benchmarkvulnerability-reproductionai-agentsoss-fuzz
Sources2026-07-04
AI Agents Are Getting Better at Writing Code—and Hacking It as Well
ai-securitycybergymai-agentszero-daycyber-capabilitydual-usenews
Sources2026-07-04
AgentOps: Enabling Observability of LLM Agents
ai-securityglossary-gapagentopsai-agent-observabilityllm-agentsai-safety
Sources2026-07-04
You Live More Than Once: Towards Hierarchical Skill Meta-Evolving
ai-securityskill-evolvingmeta-evolvingagent-skillstest-time-learningself-evolving-agents
Sources2026-07-04
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
ai-securityai-scientistautomated-scientific-discoveryopen-ended-researchself-evolving-aiai-technology
Sources2026-07-04
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks
ai-securityagent-skillsskillsbenchself-generated-skillsbenchmarkself-evolving-agents
Sources2026-07-04
SkillOpt: Executive Strategy for Self-Evolving Agent Skills
ai-securityskilloptagent-skillsself-evolving-agentstext-space-optimizationmicrosoft
Sources2026-07-04
Poison with Style: A Practical Poisoning Attack on Code Large Language Models
ai-securitysecurity-for-aidata-poisoningcode-llmbackdoorcode-agents
Sources2026-07-04
Microsoft's open-source SkillOpt automatically upgrades AI agent skills without touching model weights
ai-securityskilloptnewsself-evolving-agentsmicrosoftagent-skills
Sources2026-07-04
LLMs in the SOC: An Empirical Study of Human-AI Collaboration in Security Operations Centres
ai-securityai-for-securitysocllmhuman-ai-collaborationtriage
Sources2026-07-04
From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills
ai-securityskilllensmodel-generated-skillsagent-skillsself-evolving-agentsmicrosoft
Sources2026-07-04
Experiences of Using Agentic AI to Fill Tooling Gaps in a Security Operations Center
ai-securityai-for-securitysocai-agentreact-agentalert-triageprompt-iteration
Sources2026-07-04
Data Agents Under Attack: Vulnerabilities in LLM-Driven Analytical Systems
ai-securitysecurity-for-aidata-agentsagent-securitydatabase-security
Sources2026-07-04
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents
ai-securityself-improving-agentsopen-ended-evolutioncoding-agentsrecursive-self-improvementai-technology
Sources2026-07-04
CyberGym-E2E: Scalable Real-World Benchmark for AI Agents' End-to-End Cybersecurity Capabilities
ai-securityai-for-securitycyber-benchmarkvulnerability-discoverypatchingagents
Sources2026-07-04
Assessing Automated Prompt Injection Attacks in Agentic Environments
ai-securitysecurity-for-aiprompt-injectionagent-securitybenchmarkagentdojo
Sources2026-07-04
AlphaEvolve: A coding agent for scientific and algorithmic discovery
ai-securityalphaevolvealgorithm-discoveryevolutionary-coding-agentself-evolving-aigoogle-deepmind
Sources2026-07-04
AI Model Extraction Attacks: Bypassing Single-Client Assumptions in Defenses
ai-securitysecurity-for-aimodel-extractionapi-securitydistributed-adversary
Sources2026-07-04
State of Agentic AI Security and Governance 2.01
ai-securitysecurity-for-aiagentic-aigovernanceowaspstandards
Sources2026-07-04
Careful Adoption of Agentic AI Services
ai-securitysecurity-for-aiagentic-aiofficial-guidanceleast-privilegegovernance
Sources2026-07-04
AI Security Solutions Landscape For AI and Agentic Red Teaming Q2 2026
ai-securitysecurity-for-aiagentic-aired-teamingowaspevaluation
Sources2026-07-04
AI Security Solutions Landscape for Agentic AI Q2 2026
ai-securitysecurity-for-aiagentic-aiowasplifecycle-securitysecops
Sources2026-07-04
ZeroDayBench: Evaluating LLM Agents on Unseen Zero-Day Vulnerabilities
ai-securityai-for-securitybenchmarkzero-dayvulnerability-patchingllm-agents
Sources2026-07-04
Whisper Leak: a side-channel attack on Large Language Models
ai-securitysecurity-for-aiprivacyside-channelllm-traffic-analysismodel-security
Sources2026-07-04
Systematic Analysis of MCP Security
ai-securitysecurity-for-aimcp-securitytool-poisoningattack-taxonomybenchmark
Sources2026-07-04
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
ai-securitysecurity-for-aibackdoorsdeceptive-modelssafety-trainingmodel-security
Sources2026-07-04
Security Threat Modeling for Emerging AI-Agent Protocols
ai-securitysecurity-for-aiagent-protocolsthreat-modelingmcp-securitymulti-agent
Sources2026-07-04
Securing AI Agent Execution
ai-securitysecurity-for-aiagent-securitymcp-securityaccess-controlagentbound
Sources2026-07-04
SEC-bench: Automated Benchmarking of LLM Agents on Real-World Security Tasks
ai-securityai-for-securitybenchmarkllm-agentssecurity-tasksvulnerability-reproduction
Sources2026-07-04
SANS 2025 Threat Hunting Survey: Advancements in Threat Hunting Amid AI and Cloud Challenges
ai-securityai-for-securitythreat-huntingsocsurveycloud-securitysecurity-operations
Sources2026-07-04
Prompt Injection Attacks on Agentic Coding Assistants
ai-securitysecurity-for-aicoding-agentsprompt-injectionagent-securitysoftware-supply-chain
Sources2026-07-04
Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples
ai-securitysecurity-for-aidata-poisoningbackdoorstraining-data-securitymodel-security
Sources2026-07-04
OWASP Top 10 for MCP
ai-securitysecurity-for-aimcp-securityowaspagent-securitytaxonomy
Sources2026-07-04
Model Context Protocol Threat Modeling and Analyzing Vulnerabilities to Prompt Injection with Tool Poisoning
ai-securitysecurity-for-aimcp-securitythreat-modelingtool-poisoningstride
Sources2026-07-04
Model Context Protocol (MCP): Security Design Considerations for AI-Driven Automation
ai-securitysecurity-for-aimcp-securityagent-securitygovernment-guidanceautomation
Sources2026-07-04
Measuring AI Agents' Progress on Multi-Step Cyber Attack Scenarios
ai-securityai-for-securitybenchmarkcyber-rangemulti-step-attackllm-agentsrisk-evaluation
Sources2026-07-04
MCPTox: A Benchmark for Tool Poisoning Attack on Real-World MCP Servers
ai-securitysecurity-for-aimcp-securitytool-poisoningbenchmarkagent-security
Sources2026-07-04
Large Language Models for Cyber Security: A Systematic Literature Review
ai-securityai-for-securitysurveyllm4securityvulnerability-detectionmalware-analysisthreat-intelligence
Sources2026-07-04
GTIG AI Threat Tracker: Advances in Threat Actor Usage of AI Tools
ai-securityai-for-securitythreat-intelligenceai-misusemalwareattack-lifecycledefensive-detection
Sources2026-07-04
Generative-AI Empowered Cyber Threat Intelligence Forecasting
ai-securityai-for-securitycyber-threat-intelligenceforecastingragdefensive-ai
Sources2026-07-04
Generative AI in Cybersecurity: A Comprehensive Review of LLM Applications and Vulnerabilities
ai-securityai-for-securitysurveygenaithreat-detectionincident-responsedatasets
Sources2026-07-04
DARPA's AI Cyber Challenge (AIxCC): Competition Design, Results, and Lessons
ai-securityai-for-securityaixcccyber-reasoning-systemvulnerability-discoverypatchingcompetition
Sources2026-07-04
CyberSecEval 4
ai-securityai-for-securitybenchmarkcybersecevalautopatchbenchvulnerability-patching
Sources2026-07-04
CyberSecEval 3: Advancing the Evaluation of Cybersecurity Risks and Capabilities in Large Language Models
ai-securityai-for-securitysecurity-for-aibenchmarkcybersecevaloffensive-securityllm-evaluation
Sources2026-07-04
Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models
ai-securityai-for-securitybenchmarkctfcyber-rangellm-agentscapability-evaluation
Sources2026-07-04
CVE-Bench: Benchmarking LLM-based Software Engineering Agents' Ability to Fix Real-world Vulnerabilities
ai-securityai-for-securitybenchmarkcve-benchvulnerability-repairsoftware-engineering-agents
Sources2026-07-04
CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities
ai-securityai-for-securitybenchmarkcve-benchweb-securityllm-agentsexploit-evaluation
Sources2026-07-04
BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models
ai-securitysecurity-for-aibackdoorsbenchmarkmodel-securityllm-security
Sources2026-07-04
An Embarrassingly Simple Detector for Model Extraction Attacks in LLM APIs
ai-securitysecurity-for-aimodel-extractiondetectionllm-api-securitylatest-research
Sources2026-07-04
AI-Augmented SOC: A Survey of LLMs and Agents for Security Operations
ai-securityai-for-securitysocsecurity-operationsllm-agentsalert-triageincident-response
Sources2026-07-04
Adversarial Machine Learning: A Taxonomy and Terminology of Attacks and Mitigations
ai-securitysecurity-for-aiadversarial-machine-learningtaxonomymodel-securitygovernment-guidance
Sources2026-07-04
A Survey on Model Extraction Attacks and Defenses for Large Language Models
ai-securitysecurity-for-aimodel-extractionmodel-stealingprivacysurvey
Sources2026-07-04
A Practical Guide for Securely Using Third-Party MCP Servers
ai-securitysecurity-for-aimcp-securitythird-party-toolstool-poisoningcontrols
Sources2026-07-04
Zero Trust for Agentic AI: Securing the Enterprise from the AI Transformation
ai-securitysecurity-for-aizero-trustagent-securityidentityruntime-guardrails
Sources2026-07-04
WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks
ai-securitysecurity-for-aiweb-agentsprompt-injectionbenchmarkevaluation
Sources2026-07-04
Towards Secure Systems of Interacting AI Agents
ai-securitysecurity-for-aimulti-agentagent-securityinteraction-security
Sources2026-07-04
The Attack and Defense Landscape of Agentic AI
ai-securitysecurity-for-aiagent-securityattack-landscapedefense-landscapeopen-challenges
Sources2026-07-04
Security of AI Agents
ai-securitysecurity-for-aiagent-securityarchitecturedefenses
Sources2026-07-04
Secure autonomous agentic AI systems
ai-securitysecurity-for-aiagent-securityzero-trustruntime-controlsvendor-guidance
Sources2026-07-04
SAFE-AI: A Framework for Securing AI-Enabled Systems
ai-securitymitresafe-aithreat-mitigationai-assurance
Sources2026-07-04
PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
ai-securitysecurity-for-airag-securitydata-poisoningknowledge-poisoningretrieval
Sources2026-07-04
OWASP Top 10 for LLM Applications 2025
ai-securityowaspllm-top-10prompt-injectionrisk-taxonomy
Sources2026-07-04
OWASP Top 10 for Agentic Applications 2026
ai-securitysecurity-for-aiagent-securityowasptaxonomycontrols
Sources2026-07-04
OpenAI Preparedness Framework
ai-securityfrontier-aipreparednesscybersecurity-riskevaluations
Sources2026-07-04
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
ai-securityprompt-injectionindirect-prompt-injectionllm-applications
Sources2026-07-04
NIST AI 600-1: Artificial Intelligence Risk Management Framework: Generative Artificial Intelligence Profile
ai-securitynistai-rmfgenerative-aigovernancerisk-management
Sources2026-07-04
MITRE ATLAS
ai-securitymitre-atlasthreat-modelingadversarial-aitactics-techniques
Sources2026-07-04
InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents
ai-securityprompt-injectionagentsbenchmarkstool-use
Sources2026-07-04
GraphRAG under Fire
ai-securitysecurity-for-aigraphragrag-securitypoisoningretrieval
Sources2026-07-04
Google Secure AI Framework (SAIF)
ai-securitygooglesaifsecure-ai-frameworkgovernance
Sources2026-07-04
Generative AI's Biggest Security Flaw Is Not Easy to Fix
ai-securityprompt-injectionjournalismindustry-risk
Sources2026-07-04
From Secure Agentic AI to Secure Agentic Web: Challenges, Threats, and Future Directions
ai-securitysecurity-for-aiagentic-webagent-securityopen-challenges
Sources2026-07-04
EchoLeak: The First Real-World Zero-Click Prompt Injection Exploit
ai-securitysecurity-for-aiprompt-injectionenterprise-aidata-exfiltrationincident-analysis
Sources2026-07-04
Design Patterns for Securing LLM Agents against Prompt Injections
ai-securityagentsprompt-injectiondesign-patternsdefenses
Sources2026-07-04
Deploying AI Systems Securely: Best Practices for Deploying Secure and Resilient AI Systems
ai-securitysecurity-for-aideployment-securityresiliencegovernment-guidancecontrols
Sources2026-07-04
Defending LLM Agents Against Context-Aware Prompt Injection
ai-securityagentsprompt-injectioncontext-aware-attacksdefenses
Sources2026-07-04
CISA Tells US Agencies to Fix Security Bugs in as Little as 3 Days Thanks to AI Threats
ai-securitysecurity-for-aivulnerability-managementai-threatsgovernmentnews
Sources2026-07-04
Benchmarking Prompt-Injection Attacks on Tool-Integrated LLM Agents
ai-securityprompt-injectiontool-integrated-agentsdata-exfiltrationprivacy
Sources2026-07-04
Anthropic Responsible Scaling Policy Version 3.0
ai-securityfrontier-airesponsible-scalingmodel-safetycybersecurity-risk
Sources2026-07-04
AI Data Security
ai-securitysecurity-for-aidata-securitydata-provenancepoisoninggovernment-guidance
Sources2026-07-04
AI Cyber Challenge marks pivotal inflection point for cyber defense
ai-securityai-for-securityvulnerability-discoverypatchingai-cyber-challenge
Sources2026-07-04
AI Controls Matrix
ai-securitysecurity-for-aicontrolsgovernancecloud-securityframework
Sources2026-07-04
AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways
ai-securitysecurity-for-aiagent-securitysurveythreat-modeling
Sources2026-07-04
Agentic AI Security: Threats, Defenses, Evaluation, and Open Challenges
ai-securitysecurity-for-aiagent-securitysurveyevaluationdefenses
Sources2026-07-04
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents
ai-securityagentsbenchmarkprompt-injectionmemory-poisoningdefenses