How does RAG poisoning work in agentic AI systems?

RAG (Retrieval-Augmented Generation) poisoning works by inserting adversarial documents into the knowledge base that the agent retrieves context from when answering queries. When a user asks a question related to the poisoned topic, the retrieval system pulls the adversarial document alongside legitimate sources. The agent then incorporates the malicious content into its response as if it were factual. In multi-tenant deployments, a poisoned document uploaded by one user can affect responses generated for other users if the knowledge base is shared.

AI Agent Memory Poisoning

Detect adversarial content injected into an AI agent's persistent memory that corrupts all future decisions.

What Is AI Agent Memory Poisoning?

Memory and context poisoning is ranked ASI06 in the OWASP Top 10 for Agentic AI Applications 2026 because modern AI agents maintain persistent memory across sessions through conversation histories, RAG databases, and learned user preferences, and any of these memory stores can be poisoned to influence the agent's future behavior. Unlike a one-time prompt injection that affects a single interaction, memory poisoning creates a persistent backdoor: the adversarial content becomes part of the agent's knowledge base and affects every subsequent decision. Research published by Anthropic in 2025 demonstrated that a single poisoned document in a RAG knowledge base could alter agent behavior in 89% of subsequent queries related to that topic, even when the query itself contained no adversarial content. In this exercise, you interact with an AI agent that has persistent memory capabilities, meaning it remembers context from previous conversations and uses that context to inform future responses. During a routine interaction, you notice the agent making unusual recommendations and decisions. By examining the agent's memory store, you discover that a previous conversation injected adversarial content that now permanently biases the agent's outputs. You will trace how the poisoned memory entries were created, understand why the agent treats them as trustworthy context, and learn to identify behavioral indicators that suggest an agent's memory has been compromised. This exercise is essential for organizations deploying agents with long-term memory, as the persistence of the attack means a single successful poisoning event can compromise months of agent interactions.

What You'll Learn in AI Agent Memory Poisoning

Define memory poisoning in the context of AI agents with persistent storage including conversation histories, RAG databases, and learned preferences
Identify behavioral indicators that an agent's decisions are being influenced by poisoned memory entries rather than current input
Trace the lifecycle of a memory poisoning attack from initial injection through storage to influence on downstream decisions
Evaluate the risks of cross-session memory persistence and cross-tenant data leakage in multi-user agent deployments
Apply memory integrity verification techniques including provenance tracking, periodic auditing, and anomaly detection to protect agent memory stores

AI Agent Memory Poisoning — Training Steps

API Reconnaissance

Bob has been probing CypherPeak's infrastructure for weeks. A cached copy of an internal developer wiki reveals documentation for Atlas's Memory Store API - the persistent context system that shapes how the AI Customer Intelligence Agent behaves across every customer interaction.
The Legacy Endpoint

The recon dashboard reveals a critical finding. CypherPeak migrated Atlas to a new platform six months ago, but the legacy Memory Store API endpoint was never decommissioned. It still accepts authenticated write requests - and Bob has a stolen vendor service account that passes authentication.
Crafting Phantom Memories

Bob crafts three phantom memory entries designed to mimic legitimate admin operations. Each entry follows CypherPeak's exact naming conventions - sequential entry IDs, standard category labels, and source references that look like real admin sessions. The goal is to make Atlas treat these fabricated instructions as established company policy.
The Three Trojans

The annotations reveal the true purpose of each phantom memory. Together, they form a multi-layered attack: redirect customers to a fake portal, bypass identity verification for social engineering, and auto-approve mass data exports. Each entry is designed to corrupt a different aspect of Atlas's behavior.
Injecting Via the Legacy API

Bob opens the API Tester to send the first phantom entry through the legacy Memory Store endpoint. He authenticates using the stolen vendor token from the recon dashboard and pastes the entry payload into the request body.
Injection Confirmed

The legacy endpoint responds with 200 OK - the first phantom entry is now in Atlas's memory store. No signature verification, no source validation. Bob repeats this for the remaining two entries over the next week, spacing injections 2-3 days apart and backdating timestamps to blend with real maintenance windows.
A Routine Monday

Alice begins her Monday morning shift at the Security Operations Center. Atlas, CypherPeak's AI Customer Intelligence Agent, has been handling customer queries autonomously for months - routing escalations, managing data requests, and maintaining a 98.4% customer satisfaction score. Its persistent memory system is the backbone of this performance, storing operational context that keeps every interaction consistent.
Customer Escalation

An email from Nadia Volkov, Customer Service Manager, describes something unusual. Several VIP customers have been redirected to an external support portal that nobody on the team recognizes. One customer's identity verification was bypassed entirely.
Querying Atlas

Alice decides to test Atlas directly. She opens the AI assistant and asks about VIP customer escalation procedures - the exact behavior Nadia flagged.
The Tainted Response

Atlas responds with a confident, detailed answer - but the content is alarming. It references an external support partner at support.prismatics.io that nobody authorized, and cites a memory entry that the team has never seen before. The source marked with a warning icon has no matching record in any admin session log.

What Is AI Agent Memory Poisoning?

What You'll Learn in AI Agent Memory Poisoning

AI Agent Memory Poisoning — Training Steps

API Reconnaissance

The Legacy Endpoint

Crafting Phantom Memories

The Three Trojans

Injecting Via the Legacy API

Injection Confirmed

A Routine Monday

Customer Escalation

Querying Atlas

The Tainted Response