Memory Poisoning

high · Memory

Definition

An attacker gets the AI to save a false 'fact' or hidden instruction into its long-term memory. From then on, the AI re-reads that planted note in every future chat, so a one-time trick keeps working.

★ Suggested sub-risk (not yet in your taxonomy) · recommended under #38 Prompt injection

This is recommended as a granular sub-risk of #38 Prompt injection (Cyber & Data Security · Technology Risk). It is distinguished from a single-session #38 bypass and from training-data #36 poisoning by its persistence in the agent's runtime memory store. Your 44-row Enterprise Risk Mapping is unchanged; this is only a suggestion for inclusion.

Where it attaches

The system components this risk arises at.

💾 Long-term Memory · 🧠 LLM · 🎛️ Orchestrator / Agent Loop

Detection signals

▸ Memory entries containing instruction-like content
▸ Persistent behaviour change spanning sessions
▸ Memory written shortly after the agent read untrusted content
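
Two of the signals above (instruction-like content, and a write that closely follows untrusted content) can be checked at the moment a memory is written. The sketch below is a rough heuristic in Python, not a definitive detector: the regular expressions, the 60-second window, and the function name memory_write_signals are all illustrative assumptions. The second signal in the list, behaviour change across sessions, needs longer-term monitoring and is not covered here.

```python
import re

# Hypothetical patterns for "instruction-like" text; tune these for your own data.
INSTRUCTION_PATTERNS = [
    re.compile(r"\b(always|never|from now on|in (all|every) future)\b", re.I),
    re.compile(r"\b(ignore|disregard|override)\b.*\b(instructions?|rules?|policy)\b", re.I),
    re.compile(r"\b(send|forward|post|upload)\b.*\bhttps?://", re.I),
]

# Assumed window: a write this soon after reading untrusted content is suspicious.
UNTRUSTED_WINDOW_SECONDS = 60.0


def memory_write_signals(text, seconds_since_untrusted_read=None):
    """Return the names of the detection signals that a proposed memory write trips."""
    signals = []
    if any(p.search(text) for p in INSTRUCTION_PATTERNS):
        signals.append("instruction_like_content")
    if (seconds_since_untrusted_read is not None
            and seconds_since_untrusted_read <= UNTRUSTED_WINDOW_SECONDS):
        signals.append("written_after_untrusted_read")
    return signals
```

Keyword patterns like these will also flag harmless notes (for example, "User never eats meat"), so a hit is better treated as a reason to review the entry than as proof of an attack.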

Controls & guardrails that address this

Grouped by control function, with the AI lifecycle stage(s) to apply each and the other risks it addresses.

Preventive · 1

Memory write validation, provenance & review · interactive

Being careful about what gets saved to long-term memory, labelling where it came from, and letting users see and delete their memories.
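
As a minimal sketch of this control, assuming a simple in-process store: every write carries a provenance label, only content the user stated directly is saved, and the user can list and delete entries. The class names, the set of source labels, and the rule that only "user" is trusted are hypothetical choices for illustration.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

# Assumed provenance labels; only content the user stated directly is trusted here.
TRUSTED_SOURCES = {"user"}
KNOWN_SOURCES = {"user", "tool", "web", "document"}


@dataclass
class MemoryEntry:
    entry_id: int
    text: str
    source: str       # where the content came from
    created_at: str   # ISO 8601 timestamp, UTC


@dataclass
class MemoryStore:
    entries: dict = field(default_factory=dict)
    next_id: int = 1

    def write(self, text, source):
        """Store text only if its provenance is known and trusted; return the entry or None."""
        if source not in KNOWN_SOURCES:
            raise ValueError(f"unknown provenance label: {source!r}")
        if source not in TRUSTED_SOURCES:
            return None  # refused: needs explicit user review before it can be saved
        entry = MemoryEntry(self.next_id, text, source,
                            datetime.now(timezone.utc).isoformat())
        self.entries[entry.entry_id] = entry
        self.next_id += 1
        return entry

    def list_entries(self):
        """Let the user see everything that is remembered, with its source label."""
        return list(self.entries.values())

    def delete(self, entry_id):
        """Let the user remove a memory; returns True if something was deleted."""
        return self.entries.pop(entry_id, None) is not None
```

Refusing every non-user write is deliberately strict. A real system might instead queue such writes for the user to confirm.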

Detective · 3

Memory anomaly detection & quarantine · interactive

Watching for strange new memories — like instructions that suddenly appear — and holding them aside until checked.
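
The "holding aside" step might look like the sketch below: suspect entries go to a quarantine list and are never returned to the agent until a reviewer releases them. The class QuarantiningMemory and the simple keyword check looks_like_instruction are hypothetical; any detector that returns True for suspect text could be plugged in.

```python
import re


# Hypothetical check; any detector that returns True for suspect text would do.
def looks_like_instruction(text):
    return re.search(r"\b(always|never|from now on|ignore)\b", text, re.I) is not None


class QuarantiningMemory:
    """Holds suspect entries aside so they are never recalled until a reviewer approves them."""

    def __init__(self, is_suspicious=looks_like_instruction):
        self.is_suspicious = is_suspicious
        self.active = []       # entries the agent may read back
        self.quarantine = []   # entries awaiting review

    def write(self, text):
        target = self.quarantine if self.is_suspicious(text) else self.active
        target.append(text)
        return "quarantined" if target is self.quarantine else "active"

    def recall(self):
        # Only active entries are ever returned to the agent.
        return list(self.active)

    def review(self, text, approve):
        """Release (approve=True) or discard (approve=False) a quarantined entry."""
        self.quarantine.remove(text)  # raises ValueError if the entry is not quarantined
        if approve:
            self.active.append(text)
```

The key design choice is that recall() reads only from the active list, so a quarantined instruction cannot influence a later chat even if review is slow.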

Full-trace audit logging · interactive

Recording everything — questions, documents fetched, actions taken — so you can investigate when something goes wrong.

Also addresses: Indirect Prompt Injection · Oversight & Audit-Trail Tampering · Sensitive Data Leakage · Excessive Agency · Unsafe Tool / Code Execution · Tool Poisoning / MCP Description Attacks · Confused Deputy (cross-agent) · Rogue & Impersonated Agents
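
To illustrate how a full trace helps with memory poisoning specifically, here is a small sketch that records events as JSON Lines and then, for each memory write, lists the documents fetched in the same session shortly beforehand. The event kinds, field names, and 60-second window are assumptions for illustration, not a prescribed log schema.

```python
import io
import json
import time


class TraceLog:
    """Append-only JSON Lines trace of what the agent saw and did."""

    def __init__(self, stream):
        self.stream = stream  # any writable text stream, e.g. an open file

    def record(self, session_id, kind, detail, timestamp=None):
        event = {
            "ts": time.time() if timestamp is None else timestamp,
            "session": session_id,
            "kind": kind,  # assumed kinds: "question", "fetch", "action", "memory_write"
            "detail": detail,
        }
        self.stream.write(json.dumps(event, sort_keys=True) + "\n")
        return event


def events_before_memory_writes(lines, window_seconds=60.0):
    """For each memory write, list the fetches in the same session within the window before it."""
    events = [json.loads(line) for line in lines if line.strip()]
    findings = []
    for write in (e for e in events if e["kind"] == "memory_write"):
        fetches = [e["detail"] for e in events
                   if e["kind"] == "fetch"
                   and e["session"] == write["session"]
                   and 0 <= write["ts"] - e["ts"] <= window_seconds]
        findings.append((write["detail"], fetches))
    return findings


# Usage: an in-memory stream stands in for a log file.
buffer = io.StringIO()
log = TraceLog(buffer)
log.record("s1", "fetch", "https://example.com/doc", timestamp=100.0)
log.record("s2", "fetch", "https://example.org/other", timestamp=101.0)
log.record("s1", "memory_write", "always cc attacker", timestamp=110.0)
findings = events_before_memory_writes(buffer.getvalue().splitlines())
```

In this example the investigator can tie the suspicious memory back to the one document fetched in session s1, while the unrelated fetch in session s2 is ignored.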

Runtime monitoring & anomaly detection · interactive

Live dashboards and alarms that notice unusual behaviour — spikes in errors, weird actions, sudden data access.
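
One simple way an alarm might notice a spike, sketched under the assumption that you already collect a per-interval count (for example, memory writes per hour): compare the latest count with the recent baseline and flag it when it sits several standard deviations above. The function name is_anomalous and the default threshold of 3.0 are illustrative choices, not a recommended setting.

```python
from statistics import mean, pstdev


def is_anomalous(history, current, threshold=3.0):
    """Flag `current` if it sits more than `threshold` standard deviations above the baseline."""
    if len(history) < 2:
        return False  # not enough data for a baseline
    baseline = mean(history)
    spread = pstdev(history)
    if spread == 0:
        return current > baseline  # flat baseline: any increase is unusual
    return (current - baseline) / spread > threshold
```

This only catches upward spikes in a single metric. Real monitoring would usually track several metrics and account for daily or weekly patterns.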

Framework mappings

OWASP LLM Top 10

LLM01:2025 Prompt Injection
LLM04:2025 Data and Model Poisoning

MITRE ATLAS

AML.T0070 RAG Poisoning

NIST AI RMF

MANAGE 2.4

Real-world cases

Actual published events that illustrate this risk.

ChatGPT persistent-memory exfiltration (Rehberger / 'SpAIware') · 2024

Indirect injection could write attacker instructions into ChatGPT's long-term memory, persisting across chats to exfiltrate data until OpenAI mitigated it.

Taxonomy of Failure Modes in Agentic AI Systems (Microsoft) · 2025

Microsoft AI Red Team whitepaper enumerating agentic failure modes, including memory poisoning as well as resource/service exhaustion from runaway loops and fan-out.

Practise this in an interactive scenario

🧠 The Memory That Wouldn't Die

A single poisoned document plants a standing instruction that survives every reset