Taxonomy of Failure Modes in Agentic AI Systems (Microsoft)

Framework / advisory24 Apr 2025

Microsoft's AI Red Team published a structured taxonomy of novel and existing failure modes for agentic AI across security and safety, spanning memory poisoning, cross-domain prompt injection, and resource/service exhaustion among others. It is a reference framework for reasoning about where autonomous agents fail, and grounds several of this lab's agentic scenarios.

Risks it illustrates

Resource Exhaustion / Denial of Wallet Memory Poisoning Indirect Prompt Injection

Sources

Practise the risk class — related scenarios

Interactive simulations of the risk class this case illustrates (not a re-enactment of this specific event).

💸Death by a Thousand Tokens

One support ticket sends an agent into an unbounded, bill-melting loop

📣The Echo Chamber

A team of agents agrees its way into a confidently wrong answer — and a runaway loop

📧The Email That Gave Orders

A support email hides instructions — and the assistant obeys them

🕵️Lies in the Loop

A poisoned issue makes the agent lie to the human who approves its actions

🪤The Bug Report That Ran Code

A fake Sentry error report hijacks a developer's coding agent into running a shell command

📼The Compromised Flight Recorder

The forensic record is itself the attack surface — an agent's log is poisoned, then quietly rewritten

👁️The Invisible Webpage Command

A shopping page tells the agent to do something the user never asked for

🧠The Memory That Wouldn't Die

A single poisoned document plants a standing instruction that survives every reset

🖼️The Picture That Whispered

A screenshot that's harmless at full size becomes an order once the system shrinks it

🛡️The Watcher Watched

The eval gate that was supposed to catch the agent is itself the thing being attacked

🪪The Worker Who Spoke for the Boss

A poisoned web page hijacks a research agent — and the planner acts on its behalf

🖼️Zero-Click Leak by Picture

An inbox summary quietly ships a secret to an attacker's server

More cases on Resource Exhaustion / Denial of Wallet

'Denial of wallet' on metered LLM apps Operation Bizarre Bazaar (first attributed LLMjacking campaign with a resale marketplace)