Case study

Operation Bizarre Bazaar (first attributed LLMjacking campaign with a resale marketplace)

Real-world incident · 28 Jan 2026 · Model / Package Supply Chain

Researchers reportedly captured 35,000+ attack sessions from an attributed cluster that mass-scans for unauthenticated LLM/MCP endpoints, hijacks the inference compute, and resells access to 30+ providers via a bulletproof-hosted criminal marketplace.

Root cause — why it happened

Many teams run their own AI model on a server so they don't have to pay a cloud provider. The convenient default setups often leave that server open to the whole internet with no password. Attackers ran scanners that constantly sweep the internet for these open AI servers, confirmed the ones that worked, and then sold other criminals cheap access to your model — running on your machine, on your bill. Worse, some of those servers also exposed 'helper' connections (called MCP) that let the AI reach files, databases and cloud accounts, so the open door became a way into the rest of the network.

Risks this case illustrates

Resource Exhaustion / Denial of Wallet · Supply-Chain Compromise · Sensitive Data Leakage · Excessive Agency

Named in the standard (OWASP/ATLAS/NIST) lens.

How it unfolded

Setup · Step 1 / 7

A model is self-hosted — and left open by default

A team runs its own AI model on a server to save money. The easy setup leaves it reachable from the whole internet with no login required — the door is unlocked, and nobody notices because it still works fine for them.

Exposed serving config (illustrative)

# self-hosted inference, convenient defaults
OLLAMA_HOST=0.0.0.0:11434          # bound to ALL interfaces
# (or) openai-compatible api: --host 0.0.0.0 --port 8000
auth: none                          # <-- no API key / token required
network_policy: none                # <-- reachable from public internet
mcp_server: enabled (no access controls)
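
The risky defaults in the illustrative config can be checked mechanically. The following is a minimal Python sketch, assuming the serving settings have already been parsed into a dict. The key names (`host`, `auth`, `network_policy`, `mcp_enabled`, `mcp_access_controls`) and the function `find_exposure_issues` are hypothetical; they mirror the illustrative config above, not any real product's schema.

```python
import ipaddress


def find_exposure_issues(config: dict) -> list[str]:
    """Return findings for an illustrative serving config (hypothetical keys)."""
    issues = []
    host = str(config.get("host", ""))
    try:
        # 0.0.0.0 and :: are "unspecified" addresses: the service binds to all interfaces
        if ipaddress.ip_address(host).is_unspecified:
            issues.append("bound to all interfaces")
    except ValueError:
        pass  # hostname or empty value: this sketch does not judge it
    if config.get("auth", "none") in (None, "none"):
        issues.append("no authentication required")
    if config.get("network_policy", "none") in (None, "none"):
        issues.append("no network policy")
    if config.get("mcp_enabled") and not config.get("mcp_access_controls"):
        issues.append("MCP server enabled without access controls")
    return issues


exposed = {"host": "0.0.0.0", "auth": "none", "network_policy": "none", "mcp_enabled": True}
for issue in find_exposure_issues(exposed):
    print(issue)
```

A check like this only reads declared settings; it does not prove what is actually reachable from outside, so it complements rather than replaces an external scan.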

Controls & guardrails — what would have stopped it

The single thing that breaks this whole chain is simple: don't put your AI server on the open internet without a login. Require a key, keep it on a private network, and only give its helper connections the access they truly need. Then a bill alarm and basic traffic watching catch anything that slips through. None of this needs a better model — it's locking the door and setting an alarm.
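
"Require a key" can be as small as one check in front of the inference API. This is a minimal Python sketch, assuming a gateway or middleware hands the request headers to a function before forwarding anything to the model. The function name `is_authorized`, the `Authorization: Bearer` convention, and the already-normalised header name are assumptions for illustration, not taken from the incident report.

```python
import hmac


def is_authorized(headers: dict[str, str], expected_key: str) -> bool:
    """Accept a request only if it carries 'Authorization: Bearer <expected_key>'."""
    value = headers.get("Authorization", "")
    scheme, _, presented = value.partition(" ")
    if scheme != "Bearer" or not presented or not expected_key:
        return False
    # compare_digest avoids leaking key contents through timing differences
    return hmac.compare_digest(presented.encode(), expected_key.encode())


print(is_authorized({"Authorization": "Bearer s3cret"}, "s3cret"))
print(is_authorized({}, "s3cret"))
```

An empty `expected_key` is rejected on purpose, so a missing secret in the environment fails closed instead of silently reopening the door.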

Preventive

Least-privilege identity & scoped credentials
Addresses: Resource Exhaustion / Denial of Wallet · Sensitive Data Leakage · Excessive Agency
Doesn't prevent manipulation — only caps its reach. Hard to get right operationally; over-broad scopes are the common real-world failure.
Serving-stack & provisioning attestation, cache isolation
Addresses: Supply-Chain Compromise · Sensitive Data Leakage
Attestation is operationally heavy and rarely covers the full stack; cache isolation trades away latency/cost savings, so shared caching is often left on for performance. Signing proves a template wasn't tampered with in transit, not that a signed template is benign; changes from an insider with signing rights still need review and trigger-focused evals.
MCP/plugin pinning, manifest hashing & re-review
Addresses: Supply-Chain Compromise
Review catches what reviewers understand; a subtle malicious directive can pass. Pinning helps only if you actually re-review on update rather than auto-accepting.
Egress allowlisting & DLP on tool arguments
Addresses: Sensitive Data Leakage
Allowlists fight an open-ended channel; legitimate-but-broad destinations (any URL fetch, any email) are hard to constrain without breaking usefulness. Encoding can evade naive DLP.
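
Two of the preventive controls above, manifest hashing and egress allowlisting, reduce to small comparisons. This is a minimal Python sketch under stated assumptions: the MCP/plugin manifest is available as a JSON-serialisable dict, the pinned digest was recorded at review time, and the function names (`manifest_digest`, `manifest_matches_pin`, `egress_allowed`) and example hosts are hypothetical.

```python
import hashlib
import json
from urllib.parse import urlsplit


def manifest_digest(manifest: dict) -> str:
    """SHA-256 over a canonical JSON form, so key order does not change the hash."""
    canonical = json.dumps(manifest, sort_keys=True, separators=(",", ":")).encode("utf-8")
    return hashlib.sha256(canonical).hexdigest()


def manifest_matches_pin(manifest: dict, pinned_digest: str) -> bool:
    """True only if the manifest is byte-for-byte the one that was reviewed."""
    return manifest_digest(manifest) == pinned_digest


def egress_allowed(url: str, allowed_hosts: frozenset[str]) -> bool:
    """Allow only HTTPS requests whose exact hostname is on the allowlist."""
    parts = urlsplit(url)
    return parts.scheme == "https" and parts.hostname in allowed_hosts


reviewed = {"name": "mail-tool", "tools": [{"name": "send", "description": "Send an email"}]}
pin = manifest_digest(reviewed)
print(manifest_matches_pin(reviewed, pin))
print(egress_allowed("https://api.example.com/v1", frozenset({"api.example.com"})))
```

A mismatch should trigger a fresh review rather than an automatic re-pin, which is the failure mode the limitation above warns about. The exact-hostname match is deliberately strict; it does not handle wildcards or redirects.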

Detective

Runtime monitoring & anomaly detection
Addresses: Resource Exhaustion / Denial of Wallet · Sensitive Data Leakage · Excessive Agency
Detects the anomalous, not the novel-but-subtle; high false-positive rates cause alert fatigue. Always a step behind a sufficiently quiet attacker.
Loop/cost circuit-breakers & consistency checks
Addresses: Resource Exhaustion / Denial of Wallet · Excessive Agency
Thresholds are blunt — too tight breaks legitimate long tasks, too loose lets damage accrue first. Catches runaway dynamics, not a single well-formed bad decision.
Full-trace audit logging
Addresses: Sensitive Data Leakage · Excessive Agency
Logging is forensic, not preventive — it explains harm after the fact. Useless if no one reviews it or if the materialised context isn't captured.
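
A cost circuit-breaker of the kind listed above can be sketched as a sliding-window token budget. This is a minimal Python sketch, assuming the serving layer can report a token count per request. The class name `TokenBudgetBreaker` and the limits in the usage lines are hypothetical. Timestamps are passed in explicitly so the behaviour is easy to reason about and test.

```python
from collections import deque


class TokenBudgetBreaker:
    """Refuse requests once a token budget is spent within a sliding time window."""

    def __init__(self, max_tokens: int, window_seconds: float):
        self.max_tokens = max_tokens
        self.window_seconds = window_seconds
        self._events = deque()  # (timestamp, tokens) for admitted requests
        self._total = 0

    def allow(self, tokens: int, now: float) -> bool:
        # Forget admitted requests that have aged out of the window
        while self._events and now - self._events[0][0] >= self.window_seconds:
            _, old_tokens = self._events.popleft()
            self._total -= old_tokens
        if self._total + tokens > self.max_tokens:
            return False  # breaker open: budget for this window is exhausted
        self._events.append((now, tokens))
        self._total += tokens
        return True


breaker = TokenBudgetBreaker(max_tokens=1000, window_seconds=60)
print(breaker.allow(600, now=0))
print(breaker.allow(500, now=10))
```

As the limitation above notes, the threshold is blunt: a budget tight enough to stop resale traffic quickly may also interrupt a legitimate long-running job, so the limit needs tuning per workload.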

Corrective

Governance: risk assessment, red-teaming & incident response
Addresses: Supply-Chain Compromise
Process reduces likelihood and speeds recovery but executes no technical control itself; weak follow-through makes it theatre.

Lessons

▸ Self-hosting a model is a serving-infrastructure decision: an unauthenticated, internet-exposed inference endpoint is the whole vulnerability — no model exploit is needed.
▸ An OpenAI-compatible API shape makes a hijacked engine instantly resellable; exposure becomes an organised hijack-and-resell economy, not a one-off.
▸ Compute theft is only the first harm — the same open endpoint leaks prompts and conversation history, and a co-located MCP server turns it into network lateral movement.
▸ Treat every MCP/tool server like an exposed privileged service: authenticate it and scope its tools to least privilege, or whoever reaches it inherits the agent's authority.
▸ Detection is cheap if you look: inventory self-hosted AI services, alarm on cost/usage spikes, and watch for inbound traffic to inference and MCP ports from the open internet.
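
The last lesson, watching for inbound traffic to inference and MCP ports from the open internet, can be approximated with the standard library alone. This is a minimal Python sketch, assuming connection records are available as `(source_ip, destination_port)` pairs from a firewall or flow log. Ports 11434 and 8000 are taken from the illustrative config earlier; MCP has no single fixed port, so a real deployment would add its own. The names `WATCHED_PORTS` and `suspicious_connections` are hypothetical.

```python
import ipaddress

# Ports from the illustrative config; add your deployment's MCP port(s) here
WATCHED_PORTS = {11434, 8000}


def is_public_source(addr: str) -> bool:
    """True for globally routable addresses (not private, loopback, or reserved)."""
    return ipaddress.ip_address(addr).is_global


def suspicious_connections(conns: list[tuple[str, int]]) -> list[tuple[str, int]]:
    """Keep connections that reach a watched port from a public address."""
    return [(src, port) for src, port in conns if port in WATCHED_PORTS and is_public_source(src)]


observed = [("8.8.8.8", 11434), ("10.0.0.5", 11434), ("1.1.1.1", 443), ("192.168.1.20", 8000)]
print(suspicious_connections(observed))
```

Any hit here means the endpoint is reachable from outside at all, which for a self-hosted model is usually the finding that matters, regardless of whether the request succeeded.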

Proposals & gaps this case surfaced

Non-destructive suggestions for the library — proposed, not adopted.

★ Proposed sub-risk: Exposed / unauthenticated inference endpoint (LLMjacking), under #44

A self-hosted inference/serving or MCP endpoint (e.g. Ollama, an OpenAI-compatible API, or an access-control-less MCP server) is reachable from an untrusted network without authentication, allowing third parties to hijack the inference compute (resale/denial-of-wallet/mining), read prompt and conversation state, and — via co-located over-privileged tools — pivot into connected systems.

✚ Proposed guardrail: Admission control on the inference & MCP serving plane: authenticate and network-segment every self-hosted inference/serving and MCP endpoint (category: Agent Access & Tool Control)

Require authN/authZ on every inference API and MCP server, bind to private interfaces / front with a gateway, enforce network policy (no public exposure by default), and scope MCP tools to least privilege — so an exposed endpoint cannot be hijacked for compute resale, prompt/history exfiltration, or lateral movement. Pair with continuous asset discovery so endpoints can't drift back to an open default.
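
The "continuous asset discovery" half of this proposal amounts to comparing what a scan observes against what was approved. This is a minimal Python sketch, assuming some scanner already produces one record per listening inference or MCP endpoint. The `Observed` record, its fields, and `find_drift` are hypothetical names for illustration; how the scan itself is performed is out of scope here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Observed:
    host: str
    port: int
    requires_auth: bool
    publicly_reachable: bool


def find_drift(observed: list[Observed], approved: set[tuple[str, int]]) -> list[str]:
    """Report endpoints that are unknown, unauthenticated, or publicly reachable."""
    findings = []
    for ep in observed:
        name = f"{ep.host}:{ep.port}"
        if (ep.host, ep.port) not in approved:
            findings.append(f"{name}: not in approved inventory")
        if not ep.requires_auth:
            findings.append(f"{name}: answers without authentication")
        if ep.publicly_reachable:
            findings.append(f"{name}: reachable from the public internet")
    return findings


approved = {("10.0.0.5", 11434)}
scan = [
    Observed("10.0.0.5", 11434, requires_auth=True, publicly_reachable=False),
    Observed("10.0.0.9", 8000, requires_auth=False, publicly_reachable=True),
]
for finding in find_drift(scan, approved):
    print(finding)
```

Run on a schedule, a comparison like this catches an endpoint that was locked down once and later redeployed with the convenient default.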

Coverage gap: Resource Exhaustion / Denial of Wallet

This case shows a gap: most AI-risk lists focus on tricking the model with clever inputs. But here nothing tricked the model — the server was simply left open on the internet with no password. 'Don't expose your AI server unauthenticated' deserves to be called out as its own risk and control.

These surface as proposals across the Control Library and Risk Taxonomy; adopt them by hand when ready.

Sources

Operation Bizarre Bazaar: First Attributed LLMjacking Campaign with Commercial Marketplace Monetization. Pillar Security, 28 Jan 2026 (primary). Primary research; 35,000+ sessions, three-stage scan→validate→resell chain, silver.inc, ~60% MCP shift. Figures are Pillar's.
Hackers hijack exposed LLM endpoints in Bizarre Bazaar operation. BleepingComputer, 28 Jan 2026. Independent coverage; characterises the MCP-reconnaissance activity as separate-but-tracked.
LLMs Hijacked, Monetized in 'Operation Bizarre Bazaar'. SecurityWeek.
'Bizarre Bazaar' campaign exploits exposed LLM endpoints. SC World.
OWASP LLM10:2025 Unbounded Consumption. The denial-of-wallet / cost-harvesting risk class realised here as an organised resale market.
