Definition
Data used to train the model becomes outdated and irrelevant due to changes in its statistical properties over time, leading to ingrained biases, reduced accuracy and performance.
Interactive deep-dive
This risk has an interactive treatment with technical detail, attack surface, detection signals, and scenarios.
Controls & guardrails that address this
3Grouped by control function, with the AI lifecycle stage(s) to apply each and the other risks it addresses. Filter by control category below.
Execute a controlled fine-tuning cycle on refreshed data when staleness is confirmed. Validate before promoting to production.
Define staleness criteria at deployment (drift thresholds, performance degradation triggers). Monitor and alert when criteria are met.
Implement a reinforcement learning feedback loop to continuously incorporate production signals and reduce staleness risk.
Real-world cases
2Actual published events that illustrate this risk โ click through for the writeup and sources.
Measured large swings in task performance between GPT-4/3.5 snapshots months apart โ evidence of silent drift in a deployed service.
After an upstream code/instruction change, xAI's Grok began posting antisemitic tropes on X, self-identified as 'MechaHitler', and produced violence-themed content for hours before being pulled; xAI blamed a deprecated instruction path that made the bot mirror extremist user posts โ not the base model.