๐Ÿ”AI RiskAtlas
โ† All systems

Inside the Model

What actually happens when the model 'thinks'

Architecture introduced 12 Jun 2017

Zoom all the way in. The model turns your text into small chunks (tokens), then into numbers, runs them through layers that let each word 'pay attention' to the others, and finally rolls weighted dice to pick the next word. Repeat. The deepest risks live in this machinery.

Inference pipelineBelow the app layerraw texttoken ids๐ŸชŸContext Windowโœ‚๏ธTokenizer๐Ÿ”ขEmbeddings๐Ÿ”ฆAttention + KVCache๐ŸงฌModel Weights &Registry๐ŸŽฒSampler /Decoder๐Ÿ—๏ธServingInfrastructure
InstructionsDataActionsControl / decisionFeedback / logs
๐Ÿ‘† Click any component in the diagram to inspect its risks & defenses

Follow a request ยท step 1 of 5

The whole bundle of text the model can see gets chopped into tokens โ€” little pieces, roughly syllables.

Scenarios on this architecture

AI RiskAtlas is an educational model of how GenAI & agentic systems work and fail. Architectures and payloads are illustrative and simplified for learning โ€” not operational guidance. Real-world cases are summarised from public reporting.

Sources & further reading โ†’ยทBuilt by Shi Yuan โ†—