AI RiskAtlas

Identity Deepfake (Face Swap & Talking Head)

One scraped photo or a three-second clip is enough to become someone else

Architecture introduced 20 May 2019

This system takes a single photo or a short clip of a person (scraped from a social profile, a press shot, a video call) and turns it into a convincing fake of them. One part captures a 'faceprint' from the reference; another pastes that face onto a target's head; a motion part makes it move as video; and an audio path clones the voice so the fake can talk and lip-sync. The only thing standing between 'a public photo' and 'a video of you saying something you never said' is a consent check that is usually missing.
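To make the missing control concrete, here is a minimal sketch of what a deny-by-default consent check in front of the pipeline could look like. It is an illustration under stated assumptions, not a description of any real product: the names ConsentRecord, GenerationRequest and consent_gate, the scope strings, and the in-memory registry are all hypothetical, and the sketch covers only the gate, not generation.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import Optional


@dataclass(frozen=True)
class ConsentRecord:
    """Hypothetical record: the subject allowed this requester this use."""
    subject_id: str       # person whose face or voice would be used
    requester_id: str     # party the subject granted consent to
    scopes: frozenset     # assumed scope names, e.g. {"face_swap"}
    expires_at: datetime  # consent is time-limited


@dataclass(frozen=True)
class GenerationRequest:
    requester_id: str
    subject_id: Optional[str]  # None when the reference is an unidentified scrape
    scope: str


def consent_gate(request, registry, now=None):
    """Return (allowed, reason). Anything without matching consent is denied."""
    now = now or datetime.now(timezone.utc)
    if request.subject_id is None:
        # A scraped photo with no known subject can never carry consent.
        return False, "unidentified subject"
    for record in registry:
        if (record.subject_id == request.subject_id
                and record.requester_id == request.requester_id
                and request.scope in record.scopes
                and record.expires_at > now):
            return True, "consent on file"
    return False, "no matching consent"


# Example: one subject consented to one requester for face swap only.
now = datetime(2025, 1, 1, tzinfo=timezone.utc)
registry = [
    ConsentRecord("alice", "studio-1", frozenset({"face_swap"}),
                  now + timedelta(days=30)),
]
consented = consent_gate(GenerationRequest("studio-1", "alice", "face_swap"),
                         registry, now)
scraped = consent_gate(GenerationRequest("anon", None, "face_swap"),
                       registry, now)
```

The design choice that matters is the default: the gate returns False unless a record matches on subject, requester, scope and expiry, so a scraped reference with no identifiable, consenting subject is refused before any model runs.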

[Diagram: system architecture]
Zones: Requester (may be an abuser); Scraped third-party identity; Generation pipeline; Open-weights supply chain; Consent & content-authenticity controls
Components: User; Untrusted Content; Orchestrator / Agent Loop; Consent / Identity-Use; Synthetic-Media / Deepfake; Face / Identity Embedding; Speaker / Voice-Clone; Face-Swap Generator; Acoustic / TTS Model; Temporal / Motion Module; Audio Decoder / Neural Codec; Model Weights & Registry; Model / Package Registry; Serving Infrastructure; Output Guardrail; Content Provenance & …
Flow labels: target request; scraped clip
Legend: Instructions; Data; Actions; Control / decision; Feedback / logs

Follow a request · step 1 of 6

It starts with a request ('make this person say or do this') plus a photo or clip of someone scraped from the open web. The target almost never knows.

AI RiskAtlas is an educational model of how GenAI & agentic systems work and fail. Architectures and payloads are illustrative and simplified for learning, not operational guidance. Real-world cases are summarised from public reporting.

Sources & further reading · Built by Shi Yuan