Identity Deepfake (Face Swap & Talking Head)

One scraped photo or a three-second clip is enough to become someone else

Architecture introduced 20 May 2019

This system takes a single photo or a short clip of a person — scraped from a social profile, a press shot, a video call — and turns it into a convincing fake of them. One part captures a 'faceprint' from the reference; another pastes that face onto a target's head; a motion part makes it move as video; and an audio path clones the voice so the fake can talk and lip-sync. The only thing standing between 'a public photo' and 'a video of you saying something you never said' is a consent check that is usually missing.
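The consent check mentioned above could sit as a gate in front of the whole pipeline. The sketch below is illustrative only, not how any particular system does it: `ConsentRecord`, `ConsentError`, and `require_consent` are hypothetical names, and it assumes consent records have already been collected and verified out of band.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from typing import Iterable, Optional


@dataclass(frozen=True)
class ConsentRecord:
    """Hypothetical record of a depicted person's verified consent."""
    subject_id: str       # who is depicted
    purpose: str          # the specific use they agreed to
    expires_at: datetime  # timezone-aware expiry time


class ConsentError(PermissionError):
    """Raised when no valid consent covers the request."""


def require_consent(
    records: Iterable[ConsentRecord],
    subject_id: str,
    purpose: str,
    now: Optional[datetime] = None,
) -> ConsentRecord:
    """Return the matching consent record, or raise ConsentError."""
    now = now or datetime.now(timezone.utc)
    for record in records:
        if (
            record.subject_id == subject_id
            and record.purpose == purpose
            and record.expires_at > now
        ):
            return record
    # Fail closed: no matching, unexpired record means no processing.
    raise ConsentError(f"no valid consent from {subject_id!r} for {purpose!r}")


if __name__ == "__main__":
    records = [
        ConsentRecord(
            subject_id="alice",
            purpose="training-video",
            expires_at=datetime.now(timezone.utc) + timedelta(days=30),
        )
    ]
    require_consent(records, "alice", "training-video")  # passes
    try:
        require_consent(records, "bob", "training-video")
    except ConsentError as err:
        print("blocked:", err)
```

The gate fails closed: a missing, expired, or wrong-purpose record each stops the request before any reference media is processed.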

Diagram legend: Instructions · Data · Actions · Control / decision · Feedback / logs

Follow a request · step 1 of 6

It starts with a request ('make this person say or do this') plus a photo or clip of someone scraped from the open web. The person being impersonated almost never knows.
