Deepfake FAQs
The Basics of Deepfakes
Deepfakes are synthetic media that have been generated, manipulated, or altered deceptively using Artificial Intelligence (AI) tools.
The term combines “deep learning”—the AI technology behind these creations—and “fake,” to indicate their synthetic nature.
Most tools used to create deepfakes rely on the same underlying technologies (e.g., Natural Language Processing, Neural Networks).
For a longer list of the key technologies involved in the the creation of deepfakes and how they work, see Further Reading below.
Artificial Intelligence refers to systems that analyze data to make predictions or automate tasks according to set parameters.
Generative AI refers to systems that output text, image, audio, and video content in response to instructions (“prompt”) provided by the user.
Amplifies Disinformation:
Generative AI reduces the cost, time, and skill needed to engage in large-scale disinformation campaigns.
ABC NEWS Verify finds social media posts spreading misinformation about the Bondi attack are receiving millions of views, as a deepfake image depicting one of the victims circulates online. #ai
— Aussie News (@aussienews.bsky.social) December 16, 2025 at 2:00 PM
[image or embed]
Undermines Democracy:
Deepfakes can erode the public’s trust in democratic institutions and undermine their ability to effectively fulfill their mandates.
The Irish presidential election campaign has been disrupted by an artificial intelligence-generated deepfake video of candidate Catherine Connolly announcing her “withdrawal” from the race.
— POLITICO Europe (@politico.eu) October 22, 2025 at 11:32 AM
[image or embed]
Persecutes Citizens:
Generative AI facilitates the creation of personalized, multi-lingual content designed to harass and intimidate political activists.
Thanks to advances in AI, transnational repression is entering a new phase. Governments can now monitor, intimidate, and target people across borders with unprecedented precision. www.theguardian.com/world/2025/d...
— Social Media Lab (@socialmedialab.ca) December 11, 2025 at 12:01 AM
[image or embed]
Manipulates Emotions:
Generative AI’s ability to convincingly simulate human conversation can affect users’ emotions, leading to a variety of potential harms.
California and Delaware AGs blast OpenAI over its youth safety. The AGs are demanding more information from the AI platform after accusations of recent chatbot-linked deaths.
— Politico (@politico.com) September 5, 2025 at 1:53 PM
[image or embed]
Facilitates Hate:
Deepfakes have contributed to a significant increase in digital hate, harassment, and violence against vulnerable groups.
Even celebrities have struggled with the current wave of deepfake apps, and the tools are increasingly being used to torment young girls in middle and high school.
— Ars Technica (@arstechnica.com) July 3, 2025 at 8:39 AM
[image or embed]
Enhances Fraud:
Generative AI enables criminal actors to significantly enhance the effectiveness of their methods and techniques for defrauding users.
Facebook has been overrun with AI spam and scams. Content moderation experts say Facebook has stopped asking them for help: www.404media.co/has-facebook...
— 404 Media (@404media.co) June 24, 2024 at 11:15 AM
[image or embed]
Implementing and Evaluating Solutions
Governments, non-profit organizations, researchers, NGOs, and other organizations have and continue to develop a variety of digital literacy resources and initiatives for different audiences and contexts.
Despite the high-quality, evidence-based resources developed by organizations like MediaSmarts Canada, AlgoWatch, and UNESCO, consistent implementation at-scale remains challenging for a variety of reasons, including a lack of long-term funding, low levels of institutional trust, and an increasingly fractured and polarized media ecosystem (UNESCO, 2025; Edwards et al., 2023).
Large social media platforms operators, like Meta and TikTok, have begun implementing Labeling Policies for AI-generated Content. These labels are intended to clearly indicate to users that the content they are seeing was created or significantly modified using AI or GenAI, either through voluntary disclosure or automated detection.
Early evidence from Stanusch et. al (2025) suggests that implementation is inconsistent and unreliable. Moreover, despite commitments from prominent GenAI developers to include shared open technical standards (“watermarks”) in content produced by their models, a lack of coordinated implementation, combined with a complex technological and platform ecosystem has limited their effectiveness (Kay & Dixon, 2025; Harris & Norden, 2024)
Governments around the world are at various stages of developing, implementing, and enforcing various laws, policies, and regulations intended to mitigate the harms associated with GenAI, deepfakes, and disinformation. While some international frameworks have been proposed, economic competition between countries has resulted in a fragmented regulatory ecosystem.
While still early to definitely assess the effectiveness of new laws and regulations, initial reporting on the challenges facing the European Union’s Artificial Intelligence Act and Digital Services Act – the most comprehensive regulations implemented to date – suggest that further work is needed to ensure coordinated enforcement and manage competing geopolitical priorities, among other challenges. For the latest developments, check the OECD.AI Policy Navigator.
Further Reading
A non-exhaustive list of key technologies, listed in no particular order, that make deepfakes possible.
- Generative Models:
- Diffusion Models are a type of deep learning model used to generate images, audio, and other data types. Introduced in 2015, these models are inspired by diffusion processes in physics, where a substance spreads out over time. They work by representing the data as a sequence of progressively noisier versions of an original dataset and transforming noise into structured data. In recent years, diffusion models have gained popularity for their ability to generate high-quality samples, particularly in image synthesis. They are often considered alternatives to generative models like Generative Adversarial Networks and transformer models.
- Generative Adversarial Networks (GANs) are a class of generative deep learning models introduced in 2014 to generate high-quality and realistic outputs, including images, videos, and audio. GANs consist of two neural networks: a generator and a discriminator. These networks “compete” against each other in a game-like setting. The generator creates fake data (e.g., images, video), while the discriminator evaluates it to determine whether it is real or fake. This competitive process helps ensure that the outputs produced by the generator are accurate.
- Transformer Models are a type of generative deep learning model (a specific type of encoder-decoder architecture) that uses self-attention mechanisms to process and analyze sequential data, allowing for the parallel handling of input sequences. This type of model transforms input data (e.g., pixels from an image) into tokens and then uses a “self-attention mechanism” to learn connections between tokens, which in turn allows the model to generate new content (e.g., images) by predicting new tokens (e.g., pixels). Self-attention mechanisms are a key component of transformer models that allow the model to weigh the importance of different tokens in a sequence relative to each other. In other words, these mechanisms help capture contextual relationships and dependencies within the data, enabling the model to focus on relevant parts of the input when making predictions or generating outputs. Since their introduction in 2017, transformers have become foundational in developing state-of-the-art models like BERT (Bidirectional Encoder Representations from Transformers) and GPT (Generative Pre-trained Transformer).
- Traditional Video and Photo Editing Tools are sophisticated software that allows users to edit, manipulate, and transform raw visual media files into professional photos and videos. While AI can assist with tasks such as object tracking, motion smoothing, and automated effects, traditional photo and video editing tools can fine-tune the AI outputs, making manual adjustments to enhance the realism and overall quality of the final product. This combination keeps the human in the loop and it allows for more precise and realistic visual effects, bridging the gap between automated AI-generated content and the creative, human touch necessary for professional and more realistic results.
- Facial Recognition and Tracking (FRT) technologies detect, identify, and follow faces in images or videos. They are widely applied in areas such as security, user authentication, augmented reality, and social media.
- Voice Synthesis and Audio Processing are two interconnected fields that focus on the generation, manipulation, and analysis of sound, particularly human speech and music. Both areas leverage advanced algorithms and technologies to create, modify, and understand audio signals.
- Machine Learning (ML) is a subset of Artificial Intelligence (AI) that focuses on developing algorithms and statistical models that enable computers to perform tasks without explicit instructions. Instead of being programmed to follow specific rules, machine learning systems learn from data, identifying patterns and making decisions based on that data.
- Natural Language Processing (NLP) is a subfield of ML, focused on enabling computers to understand, interpret, and generate human language. The key technologies involved in NLP are: i) tokenization and parsing, which break down text into smaller units that the computer can understand; ii) word embeddings, which enable computers to understand the contextual meanings of human language; and, iii) transformer models (e.g., ChatGPT), which understand and generate human-like text.
- Deep Learning (DL) is a subset of ML that uses artificial neural networks (defined below) to model complex patterns in data. It involves training networks with multiple layers (hence “deep”) to automatically learn features from raw input data, enabling it to solve tasks like image recognition, natural language processing, and game-based challenges.
- Neural Networks (NN) are a fundamental component of AI and ML. They are inspired by the structure and function of the human brain and are used to teach a computer how to create synthetic media that resembles the real thing as closely as possible. There are different types of NNs designed for different data types and applications.
A non-exhaustive list of common neural networks
- Autoencoder (AE) are a type of NN used to learn efficient data representations, typically for dimensionality reduction or feature learning. In the context of deepfakes, AEs are commonly used to create deepfakes of human faces. They are composed of two parts: i) an encoder that compresses the input data into a more manageable form; and, ii) a decoder that reconstructs the original data from the compressed version.
- Convolutional Neural Networks (CNN) are a type of artificial NN designed to work with grid-like data, most commonly images. They are especially effective at capturing spatial hierarchies in data, making them well-suited for tasks like image recognition, object detection, and segmentation. CNNs are used to power deepfake techniques like face-swapping and facial animation to recreate realistic faces and expressions.
- Encoder-Decoder Architecture (EDA) is a type of DL model used to transform one type of data into another. It is widely used in translation, text-to-speech, text summarization, and image captioning. Like an AE, EDA architecture involves two main components: an encoder that processes the input and a decoder that generates the desired output.
- First Order Motion Model (FOMM) is a DL-based model primarily used for animating still images. FOMM generates animations by understanding and modeling the movement of an object (like a face). Unlike conventional methods that need an extensive training dataset of a particular target object, FOMM uses unsupervised learning to animate images without specific training.
- Recurrent Neural Networks (RNNs) are a class of NN designed for processing sequential data. Unlike traditional feedforward NNs, RNNs have connections that loop back on themselves, allowing them to maintain a memory or context of preceding data in a sequence while simultaneously generating new data. This unique structure makes RNNs particularly effective for tasks involving sequences, such as time series analysis, NLP, and speech recognition.
Erosion of Public Trust in Democratic Institutions
- The State of Generative AI Use in Canada 2025: Exploring Public Attitudes and Adoption Trends (Social Media Lab, 2025)
- Americans largely foresee AI having negative effects on news, journalists (PEW Research Center, 2025)
- Global Risks Report 2025: Key Findings (World Economic Forum, 2025)
Amplification of Mis- and Disinformation
- Grok AI: A deepfake disinformation disaster for democracy (Centre for Countering Digital Hate, 2024)
- Prompt, Upload, Repeat: How Agentic AI Accounts Flood TikTok with Harmful Content (Stanusch et al., 2025)
- Generative AI is the ultimate disinformation amplifier (DW Akademie, 2024)
Scams, Fraud, and Digital Deception
- Meta Awash in Deepfake Scam Ads (Tech Transparency Project, 2025)
- How spammers and scammers leverage AI-generated images on Facebook for audience growth (DiResta & Goldstein, 2024)
- Automating Deception: AI’s Evolving Role in Romance Fraud (Moseley, 2025)
Gendered Violence and Online Hate
- The Malicious Technical Ecosystem: Exposing Limitations in Technical Governance of AI-Generated Non-Consensual Intimate Images of Adults (Ding & Suresh, 2025)
- Al-Generated Imagery and the New Frontier of Islamophobia in India (Center for the Study of Organized Hate, 2025)
- The Weaponisation of AI: Visual storytelling of the Great Replacement Conspiracy Theory Amid the Southport Riots (Global Network on Extremism & Technology, 2025)
Mental Health and Promoting Harmful Behaviours
- Meta Approves Harmful Teen Ads with Images from its Own AI Tool (Tech Transparency Project, 2024)
- OpenAI Says Hundreds of Thousands of ChatGPT Users May Show Signs of Manic or Psychotic Crisis Every Week (WIRED, 2025)
- Risk Assessment: Social AI Companions (Common Sense Media, 2025)
Transnational Repression and Foreign Interference
- Transnational Repression in the Digital Age and the Targeting of Uyghur Human Rights Defenders (Centre for the Study of Organized Hate, 2025)
- Canada targeted in a new Chinese transnational repression campaign linked to ‘Spamouflage’ (Global Affairs Canada, 2025)
- The Invisible Hand of Artificial Intelligence in Transnational Repression (Tech Policy Press, 2024)