Member-only story
What Is Diffusion in AI and How Does It Work?
Understand the core concept behind generative AI models
This year, we witnessed the rise of text-to-image AI as advancements significantly improved the quality of output AI models can produce.
Controversial AI solutions such as Stable Diffusion and OpenAI’s DALL-E have been integrated into websites like Canva or DeviantArt to provide advanced creative tools, customized branding, and even generate product ideas.
Yet, the technology behind these systems, known as diffusion, has the potential to do more than create art. Much more. Some research groups are using diffusion to digitize music, DNA sequences, and develop new drugs, and the tech has improved at a staggering pace over the past few years — so this is only the beginning.
What exactly is diffusion, and why is it such a significant advancement, you might ask? Let’s take a look at the origins of diffusion and how it’s evolved to become the powerful technological driver it is today.
How diffusion came about
Remember the hysteria around deepfakes a few years back — such as the famous “ deeptom” TikTok experiment?
These apps used AI to insert people’s faces (or their entire bodies) into existing images and videos, creating realistic renditions of the original subjects. Using an AI technology called generative adversarial networks (GANs), the apps could embed a person’s face into a given content with such outstanding accuracy, it could fool anyone who’s not an expert in the subject.
How do GANs work? They consist of 2 parts:
- A generator that produces synthetic examples (like images) from random data, and
- A discriminator that tries to distinguish between synthetic and real-world examples, learning from a training dataset compromised of millions of datapoints
- Both the generator and discriminator continue to refine their abilities until the discriminator can no longer accurately distinguish between the real and fake examples
The most powerful GANs are able to create a variety of synthetic examples, such as realistic images of fictional landscapes, objects, or characters.