What is a GAN?

A Generative Adversarial Network (GAN) is a generative AI architecture consisting of two neural networks — a generator and a discriminator — that compete against each other. The generator creates fake data, and the discriminator tries to distinguish it from real data. Through this adversarial training, the generator learns to produce increasingly realistic outputs.
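The competition described above is usually written as a two-player minimax game. The block below gives the value function from the original GAN formulation; the notation is introduced here for illustration and does not appear elsewhere in this article. D(x) is the discriminator's estimated probability that x is real, G(z) is the generator's output for noise z, p_data is the real data distribution, and p_z is the noise distribution.

```latex
\min_G \max_D \; V(D, G)
  = \mathbb{E}_{x \sim p_{\text{data}}}\left[\log D(x)\right]
  + \mathbb{E}_{z \sim p_z}\left[\log\left(1 - D(G(z))\right)\right]
```

The discriminator tries to push V up by scoring real samples near 1 and generated samples near 0, while the generator tries to push V down by making D(G(z)) large.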

Why It Matters

GANs were the dominant generative image model before diffusion models and remain influential. They pioneered AI-generated faces (StyleGAN), image-to-image translation (pix2pix), and super-resolution. Understanding GANs provides essential context for how generative AI evolved and why diffusion models eventually superseded them for most image generation tasks.

How It Works

Generator — takes random noise as input and produces synthetic data (e.g., a fake image).
Discriminator — receives both real data and generator output, and predicts whether each sample is real or fake.
Adversarial training — the two networks are trained simultaneously:

The discriminator improves at detecting fakes
The generator improves at fooling the discriminator
In theory, training converges when the generator's output is indistinguishable from real data; at that point the discriminator can do no better than chance and outputs 0.5 for every sample
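The steps above can be sketched as a toy training loop. This is a minimal illustration under stated assumptions, not a practical GAN: the "data" are one-dimensional numbers drawn from a Gaussian, the generator is a linear map G(z) = a*z + b, the discriminator is a logistic classifier D(x) = sigmoid(w*x + c), and the gradients are written out by hand so that only the Python standard library is needed. The function name, hyperparameters, and target distribution are all hypothetical. The two networks are updated in alternating steps, which is a common way to implement "simultaneous" training, and the generator uses the non-saturating loss (maximize log D(G(z))) rather than the literal minimax form.

```python
import math
import random


def sigmoid(t):
    # Numerically safe logistic function.
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)


def train_toy_gan(steps=2000, batch=32, lr=0.05,
                  real_mean=4.0, real_std=0.5, seed=0):
    """Train a 1-D toy GAN; return ((a, b), (w, c)).

    Generator:      G(z) = a * z + b          (z ~ N(0, 1))
    Discriminator:  D(x) = sigmoid(w * x + c) (probability x is real)
    All names and defaults are illustrative assumptions.
    """
    rng = random.Random(seed)
    a, b = 1.0, 0.0  # generator parameters
    w, c = 0.0, 0.0  # discriminator parameters

    for _ in range(steps):
        real = [rng.gauss(real_mean, real_std) for _ in range(batch)]
        noise = [rng.gauss(0.0, 1.0) for _ in range(batch)]
        fake = [a * z + b for z in noise]

        # Discriminator step: gradient ascent on
        # log D(real) + log(1 - D(fake)).
        grad_w = grad_c = 0.0
        for x in real:
            d = sigmoid(w * x + c)
            grad_w += (1.0 - d) * x   # d/dw log D(x)
            grad_c += (1.0 - d)       # d/dc log D(x)
        for x in fake:
            d = sigmoid(w * x + c)
            grad_w -= d * x           # d/dw log(1 - D(x))
            grad_c -= d               # d/dc log(1 - D(x))
        w += lr * grad_w / batch
        c += lr * grad_c / batch

        # Generator step: gradient ascent on log D(G(z))
        # (non-saturating loss), using the updated discriminator.
        grad_a = grad_b = 0.0
        for z in noise:
            d = sigmoid(w * (a * z + b) + c)
            grad_a += (1.0 - d) * w * z  # chain rule through G(z)
            grad_b += (1.0 - d) * w
        a += lr * grad_a / batch
        b += lr * grad_b / batch

    return (a, b), (w, c)
```

Calling `train_toy_gan()` returns the learned generator and discriminator parameters. Whether the generator's offset `b` settles near the real mean depends on the learning rate and step count; with a discriminator this simple, the parameters may oscillate rather than converge, which is a small-scale view of the training instability discussed later.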

Notable GAN variants:

StyleGAN — generates photorealistic faces with fine-grained style control
CycleGAN — translates between image domains without paired examples (horse ↔ zebra)
Pix2pix — paired image translation (sketch → photo)
SRGAN — super-resolution (enhance low-res images)

GAN challenges:

Mode collapse — the generator produces only a few types of output
Training instability — the adversarial balance is hard to maintain
No density estimation — GANs can't tell you how likely a given sample is

These challenges are a major reason diffusion models have largely replaced GANs for high-quality image generation.
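Mode collapse, the first challenge listed above, can be made concrete on toy data where the true modes are known in advance. The sketch below assumes one-dimensional samples and a hand-picked list of mode centers; the function name, the `radius` threshold, and the sample values are hypothetical and chosen only for illustration. It reports the fraction of known modes that have at least one generated sample nearby. On real image data the modes are not known, so this kind of check does not apply directly.

```python
def mode_coverage(samples, mode_centers, radius):
    """Fraction of known modes with at least one sample within `radius`.

    samples:      list of 1-D generated values
    mode_centers: list of 1-D centers of the true data modes (assumed known)
    radius:       maximum distance for a sample to count as hitting a mode
    """
    covered = 0
    for center in mode_centers:
        if any(abs(s - center) <= radius for s in samples):
            covered += 1
    return covered / len(mode_centers)


# Illustrative values: the real data has three modes.
centers = [-4.0, 0.0, 4.0]
healthy = [-4.2, 0.1, 3.8, -3.9, 0.3]    # touches every mode
collapsed = [3.9, 4.1, 4.0, 4.05, 3.95]  # stuck on a single mode

healthy_score = mode_coverage(healthy, centers, radius=0.5)
collapsed_score = mode_coverage(collapsed, centers, radius=0.5)
```

A score well below 1.0, as for the `collapsed` list here, indicates that the generator is ignoring some of the modes in the data.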

Example

Thispersondoesnotexist.com (now offline) used StyleGAN to generate photorealistic faces of people who don't exist. Each page refresh produced a new, entirely synthetic face — demonstrating both the power of GANs and the deepfake concerns they raised.
