Diffusion Models

Diffusion models generate data by learning to reverse a gradual noising process. This page keeps the core diffusion literature separate from text-to-image system reports and flow-matching/fast-sampling papers.

Surveys and Monographs

YearPaperTopicNote
2025The Principles of Diffusion ModelsMonograph / surveyUnifies variational, score-based, and flow-based views of diffusion models.

Origins and Core Theory

YearPaperTopicNote
2015Deep Unsupervised Learning using Nonequilibrium ThermodynamicsEarly diffusionEarly noising-reversal view of generative modeling.
2019Generative Modeling by Estimating Gradients of the Data DistributionNCSNNoise-conditioned score networks for score-based generation.
2020Denoising Diffusion Probabilistic Models (code)DDPMModern denoising diffusion formulation.
2020Denoising Diffusion Implicit ModelsDDIMFaster non-Markovian sampling with DDPM training.
2020Score-Based Generative Modeling through Stochastic Differential EquationsScore-based SDEsContinuous-time unification of diffusion and score models.
2021Improved Denoising Diffusion Probabilistic Models (code)Improved DDPMLearned variances and improved sampling/likelihoods.

Guidance, Latents, and Architecture

YearPaperTopicNote
2021Diffusion Models Beat GANs on Image Synthesis (code)Classifier guidanceShows diffusion can beat GANs in image quality with improved architectures and guidance.
2021High-Resolution Image Synthesis with Latent Diffusion Models (code)Latent diffusionDiffusion in autoencoder latent space; Stable Diffusion lineage.
2022Classifier-Free Diffusion GuidanceGuidanceConditional guidance without an external classifier.
2022Elucidating the Design Space of Diffusion-Based Generative ModelsEDMModular analysis of diffusion design choices.
2022Scalable Diffusion Models with TransformersDiTReplaces U-Net backbones with scalable diffusion transformers.
2023Adding Conditional Control to Text-to-Image Diffusion ModelsControlNetAdds spatial controls to pretrained text-to-image diffusion models.

Cross-Database Pointers

ThemeGo ToNote
Text-to-image systemsText-to-Image and Video SystemsGLIDE, DALL-E 2/3, Imagen, SDXL, Sora, and related system papers live there.
Fast sampling and flow-based successorsFlow Matching and Fast SamplingDPM-Solver, consistency models, rectified flow, flow matching, and Stable Diffusion 3 live there.

Reading Path

StepRead
1The Principles of Diffusion Models for the unified conceptual map.
2Nonequilibrium Thermodynamics, NCSN, and DDPM.
3DDIM, Score-Based SDEs, and Improved DDPM.
4Guided Diffusion, Classifier-Free Guidance, and EDM.
5Latent Diffusion, DiT, and ControlNet.