VAEs and Flows

VAEs and normalizing flows are likelihood-oriented generative models. They are useful for understanding latent variables, amortized inference, variational bounds, vector quantization, exact likelihoods, and invertible transformations.

Variational Autoencoders and Discrete Latents

YearPaperTopicNote
2013Auto-Encoding Variational BayesVAEReparameterization trick and amortized variational inference.
2015Importance Weighted AutoencodersIWAETighter variational bound using importance-weighted samples.
2016beta-VAEDisentanglementAdjusts the KL weight to encourage factorized latent structure.
2017Neural Discrete Representation LearningVQ-VAEDiscrete latent codes learned with vector quantization.
2019Generating Diverse High-Fidelity Images with VQ-VAE-2VQ-VAE-2Hierarchical discrete latents for high-quality image synthesis.
2020Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on ImagesVDVAEVery deep hierarchical VAE with strong likelihood performance.
2020NVAEDeep VAEScalable normalizing-flow-enhanced VAE architecture.

Normalizing Flows

YearPaperTopicNote
2014NICE: Non-linear Independent Components EstimationCoupling flowsEarly tractable invertible density model.
2015Variational Inference with Normalizing FlowsNormalizing flowsFlexible posteriors via invertible transformations.
2016Improving Variational Inference with Inverse Autoregressive FlowIAFAutoregressive flow for richer approximate posteriors.
2016Density Estimation Using Real NVPReal NVPCoupling layers with exact likelihood and sampling.
2017Masked Autoregressive Flow for Density EstimationMAFAutoregressive density estimator as a flow.
2018Glow (code)FlowInvertible 1x1 convolutions for scalable image flows.
2018FFJORDContinuous normalizing flowUnbiased trace estimator for scalable continuous-time flows.
2019Neural Spline FlowsSpline flowsMonotonic rational-quadratic splines for flexible invertible transforms.

Reading Path

StepRead
1Auto-Encoding Variational Bayes and IWAE.
2VQ-VAE and VQ-VAE-2 for discrete latent tokenization.
3NICE, Real NVP, Glow, and Neural Spline Flows for exact-likelihood flows.
4FFJORD before moving to Flow Matching and Fast Sampling.