Concepedia

TLDR

DDPMs generate high‑quality images but require many Markov chain steps, with the generative process defined as the reverse of a Markovian diffusion. The authors propose DDIMs to accelerate sampling while using the same training procedure as DDPMs. DDIMs employ non‑Markovian diffusion processes that preserve the training objective yet allow a faster reverse sampling process. DDIMs generate high‑quality samples up to 50× faster than DDPMs, enable a trade‑off between computation and quality, and support semantically meaningful latent‑space interpolation.

Abstract

Denoising diffusion probabilistic models (DDPMs) have achieved high quality image generation without adversarial training, yet they require simulating a Markov chain for many steps to produce a sample. To accelerate sampling, we present denoising diffusion implicit models (DDIMs), a more efficient class of iterative implicit probabilistic models with the same training procedure as DDPMs. In DDPMs, the generative process is defined as the reverse of a Markovian diffusion process. We construct a class of non-Markovian diffusion processes that lead to the same training objective, but whose reverse process can be much faster to sample from. We empirically demonstrate that DDIMs can produce high quality samples $10 \times$ to $50 \times$ faster in terms of wall-clock time compared to DDPMs, allow us to trade off computation for sample quality, and can perform semantically meaningful image interpolation directly in the latent space.

References

YearCitations

Page 1