Interpolating Discrete Diffusion Models with Controllable Resampling

arXiv cs.LG / 4/21/2026

📰 NewsModels & Research

Key Points

  • The paper introduces IDDM (Interpolating Discrete Diffusion Model) to address quality and error-accumulation issues in existing discrete diffusion approaches such as masked and uniform diffusion models.
  • IDDM uses a controllable resampling mechanism that partially resets probability mass to the marginal distribution, reducing reliance on intermediate latent states.
  • The proposed generative transitions interpolate between keeping the current state, resampling from a prior, and moving (flipping) toward the target state.
  • The method enforces marginal consistency and claims to fully decouple training from inference, potentially simplifying deployment and improving robustness.
  • Experiments on molecular graph generation and text generation show IDDM achieves competitive results versus state-of-the-art discrete diffusion models.

Abstract

Discrete diffusion models form a powerful class of generative models across diverse domains, including text and graphs. However, existing approaches face fundamental limitations. Masked diffusion models suffer from irreversible errors due to early unmasking, while uniform diffusion models, despite enabling self-correction, often yield low-quality samples due to their strong reliance on intermediate latent states. We introduce IDDM, an Interpolating Discrete Diffusion Model, that improves diffusion by reducing dependence on intermediate latent states. Central to IDDM is a controllable resampling mechanism that partially resets probability mass to the marginal distribution, mitigating error accumulation and enabling more effective token corrections. IDDM specifies a generative process whose transitions interpolate between staying at the current state, resampling from a prior, and flipping toward the target state, while enforcing marginal consistency and fully decoupling training from inference. We benchmark our model against state-of-the-art discrete diffusion models across molecular graph generation as well as text generation tasks, demonstrating competitive performance.