MCMC-Correction of Score-Based Diffusion Models for Model Composition

arXiv stat.ML / 4/2/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies how diffusion models can be sampled using either energy-based parameterizations (enabling Metropolis–Hastings corrections) or score-based parameterizations (which typically lack an explicit energy function).
  • It proposes an MCMC-correction approach for score-based diffusion models by introducing an MH-like acceptance rule derived from line integration of the score function, avoiding the need for explicit energy modeling.
  • The method is framed as an instance of annealed MCMC that allows composing pre-trained score-based diffusion models to sample from new target distributions.
  • Experiments on both synthetic and real-world datasets show relative sampling improvements comparable in magnitude to those achieved by energy-based models, while retaining the ability to reuse existing score-model ecosystems.

Abstract

Diffusion models can be parameterized in terms of either score or energy function. The energy parameterization is attractive as it enables sampling procedures such as Markov Chain Monte Carlo (MCMC) that incorporates a Metropolis--Hastings (MH) correction step based on energy differences between proposed samples. Such corrections can significantly improve sampling quality, particularly in the context of model composition, where pre-trained models are combined to generate samples from novel distributions. Score-based diffusion models, on the other hand, are more widely adopted and come with a rich ecosystem of pre-trained models. However, they do not, in general, define an underlying energy function, making MH-based sampling inapplicable. In this work, we address this limitation by retaining score parameterization and introducing a novel MH-like acceptance rule based on line integration of the score function. This allows the reuse of existing diffusion models while still combining the reverse process with various MCMC techniques, viewed as an instance of annealed MCMC. Through experiments on synthetic and real-world data, we show that our MH-like samplers {yield relative improvements of similar magnitude to those observed} with energy-based models, without requiring explicit energy parameterization.