Binomial flows: Denoising and flow matching for discrete ordinal data

arXiv cs.LG / 5/4/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses a gap in discrete flow-based generative modeling by relating denoisers to score functions, analogous to Tweedie’s formula in continuous spaces.
  • It introduces “Binomial flows,” a framework tailored to discrete, non-negative ordinal data that links training denoising with sampling using score-like quantities.
  • The method provides a single training recipe for a discrete diffusion model that can denoise, generate samples, and compute exact likelihoods.
  • The approach is validated through experiments on synthetic data and shown to achieve competitive performance on real-world datasets.

Abstract

Flow-based generative modeling in continuous spaces exploit Tweedie's formula to express the denoiser (learned in training) as a score function (used in sampling). In contrast, this relation has been largely missing in the discrete setting where common approaches focus on learning discrete scores and rates. In this work we close this gap for discrete non-negative ordinal data by introducing Binomial flows. Our framework provides a simple recipe for training a discrete diffusion model which simultaneously denoises, samples, and estimates exact likelihoods. We verify our methodology on synthetic examples and obtain competitive results on real-world data sets.