Flow Matching is Adaptive to Manifold Structures

arXiv stat.ML / 4/10/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies flow matching—an ODE-based, simulation-free alternative to diffusion models—specifically when the target data distribution lies on a low-dimensional smooth manifold rather than having a full-dimensional smooth density.
  • It analyzes flow matching with linear interpolation and provides a non-asymptotic convergence guarantee for the learned time-dependent velocity field under manifold support assumptions.
  • The work propagates velocity-field estimation error through the learned ODE to prove statistical consistency of the implicit density estimator induced by the flow-matching objective.
  • It derives a convergence rate that is near minimax-optimal and depends mainly on the manifold’s intrinsic dimension, explicitly linking performance gains to intrinsic geometry and smoothness.
  • Overall, the results offer a theoretical explanation for why flow matching can adapt to manifold-structured data and mitigate the curse of dimensionality observed in applications like text-to-image, video, and molecular generation.

Abstract

Flow matching has emerged as a simulation-free alternative to diffusion-based generative modeling, producing samples by solving an ODE whose time-dependent velocity field is learned along an interpolation between a simple source distribution (e.g., a standard normal) and a target data distribution. Flow-based methods often exhibit greater training stability and have achieved strong empirical performance in high-dimensional settings where data concentrate near a low-dimensional manifold, such as text-to-image synthesis, video generation, and molecular structure generation. Despite this success, existing theoretical analyses of flow matching assume target distributions with smooth, full-dimensional densities, leaving its effectiveness in manifold-supported settings largely unexplained. To this end, we theoretically analyze flow matching with linear interpolation when the target distribution is supported on a smooth manifold. We establish a non-asymptotic convergence guarantee for the learned velocity field, and then propagate this estimation error through the ODE to obtain statistical consistency of the implicit density estimator induced by the flow-matching objective. The resulting convergence rate is near minimax-optimal, depends only on the intrinsic dimension, and reflects the smoothness of both the manifold and the target distribution. Together, these results provide a principled explanation for how flow matching adapts to intrinsic data geometry and circumvents the curse of dimensionality.