Diffusion Processes on Implicit Manifolds

arXiv cs.LG / 4/9/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a data-driven stochastic differential equation (SDE) to model diffusion on an underlying low-dimensional manifold using only point-cloud samples, without requiring charts, projections, or other explicit geometric primitives.
  • It constructs the diffusion in the ambient space by estimating the diffusion’s infinitesimal generator and its carré-du-champ (CDC) from a proximity graph built from the data, which jointly capture local stochastic and geometric structure.
  • The authors prove that as the number of samples increases, the resulting implicit diffusion process converges in law to the corresponding diffusion defined on the smooth manifold counterpart.
  • The work introduces “Implicit Manifold-valued Diffusions (IMDs)” and provides a numerical simulation method via Euler-Maruyama integration to enable practical approximation of the learned manifold-aware diffusion dynamics.
  • The approach is positioned as a rigorous foundation for manifold-aware sampling, exploration, and generative modeling, and it motivates further research directions in using intrinsic diffusion for machine learning tasks.

Abstract

High-dimensional data are often modeled as lying near a low-dimensional manifold. We study how to construct diffusion processes on this data manifold in the implicit setting. That is, using only point cloud samples and without access to charts, projections, or other geometric primitives. Our main contribution is a data-driven SDE that captures intrinsic diffusion on the underlying manifold while being defined in ambient space. The construction relies on estimating the diffusion's infinitesimal generator and its carr\'e-du-champ (CDC) from a proximity graph built from the data. The generator and CDC together encode the local stochastic and geometric structure of the intended diffusion. We show that, as the number of samples grows, the induced process converges in law on the space of probability paths to its smooth manifold counterpart. We call this construction Implicit Manifold-valued Diffusions (IMDs), and furthermore present a numerical simulation procedure using Euler-Maruyama integration. This gives a rigorous basis for practical implementations of diffusion dynamics on data manifolds, and opens new directions for manifold-aware sampling, exploration, and generative modeling.