Tight Bounds for Schr\"odinger Potential Estimation in Unpaired Data Translation

arXiv stat.ML / 2026/3/24

💬 オピニオンIdeas & Deep AnalysisModels & Research

要点

  • The paper studies unpaired data translation and generative modeling using Schrödinger bridges and stochastic optimal control when only i.i.d. samples from the source and target distributions are available.
  • It uses an Ornstein–Uhlenbeck process as the reference and focuses on estimating the associated Schrödinger potential from data.
  • By defining a risk function based on the KL divergence between couplings, the authors derive tight generalization bounds for empirical risk minimization over a class of Schrödinger potentials, including Gaussian mixtures.
  • The mixing properties of the Ornstein–Uhlenbeck reference enable near-fast convergence rates in favorable cases, though with some logarithmic factors.
  • The work includes numerical experiments demonstrating the practical performance of the proposed approach.

Abstract

Modern methods of generative modelling and unpaired data translation based on Schr\"odinger bridges and stochastic optimal control theory aim to transform an initial density to a target one in an optimal way. In the present paper, we assume that we only have access to i.i.d. samples from the initial and final distributions. This makes our setup suitable for both generative modelling and unpaired data translation. Relying on the stochastic optimal control approach, we choose an Ornstein-Uhlenbeck process as the reference one and estimate the corresponding Schr\"odinger potential. Introducing a risk function as the Kullback-Leibler divergence between couplings, we derive tight bounds on the generalization ability of an empirical risk minimizer over a class of Schr\"odinger potentials, including Gaussian mixtures. Thanks to the mixing properties of the Ornstein-Uhlenbeck process, we almost achieve fast rates of convergence, up to some logarithmic factors, in favourable scenarios. We also illustrate the performance of the suggested approach with numerical experiments.