Sinkhorn-Drifting Generative Models
arXiv cs.LG / 3/16/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper establishes a precise link between drifting generative dynamics and Sinkhorn-divergence gradient flows, showing they share a cross-minus-self structure expressed via normalization kernels.
- In a particle discretization, the drift decomposes into an attractive term toward the target and a repulsive term toward the current model, with Sinkhorn divergence defined via two-sided entropic optimal-transport couplings through Sinkhorn scaling.
- The work resolves an identifiability gap in prior drifting formulations by proving that zero drift implies the model equals the target, thanks to the definiteness of the Sinkhorn divergence.
- Experiments show improved stability and one-step generation quality, with FFHQ-ALAE at low temperature achieving a mean FID drop from 187.7 to 37.1 and a mean latent EMD drop from 453.3 to 144.4, while MNIST preserves full class coverage; the approach trades off additional training time for these gains.




