Wasserstein-p Central Limit Theorem Rates: From Local Dependence to Markov Chains

arXiv stat.ML / 4/21/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper derives non-asymptotic central limit theorem (CLT) convergence rates for multivariate dependent data measured in Wasserstein-p distance for any p≥1.
It establishes the first optimal O(n^{-1/2}) Wasserstein-1 (W1) CLT rate for two common dependence structures: locally dependent sequences and geometrically ergodic Markov chains.
For p≥2, it also provides the first Wasserstein-p (Wp) CLT rates under relatively mild moment assumptions, improving previously known bounds for dependent data.
As an application, the authors obtain a first optimal W1-CLT rate for multivariate U-statistics using their optimal W1 result for locally dependent sequences.
Technically, the work provides a tractable bound for W1 Gaussian approximation errors and proves geometric tails for regeneration times of split chains without requiring strong aperiodicity or other restrictive conditions.

Abstract

Non-asymptotic central limit theorem (CLT) rates play a central role in modern machine learning and operations research. In this paper, we study CLT rates for multivariate dependent data in Wasserstein-

p

(

W_p

) distance, for general

p\ge 1

. We focus on two fundamental dependence structures that commonly arise in practice: locally dependent sequences and geometrically ergodic Markov chains. In both settings, we establish the first optimal

\mathcal O(n^{-1/2})

rate in

W_1

, as well as the first

W_p

(

p\ge 2

) CLT rates under mild moment assumptions, substantially improving the best previously known bounds in these dependent-data regimes. As an application of our optimal

W_1

rate for locally dependent sequences, we further obtain the first optimal

W_1

-CLT rate for multivariate

U

-statistics. On the technical side, we derive a tractable auxiliary bound for

W_1

Gaussian approximation errors that is well suited for studying dependent data. For Markov chains, we further prove that the regeneration time of the split chain associated with a geometrically ergodic chain has a geometric tail without assuming strong aperiodicity or other restrictive conditions. These tools may be of independent interests and enable our optimal

W_1

rates and underpin our

W_p

(

p\ge 2

) results.

Every time a new model comes out, the old one is obsolete of course

Reddit r/LocalLLaMA

We built it during the NVIDIA DGX Spark Full-Stack AI Hackathon — and it ended up winning 1st place overall 🏆

Dev.to

Stop Losing Progress: Setting Up a Pro Jupyter Workflow in VS Code (No More Colab Timeouts!)

Dev.to

Building AgentOS: Why I’m Building the AWS Lambda for Insurance Claims

Dev.to

Where we are. In a year, everything has changed. Kimi - Minimax - Qwen - Gemma - GLM

Reddit r/LocalLLaMA

Wasserstein-p Central Limit Theorem Rates: From Local Dependence to Markov Chains

Key Points

Abstract

Related Articles

Every time a new model comes out, the old one is obsolete of course

We built it during the NVIDIA DGX Spark Full-Stack AI Hackathon — and it ended up winning 1st place overall 🏆

Stop Losing Progress: Setting Up a Pro Jupyter Workflow in VS Code (No More Colab Timeouts!)

Building AgentOS: Why I’m Building the AWS Lambda for Insurance Claims

Where we are. In a year, everything has changed. Kimi - Minimax - Qwen - Gemma - GLM

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer