High Confidence Level Inference is Almost Free using Parallel Stochastic Optimization

arXiv stat.ML / 2026/3/24

💬 オピニオンIdeas & Deep AnalysisModels & Research

要点

  • The paper proposes an online inference method that builds t-based confidence intervals for stochastic-optimization estimators using only a small number of independent multi-runs to capture distribution information.
  • It claims the inference step is “almost cost-free,” requiring minimal extra computation and memory beyond the normal online estimation updates.
  • The authors provide theoretical coverage guarantees, including an explicit convergence rate toward nominal confidence levels and a newly developed Gaussian approximation result to characterize interval coverage via relative errors.
  • The approach is designed for easy integration with existing stochastic algorithms and can exploit parallel computing across multiple cores to speed up computation.

Abstract

Uncertainty quantification for estimation through stochastic optimization solutions in an online setting has gained popularity recently. This paper introduces a novel inference method focused on constructing confidence intervals with efficient computation and fast convergence to the nominal level. Specifically, we propose to use a small number of independent multi-runs to acquire distribution information and construct a t-based confidence interval. Our method requires minimal additional computation and memory beyond the standard updating of estimates, making the inference process almost cost-free. We provide a rigorous theoretical guarantee for the confidence interval, demonstrating that the coverage is approximately exact with an explicit convergence rate and allowing for high confidence level inference. In particular, a new Gaussian approximation result is developed for the online estimators to characterize the coverage properties of our confidence intervals in terms of relative errors. Additionally, our method also allows for leveraging parallel computing to further accelerate calculations using multiple cores. It is easy to implement and can be integrated with existing stochastic algorithms without the need for complicated modifications.