High Probability Complexity Bounds of Trust-Region Stochastic Sequential Quadratic Programming with Heavy-Tailed Noise

arXiv stat.ML / 4/2/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces a Trust-Region Stochastic Sequential Quadratic Programming (TR-SSQP) algorithm for nonlinear optimization with stochastic objectives and deterministic equality constraints, where objective derivatives are accessed only through probabilistic (zeroth-, first-, second-order) oracles.
  • It derives high-probability iteration complexity bounds for finding both first-order and second-order $5$-stationary points, explicitly extending prior SSQP complexity work that mainly assumed light-tailed noise and focused largely on first-order stationarity.
  • Unlike light-tailed settings, the analysis accommodates heavy-tailed noise and biased/irreducible estimation errors in the zeroth-order oracle, showing the method preserves the same high-probability first-order iteration complexity.
  • Under the stated heavy-tailed and bias conditions, the method achieves $\mathcal{O}(\epsilon^{-2})$ iterations to reach a first-order $\epsilon$-stationary point and $\mathcal{O}(\epsilon^{-3})$ iterations for a second-order $\epsilon$-stationary point with high probability.
  • The authors validate the theoretical results with experiments and performance evaluations on the CUTEst benchmark test set.

Abstract

In this paper, we consider nonlinear optimization problems with a stochastic objective and deterministic equality constraints. We propose a Trust-Region Stochastic Sequential Quadratic Programming (TR-SSQP) method and establish its high-probability iteration complexity bounds for identifying first- and second-order \epsilon-stationary points. In our algorithm, we assume that exact objective values, gradients, and Hessians are not directly accessible but can be estimated via zeroth-, first-, and second-order probabilistic oracles. Compared to existing complexity studies of SSQP methods that rely on a zeroth-order oracle with sub-exponential tail noise (i.e., light-tailed) and focus mostly on first-order stationarity, our analysis accommodates biased (also referred to as irreducible in the literature) and heavy-tailed noise in the zeroth-order oracle, and significantly extends the analysis to second-order stationarity. We show that under heavy-tailed noise conditions, our SSQP method achieves the same high-probability first-order iteration complexity bounds as in the light-tailed noise setting, while further exhibiting promising second-order iteration complexity bounds. Specifically, the method identifies a first-order \epsilon-stationary point in \mathcal{O}(\epsilon^{-2}) iterations and a second-order \epsilon-stationary point in \mathcal{O}(\epsilon^{-3}) iterations with high probability, provided that \epsilon is lower bounded by a constant determined by the bias magnitude (i.e., the irreducible noise) in the estimation. We validate our theoretical findings and evaluate practical performance of our method on CUTEst benchmark test set.