From Cross-Validation to SURE: Asymptotic Risk of Tuned Regularized Estimators

arXiv stat.ML / 3/24/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper analyzes regularized ERM models whose tuning parameters are selected via n-fold cross-validation, deriving the resulting asymptotic out-of-sample risk.
  • It shows that the cross-validated estimator’s predictive loss converges in distribution to the squared-error risk of corresponding shrinkage estimators in the normal means model, using Stein’s Unbiased Risk Estimate (SURE) as the tuning target.
  • The authors argue that this SURE-based risk function offers a more detailed performance characterization than standard learning-theory worst-case regret bounds, because it reveals how risk changes with the true underlying parameter.
  • Key technical results include uniform convergence of n-fold CV to SURE and a “well-separation” property: although SURE can have multiple local minima, its global minimum is generically well separated.
  • Well-separation is used to justify that uniform CV-to-SURE convergence implies convergence of the CV-selected tuning parameter to the SURE-selected one.

Abstract

We derive the asymptotic risk function of regularized empirical risk minimization (ERM) estimators tuned by n-fold cross-validation (CV). The out-of-sample prediction loss of such estimators converges in distribution to the squared-error loss (risk function) of shrinkage estimators in the normal means model, tuned by Stein's unbiased risk estimate (SURE). This risk function provides a more fine-grained picture of predictive performance than uniform bounds on worst-case regret, which are common in learning theory: it quantifies how risk varies with the true parameter. As key intermediate steps, we show that (i) n-fold CV converges uniformly to SURE, and (ii) while SURE typically has multiple local minima, its global minimum is generically well separated. Well-separation ensures that uniform convergence of CV to SURE translates into convergence of the tuning parameter chosen by CV to that chosen by SURE.