Conformal Robust Set Estimation

arXiv stat.ML / 4/21/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses a weakness of standard conformal prediction, namely limited robustness when data contain outliers or heavy tails.
  • It introduces a robust conformal approach using a non-conformity score based on a “half-mass radius” around each point, equivalently the distance to its (⌊n/2⌋+1)-nearest neighbor.
  • The authors prove the resulting conformal sets are marginally valid for any sample size and that they converge to a robust population central set defined via a distance-to-a-measure functional.
  • Under mild regularity assumptions, the method also comes with exponential concentration results and tail bounds that quantify how close the empirical conformal region is to its population target.
  • Overall, the work provides a probabilistic theoretical justification for using robust geometric scores in conformal prediction, including for heavy-tailed and multi-modal distributions.

Abstract

Conformal prediction provides finite-sample, distribution-free coverage under exchangeability, but standard constructions may lack robustness in the presence of outliers or heavy tails. We propose a robust conformal method based on a non-conformity score defined as the half-mass radius around a point, equivalently the distance to its (\lfloor n/2\rfloor+1)-nearest neighbour. We show that the resulting conformal regions are marginally valid for any sample size and converge in probability to a robust population central set defined through a distance-to-a-measure functional. Under mild regularity conditions, we establish exponential concentration and tail bounds that quantify the deviation between the empirical conformal region and its population counterpart. These results provide a probabilistic justification for using robust geometric scores in conformal prediction, even for heavy-tailed or multi-modal distributions.