Extreme Conformal Prediction: Reliable Intervals for High-Impact Events

arXiv stat.ML / 4/2/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses a key limitation of classical conformal prediction: when regulators demand extremely high confidence levels relative to limited calibration data, standard methods can yield infinitely wide, uninformative intervals.
  • It introduces “extreme conformal prediction,” which combines conformal prediction with extreme value statistics to produce reliable, informative high-confidence prediction intervals.
  • The method is designed to work with any black-box extreme quantile regression model, allowing broad applicability without requiring specialized modeling architectures.
  • It also presents a weighted variant to handle nonstationary data, improving robustness when data distributions change over time.
  • The approach is validated via simulations and an application to flood risk forecasting, demonstrating improved interval informativeness under extreme-confidence requirements.

Abstract

Conformal prediction is a popular method to construct prediction intervals with marginal coverage guarantees from black-box machine learning models. In applications with potentially high-impact events, such as flooding or financial crises, regulators often require very high confidence for such intervals. However, if the desired level of confidence is too large relative to the amount of data used for calibration, then classical conformal methods provide infinitely wide, thus, uninformative prediction intervals. In this paper, we propose a new method to overcome this limitation. We bridge extreme value statistics and conformal prediction to provide reliable and informative prediction intervals with high-confidence coverage, which can be constructed using any black-box extreme quantile regression method. A weighted version of our approach can account for nonstationary data. The advantages of our extreme conformal prediction method are illustrated in a simulation study and in an application to flood risk forecasting.