Bias-Corrected Adaptive Conformal Inference for Multi-Horizon Time Series Forecasting

arXiv cs.LG / 4/16/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces Bias-Corrected Adaptive Conformal Inference (BC-ACI) to improve distribution-shift robustness in multi-horizon time series forecasting beyond standard Adaptive Conformal Inference (ACI).
  • BC-ACI addresses ACI’s limitation of only adapting interval width by estimating persistent forecast bias online (via an exponentially weighted moving average) and re-centering prediction intervals through bias-adjusted nonconformity scores.
  • An adaptive dead-zone threshold prevents unnecessary corrections when the estimated bias is within noise, aiming to avoid harming already well-calibrated data.
  • Across 688 experimental runs using two base models, four synthetic regimes, and three real datasets, BC-ACI reduces Winkler interval scores by 13–17% under mean/compound distribution shifts while preserving performance on stationary data.
  • The authors provide finite-sample analysis showing that coverage guarantees degrade gracefully as bias estimation error increases.

Abstract

Adaptive Conformal Inference (ACI) provides distribution-free prediction intervals with asymptotic coverage guarantees for time series under distribution shift. However, ACI only adapts the quantile threshold -- it cannot shift the interval center. When a base forecaster develops persistent bias after a regime change, ACI compensates by widening intervals symmetrically, producing unnecessarily conservative bands. We propose Bias-Corrected ACI (BC-ACI), which augments standard ACI with an online exponentially weighted moving average (EWM) estimate of forecast bias. BC-ACI corrects nonconformity scores before quantile computation and re-centers prediction intervals, addressing the root cause of miscalibration rather than its symptom. An adaptive dead-zone threshold suppresses corrections when estimated bias is indistinguishable from noise, ensuring no degradation on well-calibrated data. In controlled experiments across 688 runs spanning two base models, four synthetic regimes, and three real datasets, BC-ACI reduces Winkler interval scores by 13--17% under mean and compound distribution shifts (Wilcoxon p < 0.001) while maintaining equivalent performance on stationary data (ratio 1.002x). We provide finite-sample analysis showing that coverage guarantees degrade gracefully with bias estimation error.