Weighted Bayesian Conformal Prediction

arXiv cs.LG / 4/9/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces Weighted Bayesian Conformal Prediction (WBCP), extending Bayesian Quadrature-based conformal prediction to importance-weighted settings under distribution shift.
  • It replaces the uniform Dirichlet posterior used in BQ-CP with a weighted Dirichlet prior parameterized by Kish’s effective sample size (n_eff) and the importance weights, addressing the i.i.d.-dependent limitation of prior Bayesian CP work.
  • The authors prove results linking n_eff to matching frequentist vs Bayesian variances, establishing that posterior uncertainty shrinks at rate O(1/√n_eff), and extending stochastic dominance and data-conditional coverage guarantees.
  • For spatial problems, they instantiate WBCP as Geographical BQ-CP using kernel-based spatial weights, producing per-location posteriors with interpretable diagnostics.
  • Experiments on both synthetic and real datasets show WBCP maintains coverage guarantees while delivering richer, data-conditional uncertainty than traditional weighted (frequentist) conformal approaches.

Abstract

Conformal prediction provides distribution-free prediction intervals with finite-sample coverage guarantees, and recent work by Snell \& Griffiths reframes it as Bayesian Quadrature (BQ-CP), yielding powerful data-conditional guarantees via Dirichlet posteriors over thresholds. However, BQ-CP fundamentally requires the i.i.d. assumption -- a limitation the authors themselves identify. Meanwhile, weighted conformal prediction handles distribution shift via importance weights but remains frequentist, producing only point-estimate thresholds. We propose \textbf{Weighted Bayesian Conformal Prediction (WBCP)}, which generalizes BQ-CP to arbitrary importance-weighted settings by replacing the uniform Dirichlet \Dir(1,\ldots,1) with a weighted Dirichlet \Dir( eff \cdot \tilde{w}_1, \ldots, eff \cdot \tilde{w}_n), where eff is Kish's effective sample size. We prove four theoretical results: (1)~ eff is the unique concentration parameter matching frequentist and Bayesian variances; (2)~posterior standard deviation decays as O(1/\sqrt{ eff}); (3)~BQ-CP's stochastic dominance guarantee extends to per-weight-profile data-conditional guarantees; (4)~the HPD threshold provides O(1/\sqrt{ eff}) improvement in conditional coverage. We instantiate WBCP for spatial prediction as \emph{Geographical BQ-CP}, where kernel-based spatial weights yield per-location posteriors with interpretable diagnostics. Experiments on synthetic and real-world spatial datasets demonstrate that WBCP maintains coverage guarantees while providing substantially richer uncertainty information.

Weighted Bayesian Conformal Prediction | AI Navigate