Debiased Machine Learning for Conformal Prediction of Counterfactual Outcomes Under Runtime Confounding

arXiv stat.ML / 4/7/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses counterfactual outcome prediction across a source and a target population using conformal prediction to produce assumption-lean uncertainty intervals.
  • It highlights a limitation of prior approaches: they require all confounders used in training to be measured in the target population, otherwise prediction intervals may have miscoverage.
  • The authors propose a computationally efficient debiased machine learning framework grounded in semiparametric efficiency theory to maintain valid coverage under “runtime confounding,” where only a subset of confounders is observed in the target.
  • Experiments on synthetic and semi-synthetic data show improved coverage validity and faster convergence relative to standard methods, demonstrating practical usefulness.
  • Overall, the contribution is a new method for producing reliable counterfactual prediction intervals when target-population confounding measurements are incomplete.

Abstract

Data-driven decision making frequently relies on predicting counterfactual outcomes. In practice, researchers commonly train counterfactual prediction models on a source dataset to inform decisions on a possibly separate target population. Conformal prediction has arisen as a popular method for producing assumption-lean prediction intervals for counterfactual outcomes that would arise under different treatment decisions in the target population of interest. However, existing methods require that every confounding factor of the treatment-outcome relationship used for training on the source data is additionally measured in the target population, risking miscoverage if important confounders are unmeasured in the target population. In this paper, we introduce a computationally efficient debiased machine learning framework that allows for valid prediction intervals when only a subset of confounders is measured in the target population, a common challenge referred to as runtime confounding. Grounded in semiparametric efficiency theory, we show the resulting prediction intervals achieve desired coverage rates with faster convergence compared to standard methods. Through numerous synthetic and semi-synthetic experiments, we demonstrate the utility of our proposed method.