Proximity Matters: Local Proximity Enhanced Balancing for Treatment Effect Estimation

arXiv stat.ML / 3/26/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses heterogeneous treatment effect (HTE) estimation from observational data by tackling treatment selection bias, which arises when treated and control groups differ systematically.
  • It proposes CFR-Pro, a Proximity-enhanced Counterfactual Regression method that adds a pair-wise local proximity regularizer (via optimal transport) to better capture local similarity rather than relying only on global latent-space alignment.
  • Because proximity/discrepancy calculations become unreliable in high dimensions with limited data, the authors introduce an informative subspace projector to improve sample complexity by trading off some distance precision.
  • Experiments show CFR-Pro more accurately matches units across treatment groups, reduces selection bias, and outperforms existing baseline methods.
  • The authors provide open-source code for CFR-Pro on GitHub to support replication and practical experimentation.

Abstract

Heterogeneous treatment effect (HTE) estimation from observational data poses significant challenges due to treatment selection bias. Existing methods address this bias by minimizing distribution discrepancies between treatment groups in latent space, focusing on global alignment. However, the fruitful aspect of local proximity, where similar units exhibit similar outcomes, is often overlooked. In this study, we propose Proximity-enhanced CounterFactual Regression (CFR-Pro) to exploit proximity for enhancing representation balancing within the HTE estimation context. Specifically, we introduce a pair-wise proximity regularizer based on optimal transport to incorporate the local proximity in discrepancy calculation. However, the curse of dimensionality renders the proximity measure and discrepancy estimation ineffective -- exacerbated by limited data availability for HTE estimation. To handle this problem, we further develop an informative subspace projector, which trades off minimal distance precision for improved sample complexity. Extensive experiments demonstrate that CFR-Pro accurately matches units across different treatment groups, effectively mitigates treatment selection bias, and significantly outperforms competitors. Code is available at https://github.com/HowardZJU/CFR-Pro.