CIV-DG: Conditional Instrumental Variables for Domain Generalization in Medical Imaging

arXiv cs.CV / 3/27/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that cross-site generalization in medical imaging AI is undermined by selection bias, where patient demographics non-randomly determine which hospital/scanner they receive, creating spurious correlations with diagnostic labels.
  • It introduces CIV-DG, a causal domain generalization framework that uses Conditional Instrumental Variables to disentangle true pathological semantics from scanner/site artifacts even when standard instrumental variable assumptions do not hold.
  • The method relaxes the usual random-assignment requirement by modeling hospital selection as endogenously driven by demographics, addressing an identifiability barrier that conventional DG methods miss.
  • Implementation uses a DeepGMM-style architecture with a conditional critic to minimize moment violations and enforce orthogonality between instruments and error terms within demographic strata.
  • Experiments on Camelyon17 and large Chest X-Ray datasets show that CIV-DG outperforms existing baselines, supporting the value of conditional causal mechanisms for robust medical AI.

Abstract

Cross-site generalizability in medical AI is fundamentally compromised by selection bias, a structural mechanism where patient demographics (e.g., age, severity) non-randomly dictate hospital assignment. Conventional Domain Generalization (DG) paradigms, which predominantly target image-level distribution shifts, fail to address the resulting spurious correlations between site-specific variations and diagnostic labels. To surmount this identifiability barrier, we propose CIV-DG, a causal framework that leverages Conditional Instrumental Variables to disentangle pathological semantics from scanner-induced artifacts. By relaxing the strict random assignment assumption of standard IV methods, CIV-DG accommodates complex clinical scenarios where hospital selection is endogenously driven by patient demographics. We instantiate this theory via a Deep Generalized Method of Moments (DeepGMM) architecture, employing a conditional critic to minimize moment violations and enforce instrument-error orthogonality within demographic strata. Extensive experiments on the Camelyon17 benchmark and large-scale Chest X-Ray datasets demonstrate that CIV-DG significantly outperforms leading baselines, validating the efficacy of conditional causal mechanisms in resolving structural confounding for robust medical AI.