Causal Effect Estimation with Learned Instrument Representations

arXiv stat.ML / 4/8/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses a key limitation of instrumental variable (IV) causal inference: valid instruments are often unavailable or hard to identify in real observational data settings.
  • It proposes a representation-learning framework (ZNet) that derives “instrumental representations” from observed covariates, enabling IV-style estimation even when no explicit instrument exists.
  • ZNet’s architecture is designed to reflect the structural causal model of IVs by decomposing features into confounding and instrumental components and training via moment conditions tied to instrument validity (relevance, exclusion restriction, and instrumental unconfoundedness).
  • The method is positioned as compatible with many downstream two-stage IV estimators, making it a modular component for causal effect estimation.
  • Experiments suggest ZNet can both recover known ground-truth instruments when present and construct latent instruments in an embedding space when they are not.

Abstract

Instrumental variable (IV) methods mitigate bias from unobserved confounding in observational causal inference but rely on the availability of a valid instrument, which can often be difficult or infeasible to identify in practice. In this paper, we propose a representation learning approach that constructs instrumental representations from observed covariates, which enable IV-based estimation even in the absence of an explicit instrument. Our model (ZNet) achieves this through an architecture that mirrors the structural causal model of IVs; it decomposes the ambient feature space into confounding and instrumental components, and is trained by enforcing empirical moment conditions corresponding to the defining properties of valid instruments (i.e., relevance, exclusion restriction, and instrumental unconfoundedness). Importantly, ZNet is compatible with a wide range of downstream two-stage IV estimators of causal effects. Our experiments demonstrate that ZNet can (i) recover ground-truth instruments when they already exist in the ambient feature space and (ii) construct latent instruments in the embedding space when no explicit IVs are available. Our work suggests when ZNet can be used as a module for causal inference in general observational settings.