Multi-Domain Empirical Bayes for Linearly-Mixed Causal Representations

arXiv stat.ML / 3/24/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies causal representation learning by applying empirical Bayes to estimate low-dimensional causal latent variables from high-dimensional observations.
  • It focuses on multi-domain learning where domain differences are treated as interventions within a shared underlying causal model, turning the task into a simultaneous inference problem suited to EB.
  • The authors introduce an “EB f-modeling” algorithm for a linear measurement setting using an EM-style procedure grounded in causally structured score matching when the causal graph and intervention targets are known.
  • They also discuss an “EB g-modeling” perspective in relation to existing causal representation learning approaches.
  • Experiments on synthetic data show the proposed method can estimate causal representations more accurately than alternative CRL methods.

Abstract

Causal representation learning (CRL) aims to learn low-dimensional causal latent variables from high-dimensional observations. While identifiability has been extensively studied for CRL, estimation has been less explored. In this paper, we explore the use of empirical Bayes (EB) to estimate causal representations. In particular, we consider the problem of learning from data from multiple domains, where differences between domains are modeled by interventions in a shared underlying causal model. Multi-domain CRL naturally poses a simultaneous inference problem that EB is designed to tackle. Here, we propose an EB f-modeling algorithm that improves the quality of learned causal variables by exploiting invariant structure within and across domains. Specifically, we consider a linear measurement model and interventional priors arising from a shared acyclic SCM. When the graph and intervention targets are known, we develop an EM-style algorithm based on causally structured score matching. We further discuss EB g-modeling in the context of existing CRL approaches. In experiments on synthetic data, our proposed method achieves more accurate estimation than other methods for CRL.