Estimating Joint Interventional Distributions from Marginal Interventional Data

arXiv stat.ML / 4/20/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper presents a method to derive the full joint conditional distribution of variables by leveraging both observational data and interventional (marginal) interventional data using the Maximum Entropy principle.
  • It extends the Causal Maximum Entropy framework to incorporate interventional constraints, and proves via Lagrange duality that the resulting solution remains within the exponential family.
  • The proposed approach supports two key applications when only marginal interventional distributions are available for subsets of variables: causal feature selection from a mixture of data sources and inference of joint interventional distributions.
  • In synthetic experiments, the method improves over the state of the art for merging datasets and achieves performance comparable to the KCI test, which requires access to joint observational data for all variables.

Abstract

In this paper we show how to exploit interventional data to acquire the joint conditional distribution of all the variables using the Maximum Entropy principle. To this end, we extend the Causal Maximum Entropy method to make use of interventional data in addition to observational data. Using Lagrange duality, we prove that the solution to the Causal Maximum Entropy problem with interventional constraints lies in the exponential family, as in the Maximum Entropy solution. Our method allows us to perform two tasks of interest when marginal interventional distributions are provided for any subset of the variables. First, we show how to perform causal feature selection from a mixture of observational and single-variable interventional data, and, second, how to infer joint interventional distributions. For the former task, we show on synthetically generated data, that our proposed method outperforms the state-of-the-art method on merging datasets, and yields comparable results to the KCI-test which requires access to joint observations of all variables.