Information-theoretic signatures of causality in Bayesian networks and hypergraphs

arXiv stat.ML / 4/14/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses how to connect higher-order information theory to causal discovery by relating Partial Information Decomposition (PID) components to causal structure in multivariate systems.
  • It proves that, in Bayesian networks, PID’s unique information corresponds to direct causal neighbors while synergy corresponds to collider relationships, enabling a local-information-based approach to causal discovery.
  • It extends the theory to causal hypergraphs, showing that PID signatures can distinguish causal roles such as parents, children, co-heads, and co-tails, including a collider effect specific to multi-tail hyperedges.
  • The authors position PID as a rigorous, model-agnostic foundation for inferring both pairwise and higher-order causal structure without relying on global search over graph structures.

Abstract

Analyzing causality in multivariate systems involves establishing how information is generated, distributed and combined. Traditional causal discovery frameworks are capable of multivariate reasoning but their intrinsic pairwise graph topology restricts them to do so only indirectly by integrating multivariate information across pairwise edges. Higher-order information theory provides direct tools that can explicitly model higher-order interactions. In particular, Partial Information Decomposition (PID) allows the decomposition of the information that a set of sources provides about a target into redundant, unique, and synergistic components. Yet the mathematical connection between such higher-order information-theoretic measures and causal structure remains undeveloped. Here we establish the first theoretical correspondence between PID components and causal structure in both Bayesian networks and hypergraphs. We first show that in Bayesian networks unique information precisely characterizes direct causal neighbors, while synergy identifies collider relationships. This establishes a localist causal discovery paradigm in which the structure surrounding each variable can be recovered from its immediate informational footprint, eliminating the need for global search over graph space. Extending these results to more expressive causal representation, we prove that PID signatures in Bayesian hypergraphs differentiate parents, children, co-heads, and co-tails, revealing a novel collider effect unique to multi-tail hyperedges. Our results position PID as a rigorous, model-agnostic foundation for inferring both pairwise and higher-order causal structure, and introduce a fundamentally local information-theoretic viewpoint on causal discovery.