AI Navigate

Evaluating Causal Discovery Algorithms for Path-Specific Fairness and Utility in Healthcare

arXiv cs.LG / 3/18/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The authors address evaluation challenges in causal discovery for healthcare by constructing proxy ground-truth graphs with clinical experts and benchmarking on synthetic Alzheimer's disease and heart failure records.
  • They evaluate the Peter-Clark, Greedy Equivalence Search, and Fast Causal Inference algorithms across structural recovery and path-specific fairness decomposition, going beyond composite fairness scores.
  • On synthetic data, Peter-Clark achieved the best structural recovery, while on heart failure data, Fast Causal Inference yielded the highest utility.
  • The study shows that path-specific effects, such as the ejection fraction, contribute measurable indirect effects (3.37 percentage points), driving variations in the fairness-utility ratio across algorithms.
  • The results underscore the need for graph-aware fairness evaluation and fine-grained path-specific analysis when deploying causal discovery in clinical settings.

Abstract

Causal discovery in health data faces evaluation challenges when ground truth is unknown. We address this by collaborating with experts to construct proxy ground-truth graphs, establishing benchmarks for synthetic Alzheimer's disease and heart failure clinical records data. We evaluate the Peter-Clark, Greedy Equivalence Search, and Fast Causal Inference algorithms on structural recovery and path-specific fairness decomposition, going beyond composite fairness scores. On synthetic data, Peter-Clark achieved the best structural recovery. On heart failure data, Fast Causal Inference achieved the highest utility. For path-specific effects, ejection fraction contributed 3.37 percentage points to the indirect effect in the ground truth. These differences drove variations in the fairness-utility ratio across algorithms. Our results highlight the need for graph-aware fairness evaluation and fine-grained path-specific analysis when deploying causal discovery in clinical applications.