Causal Diffusion Models for Counterfactual Outcome Distributions in Longitudinal Data

arXiv stat.ML / 4/15/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

共有:

Key Points

複数時点で治療判断が患者状態に依存する縦断データにおける反実仮想のアウトカム予測は、時点依存の交絡や不十分な不確実性推定のため難しいという課題を扱っています。
提案手法として、縦断的な逐次介入下で反実仮想アウトカムの「分布全体」を生成することを目的に設計されたCausal Diffusion Model（CDM）を提示しています。
CDMは残差型のdenoisingアーキテクチャとリレーショナル・セルフアテンションを用い、複雑な時間依存性や多峰性の軌跡を捉えつつ、従来の明示的なdeconfounding（例：逆確率重み付けや対立的バランシング）を不要にするとしています。
既存研究でも用いられる薬物動態-薬力学の腫瘍成長シミュレータで評価した結果、最先端の縦断因果推論手法に対して分布精度（1-Wasserstein距離）で15〜30%の相対改善を達成し、点推定精度（RMSE）も高交絡条件で競争的または優位であったと報告されています。
不確実性定量と堅牢な反実仮想予測を、逐次交絡が強い状況に統合した意思決定支援ツールとしての有用性が示されています。

Abstract

Predicting counterfactual outcomes in longitudinal data, where sequential treatment decisions heavily depend on evolving patient states, is critical yet notoriously challenging due to complex time-dependent confounding and inadequate uncertainty quantification in existing methods. We introduce the Causal Diffusion Model (CDM), the first denoising diffusion probabilistic approach explicitly designed to generate full probabilistic distributions of counterfactual outcomes under sequential interventions. CDM employs a novel residual denoising architecture with relational self-attention, capturing intricate temporal dependencies and multimodal outcome trajectories without requiring explicit adjustments (e.g., inverse-probability weighting or adversarial balancing) for confounding. In rigorous evaluation on a pharmacokinetic-pharmacodynamic tumor-growth simulator widely adopted in prior work, CDM consistently outperforms state-of-the-art longitudinal causal inference methods, achieving a 15-30% relative improvement in distributional accuracy (1-Wasserstein distance) while maintaining competitive or superior point-estimate accuracy (RMSE) under high-confounding regimes. By unifying uncertainty quantification and robust counterfactual prediction in complex, sequentially confounded settings, without tailored deconfounding, CDM offers a flexible, high-impact tool for decision support in medicine, policy evaluation, and other longitudinal domains.

Are gamers being used as free labeling labor? The rise of "Simulators" that look like AI training grounds [D]

Reddit r/MachineLearning

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Dev.to

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Dev.to

Failure to Reproduce Modern Paper Claims [D]

Reddit r/MachineLearning

Why don’t they just use Mythos to fix all the bugs in Claude Code?

Reddit r/LocalLLaMA

Causal Diffusion Models for Counterfactual Outcome Distributions in Longitudinal Data

Key Points

Abstract

Related Articles

Are gamers being used as free labeling labor? The rise of "Simulators" that look like AI training grounds [D]

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Failure to Reproduce Modern Paper Claims [D]

Why don’t they just use Mythos to fix all the bugs in Claude Code?

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer