Dual Causal Inference: Integrating Backdoor Adjustment and Instrumental Variable Learning for Medical VQA
arXiv cs.CV / 4/23/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces Dual Causal Inference (DCI) for Medical Visual Question Answering (MedVQA) to reduce reliance on superficial cross-modal correlations present in multimodal medical data.
- DCI unifies Backdoor Adjustment (BDA) for observable cross-modal biases with Instrumental Variable (IV) learning to address unobserved confounders within a single Structural Causal Model (SCM).
- It enforces mutual information constraints to ensure the learned IV is informative from fused multimodal representations while being minimally associated with unobserved confounders and target answers.
- Experiments on four benchmarks (SLAKE, SLAKE-CP, VQA-RAD, PathVQA) show consistent improvements over prior methods, with notable gains in out-of-distribution (OOD) generalization.
- Qualitative results suggest DCI improves interpretability and robustness by disentangling true causal effects from spurious visual-text shortcuts.
Related Articles

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Trajectory Forecasts in Unknown Environments Conditioned on Grid-Based Plans
Dev.to

Why use an AI gateway at all?
Dev.to

OpenAI Just Named It Workspace Agents. We Open-Sourced Our Lark Version Six Months Ago
Dev.to

GPT Image 2 Subject-Lock Editing: A Practical Guide to input_fidelity
Dev.to