A Multidisciplinary AI Board for Multimodal Dementia Characterization and Risk Assessment

arXiv cs.AI / 2026/3/24

💬 オピニオンSignals & Early TrendsIdeas & Deep AnalysisModels & Research

要点

  • The paper introduces “Cerebra,” an interactive multi-agent AI system that coordinates specialized agents to analyze EHR data, clinical notes, and medical imaging for dementia characterization and risk assessment.
  • It emphasizes clinician-facing decision support by combining visual analytics with a conversational interface, allowing clinicians to interrogate predictions and contextualize risk at the point of care.
  • Cerebra is designed to be robust to incomplete modalities and supports privacy-preserving deployment by working on structured representations rather than raw data streams.
  • Evaluated on a large multi-institution dataset covering 3 million patients across four healthcare systems, Cerebra outperforms both state-of-the-art single-modality models and multimodal LLM baselines on multiple metrics (e.g., AUROC up to 0.80 for risk, 0.86 for diagnosis, C-index 0.81 for survival).
  • A reader study with experienced physicians found improved performance, with prospective dementia risk estimation accuracy increasing by 17.5 percentage points versus experts without the system.

Abstract

Modern clinical practice increasingly depends on reasoning over heterogeneous, evolving, and incomplete patient data. Although recent advances in multimodal foundation models have improved performance on various clinical tasks, most existing models remain static, opaque, and poorly aligned with real-world clinical workflows. We present Cerebra, an interactive multi-agent AI team that coordinates specialized agents for EHR, clinical notes, and medical imaging analysis. These outputs are synthesized into a clinician-facing dashboard that combines visual analytics with a conversational interface, enabling clinicians to interrogate predictions and contextualize risk at the point of care. Cerebra supports privacy-preserving deployment by operating on structured representations and remains robust when modalities are incomplete. We evaluated Cerebra using a massive multi-institutional dataset spanning 3 million patients from four independent healthcare systems. Cerebra consistently outperformed both state-of-the-art single-modality models and large multimodal language model baselines. In dementia risk prediction, it achieved AUROCs up to 0.80, compared with 0.74 for the strongest single-modality model and 0.68 for language model baselines. For dementia diagnosis, it achieved an AUROC of 0.86, and for survival prediction, a C-index of 0.81. In a reader study with experienced physicians, Cerebra significantly improved expert performance, increasing accuracy by 17.5 percentage points in prospective dementia risk estimation. These results demonstrate Cerebra's potential for interpretable, robust decision support in clinical care.