A Multimodal and Explainable Machine Learning Approach to Diagnosing Multi-Class Ejection Fraction from Electrocardiograms

arXiv cs.LG / 4/30/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The study proposes a multimodal, explainable ML framework to classify left ventricular ejection fraction (LVEF) into four clinical strata using both engineered 12-lead ECG features and structured EHR variables.
  • It trains XGBoost models on a large retrospective dataset (36,784 ECG–echocardiogram pairs) and evaluates robustness with temporal generalization using a later-period cohort (19,966 ECGs).
  • The model provides interpretability by using SHAP attributions to identify the most influential ECG and EHR features driving predictions.
  • The multimodal approach achieves strong one-vs-rest AUROC scores across classes (0.95 severe, 0.92 moderate, 0.82 mild, 0.91 normal), outperforming ECG-only and EHR-only baselines and holding up under temporal validation.
  • The authors argue the method could enable practical ECG-based screening and triage to prioritize confirmatory imaging in primary care and resource-constrained settings.

Abstract

Left ventricular ejection fraction (LVEF) assessment depends on echocardiography, limiting access in primary care and resource-constrained settings. We developed a multimodal machine-learning framework that combines engineered 12-lead ECG timeseries features with structured EHR variables to classify LVEF into four clinically used strata: normal (>50%), mildly reduced (40-50%), moderately reduced (30-40%), and severely reduced (<30%). To support model explainability, we identified the most influential ECG and EHR features via SHAP attributions. Using retrospective data from Hartford HealthCare, we trained XGBoost models on 36,784 ECG-echocardiogram pairs from 30,952 outpatients and evaluated temporal generalizability on 19,966 ECGs from a subsequent period. The multimodal model achieved one-vs-rest AUROCs of 0.95 (severe), 0.92 (moderate), 0.82 (mild), and 0.91 (normal), outperforming ECG-only and EHR-only baselines, and maintained performance under temporal validation. This work supports ECG-based, multimodal LVEF stratification as a practical screening and triage aid to prioritize confirmatory imaging where resources are limited.