Improving Pediatric Emergency Department Triage with Modality Dropout in Late Fusion Multimodal EHR Models

arXiv cs.LG / 4/14/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper tackles “modality collapse” in multimodal triage models, where systems over-rely on structured tabular vitals and underuse clinical text, hurting generalizability for pediatric patients.
  • It proposes a late-fusion architecture combining XGBoost for tabular vitals and Bio_ClinicalBERT for unstructured clinical notes, then using a Logistic Regression meta-classifier to predict 5-level ESI acuity.
  • To address external validity, the model is trained only on adult encounters from MIMIC-IV and NHAMCS and tested in a pediatric cohort via zero-shot generalization.
  • Training uses symmetric modality dropout to force robustness against missing/overweighted modalities; a 30–40% dropout range produced the biggest gains in the pediatric evaluation.
  • The best reported pediatric improvement raises Quadratic Weighted Kappa to 0.351, and the authors argue modality dropout is an effective regularization strategy to reduce modality collapse in clinical AI.

Abstract

Emergency department triage relies heavily on both quantitative vital signs and qualitative clinical notes, yet multimodal machine learning models predicting triage acuity often suffer from modality collapse by over-relying on structured tabular data. This limitation severely hinders demographic generalizability, particularly for pediatric patients where developmental variations in vital signs make unstructured clinical narratives uniquely crucial. To address this gap, we propose a late-fusion multimodal architecture that processes tabular vitals via XGBoost and unstructured clinical text via Bio_ClinicalBERT, combined through a Logistic Regression meta-classifier to predict the 5-level Emergency Severity Index. To explicitly target the external validity problem, we train our model exclusively on adult encounters from the MIMIC-IV and NHAMCS datasets and evaluate its zero-shot generalization on a traditionally overlooked pediatric cohort. Furthermore, we employ symmetric modality dropout during training to prevent the ensemble from overfitting to adult-specific clinical correlations. Our results demonstrate that the multimodal framework significantly outperforms single-modality baselines. Most notably, applying a 30-40% symmetric modality dropout rate yielded steep performance improvements in the unseen pediatric cohort, elevating the Quadratic Weighted Kappa to 0.351. These findings highlight modality dropout as a critical regularization technique for mitigating modality collapse and enhancing cross-demographic generalization in clinical AI.