Improving Pediatric Emergency Department Triage with Modality Dropout in Late Fusion Multimodal EHR Models

arXiv cs.LG / 4/14/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper tackles “modality collapse” in multimodal triage models, where systems over-rely on structured tabular vitals and underuse clinical text, hurting generalizability for pediatric patients.
It proposes a late-fusion architecture combining XGBoost for tabular vitals and Bio_ClinicalBERT for unstructured clinical notes, then using a Logistic Regression meta-classifier to predict 5-level ESI acuity.
To address external validity, the model is trained only on adult encounters from MIMIC-IV and NHAMCS and tested in a pediatric cohort via zero-shot generalization.
Training uses symmetric modality dropout to force robustness against missing/overweighted modalities; a 30–40% dropout range produced the biggest gains in the pediatric evaluation.
The best reported pediatric improvement raises Quadratic Weighted Kappa to 0.351, and the authors argue modality dropout is an effective regularization strategy to reduce modality collapse in clinical AI.

Abstract

Emergency department triage relies heavily on both quantitative vital signs and qualitative clinical notes, yet multimodal machine learning models predicting triage acuity often suffer from modality collapse by over-relying on structured tabular data. This limitation severely hinders demographic generalizability, particularly for pediatric patients where developmental variations in vital signs make unstructured clinical narratives uniquely crucial. To address this gap, we propose a late-fusion multimodal architecture that processes tabular vitals via XGBoost and unstructured clinical text via Bio_ClinicalBERT, combined through a Logistic Regression meta-classifier to predict the 5-level Emergency Severity Index. To explicitly target the external validity problem, we train our model exclusively on adult encounters from the MIMIC-IV and NHAMCS datasets and evaluate its zero-shot generalization on a traditionally overlooked pediatric cohort. Furthermore, we employ symmetric modality dropout during training to prevent the ensemble from overfitting to adult-specific clinical correlations. Our results demonstrate that the multimodal framework significantly outperforms single-modality baselines. Most notably, applying a 30-40% symmetric modality dropout rate yielded steep performance improvements in the unseen pediatric cohort, elevating the Quadratic Weighted Kappa to 0.351. These findings highlight modality dropout as a critical regularization technique for mitigating modality collapse and enhancing cross-demographic generalization in clinical AI.

Black Hat Asia

AI Business

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Dev.to

Don't forget, there is more than forgetting: new metrics for Continual Learning

Dev.to

Microsoft MAI-Image-2-Efficient Review 2026: The AI Image Model Built for Production Scale

Dev.to

Bit of a strange question?

Reddit r/artificial

Improving Pediatric Emergency Department Triage with Modality Dropout in Late Fusion Multimodal EHR Models

Key Points

Abstract

Related Articles

Black Hat Asia

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Don't forget, there is more than forgetting: new metrics for Continual Learning

Microsoft MAI-Image-2-Efficient Review 2026: The AI Image Model Built for Production Scale

Bit of a strange question?

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer