Clinically Interpretable Sepsis Early Warning via LLM-Guided Simulation of Temporal Physiological Dynamics

arXiv cs.LG / 4/24/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The study tackles the difficulty of delivering timely, interpretable sepsis early warnings by explicitly modeling the temporal progression of physiological decline rather than relying solely on opaque predictions.
  • It proposes an LLM-guided temporal simulation framework that (1) extracts spatiotemporal relationships from multivariate vital signs, (2) injects clinical reasoning cues into an LLM via a “medical prompt as prefix,” and (3) uses agent-based post-processing to keep outputs within physiologically plausible limits.
  • The method simulates trajectories of key physiological indicators first, then classifies sepsis onset, aiming to produce more transparent, clinically aligned warning signals.
  • Experiments on MIMIC-IV and eICU show improved performance for 24-to-4-hour pre-onset prediction tasks, reaching AUCs of 0.861–0.903 and outperforming conventional deep learning and rule-based baselines.
  • The authors emphasize that the approach provides interpretable risk trends and trajectories that could support early intervention and more personalized decision-making in ICU settings.

Abstract

Timely and interpretable early warning of sepsis remains a major clinical challenge due to the complex temporal dynamics of physiological deterioration. Traditional data-driven models often provide accurate yet opaque predictions, limiting physicians' confidence and clinical applicability. To address this limitation, we propose a Large Language Model (LLM)-guided temporal simulation framework that explicitly models physiological trajectories prior to disease onset for clinically interpretable prediction. The framework consists of a spatiotemporal feature extraction module that captures dynamic dependencies among multivariate vital signs, a Medical Prompt-as-Prefix module that embeds clinical reasoning cues into LLMs, and an agent-based post-processing component that constrains predictions within physiologically plausible ranges. By first simulating the evolution of key physiological indicators and then classifying sepsis onset, our model offers transparent prediction mechanisms that align with clinical judgment. Evaluated on the MIMIC-IV and eICU databases, the proposed method achieves superior AUC scores (0.861-0.903) across 24-4-hour pre-onset prediction tasks, outperforming conventional deep learning and rule-based approaches. More importantly, it provides interpretable trajectories and risk trends that can assist clinicians in early intervention and personalized decision-making in intensive care environments.