Causal-Transformer with Adaptive Mutation-Locking for Early Prediction of Acute Kidney Injury

arXiv cs.LG / 4/23/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces CT-Former, a model designed for early prediction of Acute Kidney Injury (AKI) by combining continuous-time patient trajectory modeling with a Causal-Transformer architecture.
  • It addresses irregularly sampled clinical data without biased artificial imputation, using a continuous-time state evolution mechanism to reflect real patient timing.
  • To improve clinical trust and interpretability, the model replaces opaque hidden-state aggregation with a causal-attention module that produces a directed structural causal matrix for tracing when severe physiological shocks begin.
  • CT-Former is trained using a decoupled two-stage protocol that optimizes the causal-fusion component separately.
  • Experiments on the MIMIC-IV dataset (18,419 patients) show performance improvements over state-of-the-art baselines, indicating both higher accuracy and better interpretability for clinical decision-making.

Abstract

Accurate early prediction of Acute Kidney Injury (AKI) is critical for timely clinical intervention. However, existing deep learning models struggle with irregularly sampled data and suffer from the opaque "black-box" nature of sequential architectures, strictly limiting clinical trust. To address these challenges, we propose CT-Former, integrating continuous-time modeling with a Causal-Transformer. To handle data irregularity without biased artificial imputation, our framework utilizes a continuous-time state evolution mechanism to naturally track patient temporal trajectories. To resolve the black-box problem, our Causal-Attention module abandons uninterpretable hidden state aggregation. Instead, it generates a directed structural causal matrix to identify and trace the exact historical onset of severe physiological shocks. By establishing clear causal pathways between historical anomalies and current risk predictions, CT-Former provides native clinical interpretability. Training follows a decoupled two-stage protocol to optimize the causal-fusion process independently. Extensive experiments on the MIMIC-IV cohort (N=18,419) demonstrate that CT-Former significantly outperforms state-of-the-art baselines. The results confirm that our explicitly transparent architecture offers an accurate and trustworthy tool for clinical decision-making.