Beyond Patient Invariance: Learning Cardiac Dynamics via Action-Conditioned JEPAs

arXiv cs.LG / 4/27/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that invariance-based self-supervised learning in healthcare can be mathematically misaligned with diagnosis because it suppresses transient pathological changes it should detect.
  • It proposes Action-Conditioned (event-conditioned) world models that simulate disease progression by treating pathology as a transition vector applied to a patient’s latent state rather than a static label.
  • Using an adaptation of the LeJEPA framework for physiological time-series, the model predicts future cardiac electrophysiological states conditioned on disease onset to disentangle stable anatomy from dynamic pathological forces.
  • On the MIMIC-IV-ECG dataset, the method outperforms fully supervised baselines on an ECG triage task and shows improved sample efficiency in low-resource settings, gaining over 0.05 AUROC.
  • The authors state the approach provides a denser and more robust supervision signal than static classification, and they provide source code via the linked GitHub repository.

Abstract

Self-supervised learning in healthcare has largely relied on invariance-based objectives, which maximize similarity between different views of the same patient. While effective for static anatomy, this paradigm is fundamentally misaligned with clinical diagnosis, as it mathematically compels the model to suppress the transient pathological changes it is intended to detect. We propose a shift towards Action-Conditioned World Models that learn to simulate the dynamics of disease progression, or Event-Conditioned. Adapting the LeJEPA framework to physiological time-series, we define pathology not as a static label, but as a transition vector acting on a patient's latent state. By predicting the future electrophysiological state of the heart given a disease onset, our model explicitly disentangles stable anatomical features from dynamic pathological forces. Evaluated on the MIMIC-IV-ECG dataset, our approach outperforms fully supervised baselines on the critical triage task. Crucially, we demonstrate superior sample efficiency: in low-resource regimes, our world model outperforms supervised learning by over 0.05 AUROC. These results suggest that modeling biological dynamics provides a dense supervision signal that is far more robust than static classification. Source code is available at https://github.com/cljosegfer/lesaude-dynamics