ReMedi: Reasoner for Medical Clinical Prediction

arXiv cs.CL / 5/5/2026

Key Points

  • ReMedi is a new framework designed to improve prediction of future clinical outcomes from electronic health records (EHR), where patient data is complex and heterogeneous.
  • Existing approaches mainly enhance medical knowledge through distillation or RAG and leave interpretation of the patient context to the model's internal ability; ReMedi instead generates rationale-answer pairs using a challenging sample regeneration mechanism.
  • The approach uses ground-truth answers as hints during rationale generation, integrating outcome guidance into the preference data construction loop for subsequent fine-tuning and preference tuning.
  • Experiments across multiple EHR prediction tasks show substantial improvements, with up to a 19.9% gain in F1 score over state-of-the-art baselines.
  • Overall, the results suggest that explicitly guiding training with ground-truth outcome information can meaningfully boost real-world clinical prediction performance.
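
The regeneration loop described in the key points can be illustrated with a minimal Python sketch. It is an assumption-laden reading of the summary, not the authors' implementation: the `generate` callable, the `Sample` and `PreferencePair` types, the retry limit, and the rule that pairs a hinted correct rationale (chosen) with the original incorrect one (rejected) are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

# Hypothetical generator signature: (question, hint) -> (rationale, answer).
# In practice this would wrap an LLM call; here it is left abstract.
Generator = Callable[[str, Optional[str]], Tuple[str, str]]


@dataclass
class Sample:
    question: str  # EHR context plus the clinical question
    label: str     # ground-truth outcome


@dataclass
class PreferencePair:
    question: str
    chosen: Tuple[str, str]    # (rationale, answer) that matches the label
    rejected: Tuple[str, str]  # (rationale, answer) that does not


def build_training_data(
    samples: List[Sample], generate: Generator, max_retries: int = 2
) -> Tuple[List[Tuple[str, str, str]], List[PreferencePair]]:
    """Collect fine-tuning examples and preference pairs (assumed scheme)."""
    sft_examples: List[Tuple[str, str, str]] = []
    preference_pairs: List[PreferencePair] = []
    for s in samples:
        # First attempt: no hint, the model reasons on its own.
        rationale, answer = generate(s.question, None)
        if answer == s.label:
            sft_examples.append((s.question, rationale, answer))
            continue
        # Challenging sample: regenerate with the ground-truth answer as a hint.
        for _ in range(max_retries):
            new_rationale, new_answer = generate(s.question, s.label)
            if new_answer == s.label:
                sft_examples.append((s.question, new_rationale, new_answer))
                # Assumption: the failed attempt serves as the rejected side.
                preference_pairs.append(
                    PreferencePair(
                        s.question, (new_rationale, new_answer), (rationale, answer)
                    )
                )
                break
    return sft_examples, preference_pairs


def toy_generate(question: str, hint: Optional[str]) -> Tuple[str, str]:
    """Stand-in for an LLM: always answers 'no' unless given a hint."""
    if hint is not None:
        return (f"The record contains evidence consistent with '{hint}'.", hint)
    return ("No strong risk factors are noted.", "no")


samples = [
    Sample("Readmission within 30 days? <record A>", "no"),
    Sample("Readmission within 30 days? <record B>", "yes"),
]
sft, prefs = build_training_data(samples, toy_generate)
```

With the toy generator, record A is answered correctly without a hint and contributes only a fine-tuning example, while record B is answered incorrectly at first, regenerated with the hint, and contributes both a fine-tuning example and one preference pair. The resulting pairs follow the prompt/chosen/rejected shape that preference-tuning methods commonly expect.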

Abstract

Predicting future clinical outcomes from electronic health records (EHR) remains challenging due to the complexity and heterogeneity of patient data. LLMs have shown strong potential for such predictive tasks, yet existing approaches mainly focus on enhancing medical knowledge through distillation or RAG while relying on the model's internal ability to interpret contextual information. In this work, we present ReMedi (Reasoner for Medical Clinical Prediction), a framework for improving clinical outcome prediction from EHR. ReMedi generates rationale-answer pairs using a challenging sample regeneration mechanism for complex clinical questions, which leverages ground-truth answers as hints to enhance reasoning for further fine-tuning and preference tuning. ReMedi integrates ground-truth outcome guidance into the preference data construction loop, regenerating rationale-answer variants. By tuning on these rationale-answer pairs, the model improves its predictive performance. Experiments on multiple EHR prediction tasks demonstrate substantial gains of up to 19.9 percent over state-of-the-art baselines in terms of F1 score, underscoring ReMedi's effectiveness in real-world clinical prediction.