Resource-Conscious Modeling for Next- Day Discharge Prediction Using Clinical Notes

arXiv cs.AI / 4/7/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The study assesses resource-conscious modeling approaches to predict next-day discharge in elective spine surgery using postoperative clinical notes.
  • It benchmarks 13 models, finding that a non-generative pipeline (TF-IDF with LGBM) achieves the best overall performance, including an AUC-ROC of 0.80 and F1-score of 0.47 for the discharge class.
  • Compact LLMs fine-tuned with LoRA (e.g., DistilGPT-2 and Bio_ClinicalBERT) show limited effectiveness overall, with LoRA helping recall for DistilGPT2 but not surpassing the leading text-based baseline.
  • The authors conclude that interpretable, lightweight models can outperform transformer-based/generative approaches in imbalanced, real-world clinical prediction settings, supporting practical deployment under resource constraints.

Abstract

Timely discharge prediction is essential for optimizing bed turnover and resource allocation in elective spine surgery units. This study evaluates the feasibility of lightweight, fine-tuned large language models (LLMs) and traditional text-based models for predicting next-day discharge using postoperative clinical notes. We compared 13 models, including TF-IDF with XGBoost and LGBM, and compact LLMs (DistilGPT-2, Bio_ClinicalBERT) fine-tuned via LoRA. TF-IDF with LGBM achieved the best balance, with an F1-score of 0.47 for the discharge class, a recall of 0.51, and the highest AUC-ROC (0.80). While LoRA improved recall in DistilGPT2, overall transformer-based and generative models underperformed. These findings suggest interpretable, resource-efficient models may outperform compact LLMs in real-world, imbalanced clinical prediction tasks.