Predicting Post-Traumatic Epilepsy from Clinical Records using Large Language Model Embeddings

arXiv cs.LG / 4/17/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The study explores whether routine acute clinical records can predict post-traumatic epilepsy (PTE) early without relying on resource-intensive neuroimaging.
  • It uses pretrained large language model (LLM) embeddings as fixed feature extractors to encode clinical records from a curated TRACK-TBI cohort, comparing tabular features, LLM embeddings, and hybrid representations.
  • A modality-aware fusion of tabular data and LLM embeddings outperformed single-source features, reaching an AUC-ROC of 0.892 and AUPRC of 0.798.
  • Predictive signal was strongly associated with factors such as acute post-traumatic seizures, injury severity, neurosurgical intervention, and ICU stay.
  • Overall, the work suggests LLM-embedding-based pipelines can complement imaging-based approaches for early PTE risk stratification.

Abstract

Objective: Post-traumatic epilepsy (PTE) is a debilitating neurological disorder that develops after traumatic brain injury (TBI). Early prediction of PTE remains challenging due to heterogeneous clinical data, limited positive cases, and reliance on resource-intensive neuroimaging data. We investigate whether routinely collected acute clinical records alone can support early PTE prediction using language model-based approaches. Methods: Using a curated subset of the TRACK-TBI cohort, we developed an automated PTE prediction framework that implements pretrained large language models (LLMs) as fixed feature extractors to encode clinical records. Tabular features, LLM-generated embeddings, and hybrid feature representations were evaluated using gradient-boosted tree classifiers under stratified cross-validation. Results: LLM embeddings achieved performance improvements by capturing contextual clinical information compared to using tabular features alone. The best performance was achieved by a modality-aware feature fusion strategy combining tabular features and LLM embeddings, achieving an AUC-ROC of 0.892 and AUPRC of 0.798. Acute post-traumatic seizures, injury severity, neurosurgical intervention, and ICU stay are key contributors to the predictive performance. Significance: These findings demonstrate that routine acute clinical records contain information suitable for early PTE risk prediction using LLM embeddings in conjunction with gradient-boosted tree classifiers. This approach represents a promising complement to imaging-based prediction.