FeatEHR-LLM: Leveraging Large Language Models for Feature Engineering in Electronic Health Records
arXiv cs.AI / 4/27/2026
💬 OpinionTools & Practical UsageModels & Research
Key Points
- The FeatEHR-LLM framework uses large language models (LLMs) to automatically generate clinically meaningful tabular features from irregularly sampled electronic health record (EHR) time series.
- It addresses EHR-specific challenges such as irregular observation intervals, varying measurement frequencies, and structural sparsity by using tool-augmented mechanisms that query temporal data and produce feature-extraction code that handles uneven patterns.
- To protect patient privacy, the LLM generates features using only dataset schemas and task descriptions rather than accessing raw patient records.
- The system supports both univariate and multivariate feature generation via an iterative, validation-in-the-loop pipeline.
- Across eight clinical prediction tasks on four ICU datasets, FeatEHR-LLM achieved the best mean AUROC on 7 of 8 tasks, improving results by up to 6 percentage points over strong baselines.
💡 Insights using this article
This article is featured in our daily AI news digest — key takeaways and action items at a glance.




