Automating Clinical Information Retrieval from Finnish Electronic Health Records Using Large Language Models
arXiv cs.CL / 3/30/2026
💬 OpinionIdeas & Deep AnalysisTools & Practical UsageModels & Research
Key Points
- The paper proposes a locally deployable Clinical Contextual Question Answering (CCQA) framework that answers clinician questions directly from Finnish EHR text without transferring data externally.
- It benchmarks multiple open-source LLMs (4B–70B parameters) using an offline dataset of 1,664 expert-annotated question–answer pairs from 183 patients, with most text in Finnish.
- Llama-3.1-70B achieved high free-text performance (95.3% accuracy and 97.3% consistency across semantically equivalent question variants), while Qwen3-30B-A3B-2507 showed comparable results.
- Quantization to 4-bit and 8-bit helped reduce GPU memory needs while largely preserving predictive performance, improving deployment feasibility in offline settings.
- Clinical evaluation found clinically significant errors in 2.9% of outputs and showed that semantically equivalent questions can still produce discordant answers, underscoring the need for validation and human oversight.




