Serialisation Strategy Matters: How FHIR Data Format Affects LLM Medication Reconciliation
arXiv cs.CL / 4/24/2026
📰 NewsDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that how FHIR data is serialized before being fed to an LLM is a largely understudied but fundamental variable for medication reconciliation performance.
- It presents the first systematic comparison of four FHIR serialization strategies (Raw JSON, Markdown Table, Clinical Narrative, and Chronological Timeline) across five open-weight LLMs using a controlled benchmark of 200 synthetic patients and 4,000 inference runs.
- For models up to 8B parameters, “Clinical Narrative” significantly outperforms “Raw JSON,” improving F1 by up to 19 points for Mistral-7B, while the advantage flips at the 70B scale where “Raw JSON” yields the best mean F1.
- The study finds omission is the dominant error mode (missing an active medication more often than hallucinating one), implying that clinical safety auditing should prioritize coverage/omissions over fabrication.
- Smaller models also plateau around 7–10 concurrent active medications, systematically under-serving polypharmacy patients, and BioMistral-7B outputs were unusable in all conditions, indicating that domain pretraining alone is insufficient without instruction tuning.
Related Articles

Your MCP server probably has too many tools
Dev.to

MCP Auth That Actually Works: OAuth for Remote Servers
Dev.to

GoDavaii's Day 5: When 22 Indian Languages Redefine 'Hard' in Health AI
Dev.to

Gemma 4 and Qwen 3.6 with q8_0 and q4_0 KV cache: KL divergence results
Reddit r/LocalLLaMA
Corea arresta a hombre por imagen IA falsa del lobo Neukgu: hasta 5 años
Dev.to