Can Large Language Models Reliably Extract Physiology Index Values from Coronary Angiography Reports?
arXiv cs.CL / 4/16/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper evaluates whether large language models can reliably extract physiology index values and their anatomical locations from unstructured coronary angiography (CAG) reports, focusing on Portuguese clinical text.
- Using a corpus of 1,342 reports, the study is presented as the first to address physiology-index extraction at this scale for CAG reports and as one of the few efforts targeting CAG/Portuguese clinical language.
- It compares local privacy-preserving general-purpose and medical LLMs under multiple prompting strategies (zero-shot, few-shot, and few-shot with implausible examples) and tests constrained generation plus a RegEx-based post-processing step.
- The authors propose a multi-stage evaluation framework that separately measures format validity, value detection, and value correctness while considering asymmetric clinical error costs.
- Results indicate that non-medical models can perform similarly, with the best overall performance reported for Llama under zero-shot prompting and GPT-OSS showing the highest robustness to prompt changes, while constrained generation and RegEx augmentation did not significantly improve most models’ outcomes.
Related Articles
"The AI Agent's Guide to Sustainable Income: From Zero to Profitability"
Dev.to
"The Hidden Economics of AI Agents: Survival Strategies in Competitive Markets"
Dev.to
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to
"The Hidden Costs of AI Agent Deployment: A CFO's Guide to True ROI in Enterpris
Dev.to
"The Real Cost of AI Compute: Why Token Efficiency Separates Viable Agents from
Dev.to