MedGemma 1.5 Technical Report

arXiv cs.AI / 4/8/2026

📰 NewsSignals & Early TrendsModels & Research

Key Points

  • The paper introduces MedGemma 1.5 4B, a new release in the MedGemma model family, expanding beyond MedGemma 1 with added multimodal clinical capabilities.
  • MedGemma 1.5 supports high-dimensional medical imaging (3D CT/MRI volumes and whole-slide histopathology), anatomical localization via bounding boxes, and multi-timepoint chest X-ray analysis.
  • It also improves medical document understanding, including lab reports and electronic health records, with new training data and modality-specific techniques such as long-context 3D volume slicing and whole-slide pathology sampling.
  • Reported performance gains versus MedGemma 1 include +11% absolute for 3D MRI condition classification, +3% for 3D CT condition classification, +47% macro F1 for whole-slide pathology, and +35% IoU improvement for chest X-ray localization.
  • The model further improves clinical text-based benchmarks, including +5% MedQA accuracy, +22% EHRQA accuracy, and ~18% macro F1 across four lab-report extraction datasets, and is released as an open resource with tutorials.

Abstract

We introduce MedGemma 1.5 4B, the latest model in the MedGemma collection. MedGemma 1.5 expands on MedGemma 1 by integrating additional capabilities: high-dimensional medical imaging (CT/MRI volumes and histopathology whole slide images), anatomical localization via bounding boxes, multi-timepoint chest X-ray analysis, and improved medical document understanding (lab reports, electronic health records). We detail the innovations required to enable these modalities within a single architecture, including new training data, long-context 3D volume slicing, and whole-slide pathology sampling. Compared to MedGemma 1 4B, MedGemma 1.5 4B demonstrates significant gains in these new areas, improving 3D MRI condition classification accuracy by 11% and 3D CT condition classification by 3% (absolute improvements). In whole slide pathology imaging, MedGemma 1.5 4B achieves a 47% macro F1 gain. Additionally, it improves anatomical localization with a 35% increase in Intersection over Union on chest X-rays and achieves a 4% macro accuracy for longitudinal (multi-timepoint) chest x-ray analysis. Beyond its improved multimodal performance over MedGemma 1, MedGemma 1.5 improves on text-based clinical knowledge and reasoning, improving by 5% on MedQA accuracy and 22% on EHRQA accuracy. It also achieves an average of 18% macro F1 on 4 different lab report information extraction datasets (EHR Datasets 2, 3, 4, and Mendeley Clinical Laboratory Test Reports). Taken together, MedGemma 1.5 serves as a robust, open resource for the community, designed as an improved foundation on which developers can create the next generation of medical AI systems. Resources and tutorials for building upon MedGemma 1.5 can be found at https://goo.gle/MedGemma.