Validating a Deep Learning Algorithm to Identify Patients with Glaucoma using Systemic Electronic Health Records

arXiv cs.LG / 4/24/2026

📰 NewsDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research

Key Points

  • The study validated a pretrained glaucoma risk assessment (GRA) deep learning model on an independent institution using systemic EHR data only.
  • It analyzed 20,636 Stanford patients (15% with glaucoma) and fine-tuned the model on the Stanford cohort before testing on a held-out set.
  • The best-performing model reached AUROC 0.883 and PPV 0.657, indicating strong discrimination for glaucoma risk.
  • Model calibration matched clinical risk patterns, with the highest prediction decile showing the highest diagnosis (65.7%) and treatment (57.0%) rates.
  • Performance improved with more trainable layers (up to 15) and additional training data, supporting scalable pre-screening without specialized imaging.

Abstract

We evaluated whether a glaucoma risk assessment (GRA) model trained on All of Us national data can identify patients at high probability of glaucoma using only systemic electronic health records (EHR) at an independent institution. In this cross-sectional study, 20,636 Stanford patients seen from November 2013 to January 2024 were included (15% with glaucoma). A pretrained GRA model was fine-tuned on the Stanford cohort and tested on a held-out set using demographics, systemic diagnoses, medications, laboratory results, and physical examination measurements as inputs. The best model achieved AUROC 0.883 and PPV 0.657. Calibration was consistent with clinical risk: the highest prediction decile showed the greatest glaucoma diagnosis rate (65.7%) and treatment rate (57.0%). Performance improved with more trainable layers up to 15 and with additional data. An EHR-only GRA model may enable scalable and accessible pre-screening without specialized imaging.