Managing Diabetic Retinopathy with Deep Learning: A Data Centric Overview

arXiv cs.AI / 4/6/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper reviews and compares fundus image datasets for diabetic retinopathy (DR), focusing on how dataset limitations affect deep learning performance and clinical reliability.
  • It evaluates dataset usability across key DR tasks such as binary classification, severity grading, lesion localization, and multi-disease screening, and organizes datasets by size, accessibility, and annotation type.
  • The analysis highlights persistent gaps including inconsistent image quality, geographically narrow coverage, lack of standardized lesion-level annotations, and limited longitudinal data.
  • As a case study, it presents a recently published dataset to illustrate challenges in dataset curation, usage, and annotation practices.
  • The paper concludes with recommendations for future dataset development aimed at enabling clinically reliable and explainable DR screening systems.

Abstract

Diabetic Retinopathy (DR) is a serious microvascular complication of diabetes, and one of the leading causes of vision loss worldwide. Although automated detection and grading, with Deep Learning (DL), can reduce the burden on ophthalmologists, it is constrained by the limited availability of high-quality datasets. Existing repositories often remain geographically narrow, contain limited samples, and exhibit inconsistent annotations or variable image quality; thereby, restricting their clinical reliability. This paper presents a comprehensive review and comparative analysis of fundus image datasets used in the management of DR. The study evaluates their usability across key tasks, including binary classification, severity grading, lesion localization, and multi-disease screening. It also categorizes the datasets by size, accessibility, and annotation type (such as image-level, lesion-level, and multi-disease). Finally, a recently published dataset is presented as a case study to illustrate broader challenges in dataset curation and usage. The review consolidates current knowledge while highlighting persistent gaps such as the lack of standardized lesion-level annotations and longitudinal data. It also outlines recommendations for future dataset development to support clinically reliable and explainable solutions in DR screening.