Diffusion Autoencoder for Unsupervised Artifact Restoration in Handheld Fundus Images

arXiv cs.CV / 4/20/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces an unsupervised diffusion autoencoder designed to restore artifacts in handheld fundus images, where issues like flash reflections, exposure variation, and motion blur commonly degrade quality.
  • Unlike many generative restoration methods that require paired supervision or fixed artifact assumptions, the proposed approach learns from only high-quality tabletop fundus images and adapts to unstructured degradations during inference.
  • The method combines a context encoder with a denoising diffusion process to produce semantically meaningful representations for artifact restoration.
  • The authors report improved diagnostic performance on an unseen dataset, achieving 81.17% accuracy across multiple artifact conditions, supported by quantitative and qualitative evaluation.

Abstract

The advent of handheld fundus imaging devices has made ophthalmologic diagnosis and disease screening more accessible, efficient, and cost-effective. However, images captured from these setups often suffer from artifacts such as flash reflections, exposure variations, and motion-induced blur, which degrade image quality and hinder downstream analysis. While generative models have been effective in image restoration, most depend on paired supervision or predefined artifact structures, making them less adaptable to unstructured degradations commonly observed in handheld fundus images. To address this, we propose an unsupervised diffusion autoencoder that integrates a context encoder with the denoising process to learn semantically meaningful representations for artifact restoration. The model is trained only on high-quality table-top fundus images and infers to restore artifact-affected handheld acquisitions. We validate the restorations through quantitative and qualitative evaluations, and have shown that diagnostic accuracy increases to 81.17% on an unseen dataset and multiple artifact conditions

Diffusion Autoencoder for Unsupervised Artifact Restoration in Handheld Fundus Images | AI Navigate