Restore, Assess, Repeat: A Unified Framework for Iterative Image Restoration

arXiv cs.CV / 3/30/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces RAR (Restore, Assess, and Repeat), a unified iterative framework that combines Image Quality Assessment (IQA) with Image Restoration (IR) to improve generalization across unknown and composite degradations.
  • RAR performs restoration entirely in the latent domain, jointly handling degradation identification, restoration, and quality verification within an end-to-end trainable model.
  • By tightly integrating IQA and IR, the approach reduces latency and information loss that can occur when IQA and restoration are kept as separate modules, especially during decoding steps.
  • Experiments report consistent improvements over prior methods for single, unknown, and composite degradation scenarios, claiming new state-of-the-art performance.

Abstract

Image restoration aims to recover high quality images from inputs degraded by various factors, such as adverse weather, blur, or low light. While recent studies have shown remarkable progress across individual or unified restoration tasks, they still suffer from limited generalization and inefficiency when handling unknown or composite degradations. To address these limitations, we propose RAR, a Restore, Assess and Repeat process, that integrates Image Quality Assessment (IQA) and Image Restoration (IR) into a unified framework to iteratively and efficiently achieve high quality image restoration. Specifically, we introduce a restoration process that operates entirely in the latent domain to jointly perform degradation identification, image restoration, and quality verification. The resulting model is fully trainable end to end and allows for an all-in-one assess and restore approach that dynamically adapts the restoration process. Also, the tight integration of IQA and IR into a unified model minimizes the latency and information loss that typically arises from keeping the two modules disjoint, (e.g. during image and/or text decoding). Extensive experiments show that our approach consistent improvements under single, unknown and composite degradations, thereby establishing a new state-of-the-art.