AI Navigate

DINOLight: Robust Ambient Light Normalization with Self-supervised Visual Prior Integration

arXiv cs.CV / 3/16/2026

📰 NewsModels & Research

Key Points

  • DINOLight introduces a new ambient light normalization framework that uses DINOv2's self-supervised features as a visual prior for restoration.
  • It features an adaptive feature fusion module that combines DINOv2 multi-layer features using a point-wise softmax mask.
  • The fused features are integrated into the restoration network in both spatial and frequency domains via an auxiliary cross-attention mechanism.
  • Experiments on Ambient6K show state-of-the-art performance with competitive results on shadow-removal benchmarks, and code will be released upon acceptance.

Abstract

This paper presents a new ambient light normalization framework, DINOLight, that integrates the self-supervised model DINOv2's image understanding capability into the restoration process as a visual prior. Ambient light normalization aims to restore images degraded by non-uniform shadows and lighting caused by multiple light sources and complex scene geometries. We observe that DINOv2 can reliably extract both semantic and geometric information from a degraded image. Based on this observation, we develop a novel framework to utilize DINOv2 features for lighting normalization. First, we propose an adaptive feature fusion module that combines features from different DINOv2 layers using a point-wise softmax mask. Next, the fused features are integrated into our proposed restoration network in both spatial and frequency domains through an auxiliary cross-attention mechanism. Experiments show that DINOLight achieves superior performance on the Ambient6K dataset, and that DINOv2 features are effective for enhancing ambient light normalization. We also apply our method to shadow-removal benchmark datasets, achieving competitive results compared to methods that use mask priors. Codes will be released upon acceptance.