DINOLight: Robust Ambient Light Normalization with Self-supervised Visual Prior Integration

arXiv cs.CV / 3/16/2026

📰 NewsModels & Research

共有:

Key Points

DINOLight introduces a new ambient light normalization framework that uses DINOv2's self-supervised features as a visual prior for restoration.
It features an adaptive feature fusion module that combines DINOv2 multi-layer features using a point-wise softmax mask.
The fused features are integrated into the restoration network in both spatial and frequency domains via an auxiliary cross-attention mechanism.
Experiments on Ambient6K show state-of-the-art performance with competitive results on shadow-removal benchmarks, and code will be released upon acceptance.

Abstract

This paper presents a new ambient light normalization framework, DINOLight, that integrates the self-supervised model DINOv2's image understanding capability into the restoration process as a visual prior. Ambient light normalization aims to restore images degraded by non-uniform shadows and lighting caused by multiple light sources and complex scene geometries. We observe that DINOv2 can reliably extract both semantic and geometric information from a degraded image. Based on this observation, we develop a novel framework to utilize DINOv2 features for lighting normalization. First, we propose an adaptive feature fusion module that combines features from different DINOv2 layers using a point-wise softmax mask. Next, the fused features are integrated into our proposed restoration network in both spatial and frequency domains through an auxiliary cross-attention mechanism. Experiments show that DINOLight achieves superior performance on the Ambient6K dataset, and that DINOv2 features are effective for enhancing ambient light normalization. We also apply our method to shadow-removal benchmark datasets, achieving competitive results compared to methods that use mask priors. Codes will be released upon acceptance.

PearlOS. We gave swarm intelligence a local desktop environment and code control to self-evolve. Has been pretty incredible to see so far. Open source and free if you want your own.

Reddit r/LocalLLaMA

QwenDean-4B | fine-tuned SLM for UIGen; our first attempt, looking for feedback!

Reddit r/LocalLLaMA

acestep.cpp: portable C++17 implementation of ACE-Step 1.5 music generation using GGML. Runs on CPU, CUDA, ROCm, Metal, Vulkan

Reddit r/LocalLLaMA

Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding

Hugging Face Blog

Newest GPU server in the lab! 72gb ampere vram!

Reddit r/LocalLLaMA

DINOLight: Robust Ambient Light Normalization with Self-supervised Visual Prior Integration

Key Points

Abstract

Related Articles

PearlOS. We gave swarm intelligence a local desktop environment and code control to self-evolve. Has been pretty incredible to see so far. Open source and free if you want your own.

QwenDean-4B | fine-tuned SLM for UIGen; our first attempt, looking for feedback!

acestep.cpp: portable C++17 implementation of ACE-Step 1.5 music generation using GGML. Runs on CPU, CUDA, ROCm, Metal, Vulkan

Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding

Newest GPU server in the lab! 72gb ampere vram!

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer

Key Points

Abstract

Related Articles

PearlOS. We gave swarm intelligence a local desktop environment and code control to self-evolve. Has been pretty incredible to see so far. Open source and free if you want your own.

QwenDean-4B | fine-tuned SLM for UIGen; our first attempt, looking for feedback!

acestep.cpp: portable C++17 implementation of ACE-Step 1.5 music generation using GGML. Runs on CPU, CUDA, ROCm, Metal, Vulkan

**Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding**

Newest GPU server in the lab! 72gb ampere vram!

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer

Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding