CLIP-Guided Data Augmentation for Night-Time Image Dehazing

arXiv cs.CV / 4/8/2026

📰 NewsSignals & Early TrendsModels & Research

Key Points

  • The paper addresses nighttime image dehazing as a harder problem than daytime due to haze scattering interacting with low light, non-uniform illumination, and strong light interference.
  • It proposes a unified NTIRE 2026 challenge framework that uses CLIP-based similarity screening to construct external training data that is closer to the target domain under limited supervision.
  • The method trains NAFNet in two stages—first adapting to the target domain and then expanding to broader degradation patterns to reduce domain drift and training instability.
  • For inference, it combines TLC, x8 self-ensemble, and weighted snapshot fusion to improve stability and output quality without requiring major network redesign.

Abstract

Nighttime image dehazing faces a more complex degradation pattern than its daytime counterpart, as haze scattering couples with low illumination, non-uniform lighting, and strong light interference. Under limited supervision, this complexity aggravates domain drift and training instability, since target-domain samples are scarce while naively introducing external data may weaken adaptation due to distribution mismatch. This paper presents our solution to the NTIRE 2026 Night Time Image Dehazing Challenge, built as a unified framework that integrates domain-aligned data construction, stage-wise training, and inference-time enhancement. Specifically, a pre-trained CLIP visual encoder screens candidate external samples by similarity to construct training data closer to the target domain. NAFNet is then trained in two stages, first adapting to the target domain and then expanding to broader degradation patterns. At inference time, TLC, x8 self-ensemble, and weighted snapshot fusion are combined to improve output stability. Rather than relying on complex network redesign, the proposed framework offers a practical and effective pipeline for nighttime image dehazing.