AI Navigate

Bilevel Layer-Positioning LoRA for Real Image Dehazing

arXiv cs.CV / 3/12/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a haze-to-clear text-directed loss that leverages CLIP's cross-modal capabilities to formulate real image dehazing as a semantic alignment problem in latent space, providing unsupervised guidance without reference images.
  • It introduces BiLaLoRA (Bilevel Layer-positioning LoRA), which jointly learns LoRA parameters and automatically searches injection layers for targeted adaptation of critical network layers.
  • The method reports superior performance on multiple real-world dehazing benchmarks, demonstrating effective adaptation without full fine-tuning.
  • The authors release code on GitHub for reproducibility and practical adoption.

Abstract

Learning-based real image dehazing methods have achieved notable progress, yet they still face adaptation challenges in diverse real haze scenes. These challenges mainly stem from the lack of effective unsupervised mechanisms for unlabeled data and the heavy cost of full model fine-tuning. To address these challenges, we propose the haze-to-clear text-directed loss that leverages CLIP's cross-modal capabilities to reformulate real image dehazing as a semantic alignment problem in latent space, thereby providing explicit unsupervised cross-modal guidance in the absence of reference images. Furthermore, we introduce the Bilevel Layer-positioning LoRA (BiLaLoRA) strategy, which learns both the LoRA parameters and automatically search the injection layers, enabling targeted adaptation of critical network layers. Extensive experiments demonstrate our superiority against state-of-the-art methods on multiple real-world dehazing benchmarks. The code is publicly available at https://github.com/YanZhang-zy/BiLaLoRA.