Winner of CVPR2026 NTIRE Challenge on Image Shadow Removal: Semantic and Geometric Guidance for Shadow Removal via Cascaded Refinement

arXiv cs.CV / 4/20/2026

📰 NewsIndustry & Market MovesModels & Research

Key Points

  • The paper describes a three-stage, progressive image shadow-removal pipeline for the CVPR 2026 NTIRE WSRD+ challenge, built on OmniSR and formulated as iterative direct refinement.
  • It jointly leverages RGB appearance, frozen DINOv2 semantic guidance, and geometric cues derived from monocular depth and surface normals, with the same guidance reused across all stages.
  • To make the multi-stage cascade stable, the authors propose a contraction-constrained objective that promotes non-increasing reconstruction error through the refinement stages.
  • Training uses a staged transfer strategy from earlier WSRD pretraining to WSRD+ supervision, followed by a final WSRD+ 2026 adaptation and cosine-annealed checkpoint ensembling.
  • On the official hidden WSRD+ 2026 test set, the final ensemble achieved the best overall results, winning the NTIRE 2026 Image Shadow Removal Challenge and performing strongly on ISTD+ and UAV-SC+ as well.

Abstract

We present a three-stage progressive shadow-removal pipeline for the CVPR2026 NTIRE WSRD+ challenge. Built on OmniSR, our method treats deshadowing as iterative direct refinement, where later stages correct residual artefacts left by earlier predictions. The model combines RGB appearance with frozen DINOv2 semantic guidance and geometric cues from monocular depth and surface normals, reused across all stages. To stabilise multi-stage optimisation, we introduce a contraction-constrained objective that encourages non-increasing reconstruction error across the cascade. A staged training pipeline transfers from earlier WSRD pretraining to WSRD+ supervision and final WSRD+ 2026 adaptation with cosine-annealed checkpoint ensembling. On the official WSRD+ 2026 hidden test set, our final ensemble achieved 26.680 PSNR, 0.8740 SSIM, 0.0578 LPIPS, and 26.135 FID, ranked first overall, and won the NTIRE 2026 Image Shadow Removal Challenge. The strong performance of the proposed model is further validated on the ISTD+ and UAV-SC+ datasets.