deSEO: Physics-Aware Dataset Creation for High-Resolution Satellite Image Shadow Removal

arXiv cs.CV / 5/6/2026

📰 NewsModels & Research

Key Points

  • The paper introduces deSEO, a physics-aware, geometry-consistent approach to create paired (shadow / shadow-free) supervision for high-resolution satellite image shadow removal.
  • It fills a gap in existing Earth-observation datasets, which are often designed for shadow detection or 3D modeling and typically lack paired, geometry-consistent shadow-free targets.
  • deSEO builds pairs from the S-EO shadow detection dataset by selecting weak-reference tiles and using temporal/geometric filtering plus Jacobian-based orientation normalization and LoFTR-RANSAC registration.
  • The method uses a per-pixel validity mask to limit learning to reliably aligned regions, and it trains a DSM-aware deshadowing model that combines residual translation, perceptual losses, and mask-constrained adversarial learning.
  • Experiments show consistent reductions in shadow visual impact across varying illumination and viewing conditions, while a direct adaptation of UAV-based SRNet/pix2pix architectures fails to converge for satellite viewpoint variability.

Abstract

Shadows cast by terrain and tall structures remain a major obstacle for high-resolution satellite image analysis, degrading classification, detection, and 3D reconstruction performance. Public resources offering geometry-consistent paired shadow/shadow-free satellite imagery are essentially missing, and most Earth-observation datasets are designed for shadow detection or 3D modelling rather than removal. Existing deep shadow-removal datasets either target ground-level or aerial scenes or rely on unpaired and weakly supervised formulations rather than explicit satellite pairs. We address this gap with deSEO, a geometry-aware and physics-informed methodology that, to the best of our knowledge, is the first to derive paired supervision for satellite shadow removal from the S-EO shadow detection dataset through a fully replicable pipeline. For each tile, deSEO selects a minimally shadowed acquisition as a weak reference and pairs it with shadowed counterparts using temporal and geometric filtering, Jacobian-based orientation normalisation, and LoFTR-RANSAC registration. A per-pixel validity mask restricts learning to reliably aligned regions, enabling supervision despite residual off-nadir parallax. In addition to this paired dataset, we develop a DSM-aware deshadowing model that combines residual translation, perceptual objectives, and mask-constrained adversarial learning. In contrast, a direct adaptation of a UAV-based SRNet/pix2pix architecture fails to converge under satellite viewpoint variability. Our model consistently reduces the visual impact of cast shadows across diverse illumination and viewing conditions, achieving improved structural and perceptual fidelity on held-out scenes. deSEO therefore provides the first reproducible, geometry-aware paired dataset and baseline for shadow removal in satellite Earth observation.