AI Navigate

CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation

arXiv cs.CV / 3/13/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • CrossEarth-SAR introduces a billion-scale SAR vision foundation model built on a physics-guided sparse mixture-of-experts (MoE) architecture designed for cross-domain semantic segmentation across SAR sensors and regions.
  • The work provides CrossEarth-SAR-200K, a large dataset combining public and private SAR imagery with weak and full supervision to enable scalable pre-training.
  • A benchmark suite with 22 sub-benchmarks across 8 domain gaps establishes a unified standard for domain generalization in SAR semantic segmentation.
  • Experimental results show state-of-the-art performance on 20 benchmarks, with over 10% improvement in mIoU on some tasks under multi-gap transfer, and plans to release code, benchmarks, and datasets publicly.

Abstract

Synthetic Aperture Radar (SAR) enables global, all-weather earth observation. However, owing to diverse imaging mechanisms, domain shifts across sensors and regions severely hinder its semantic generalization. To address this, we present CrossEarth-SAR, the first billion-scale SAR vision foundation model built upon a novel physics-guided sparse mixture-of-experts (MoE) architecture incorporating physical descriptors, explicitly designed for cross-domain semantic segmentation. To facilitate large-scale pre-training, we develop CrossEarth-SAR-200K, a weakly and fully supervised dataset that unifies public and private SAR imagery. We also introduce a benchmark suite comprising 22 sub-benchmarks across 8 distinct domain gaps, establishing the first unified standard for domain generalization semantic segmentation on SAR imagery. Extensive experiments demonstrate that CrossEarth-SAR achieves state-of-the-art results on 20 benchmarks, surpassing previous methods by over 10\% mIoU on some benchmarks under multi-gap transfer. All code, benchmark and datasets will be publicly available.