RSEdit: Text-Guided Image Editing for Remote Sensing
arXiv cs.CV / 3/17/2026
📰 NewsTools & Practical UsageModels & Research
Key Points
- RSEdit addresses artifacts and hallucinations that occur when applying general-domain text-guided image editors to remote sensing imagery due to limited RS knowledge and misaligned conditioning.
- It unifies pretrained diffusion models (U-Net and DiT) into instruction-following RS editors through channel concatenation and in-context token concatenation, enabling precise, physically coherent edits while preserving geospatial content.
- Trained on over 60,000 bi-temporal RS image pairs, RSEdit demonstrates strong gains over general and commercial baselines and generalizes across disaster impacts, urban growth, and seasonal shifts.
- The authors will release code, pretrained models, evaluation protocols, training logs, and generated results for full reproducibility, with code available at the linked GitHub repository.




