Towards Realistic Open-Vocabulary Remote Sensing Segmentation: Benchmark and Baseline
arXiv cs.CV / 4/20/2026
📰 NewsSignals & Early TrendsModels & Research
Key Points
- The paper proposes OVRSISBenchV2, a large-scale, application-oriented benchmark to better evaluate open-vocabulary remote sensing image segmentation under realistic open-world geospatial demands.
- It introduces OVRSIS95K (about 95K image–mask pairs across 35 semantic categories) and expands evaluation with 10 downstream datasets, yielding 170K images and 128 categories to increase diversity and difficulty.
- OVRSISBenchV2 goes beyond general open-vocabulary segmentation by adding downstream protocols for building extraction, road extraction, and flood detection.
- The authors propose Pi-Seg, a baseline that improves transferability using a “positive-incentive noise” mechanism with learnable, semantically guided perturbations to broaden the visual-text feature space during training.
- Experiments across OVRSISBenchV1, OVRSISBenchV2, and downstream tasks show Pi-Seg performs strongly and consistently, especially on the harder OVRSISBenchV2 benchmark, and the code/datasets are publicly available.
Related Articles
Which Version of Qwen 3.6 for M5 Pro 24g
Reddit r/LocalLLaMA

From Theory to Reality: Why Most AI Agent Projects Fail (And How Mine Did Too)
Dev.to

GPT-5.4-Cyber: OpenAI's Game-Changer for AI Security and Defensive AI
Dev.to
Local LLM Beginner’s Guide (Mac - Apple Silicon)
Reddit r/artificial

Is Your Skill Actually Good? Systematically Validating Agent Skills with Evals
Dev.to