BigEarthNet.txt: A Large-Scale Multi-Sensor Image-Text Dataset and Benchmark for Earth Observation
arXiv cs.CV / 4/1/2026
📰 NewsSignals & Early TrendsModels & Research
Key Points
- BigEarthNet.txt is introduced as a large-scale multi-sensor Earth observation (RS) image-text dataset built from co-registered Sentinel-1 SAR and Sentinel-2 multispectral imagery.
- The dataset includes 464,044 images paired with 9.6M text annotations featuring geographically anchored captions, visual question answering, and referring-expression instructions for bounding-box prediction.
- The authors report that BigEarthNet.txt offers greater textual richness and more diverse annotation types than prior RS image-text datasets.
- A manually verified benchmark split is provided to evaluate vision-language models on RS and CV tasks, highlighting current model limitations on complex land-use/land-cover (LULC) classes.
- Fine-tuning with BigEarthNet.txt is reported to yield consistent performance improvements across the evaluated tasks.
Related Articles

Black Hat Asia
AI Business

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Early Detection of Breast Cancer using SVM Classifier Technique
Dev.to

I Started Writing for Others. It Changed How I Learn.
Dev.to

Prompt-Driven Intune Administration | A Rahsi Framework™
Dev.to