The Global-Local loop: what is missing in bridging the gap between geospatial data from numerous communities?

arXiv cs.CV / 3/24/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that the geospatial field has access to unprecedented, multi-scale data from both satellites and citizen/in-situ sources, but still struggles to leverage it effectively across communities and applications.
  • It critiques common “master-slave” data fusion approaches that predominantly treat one mainstream dataset as primary and use other sources only to support it, limiting mutual benefit and introducing community bias.
  • The authors propose addressing missing “symmetrized” fusion and stronger feedback/bridging mechanisms—termed a “global-local loop”—to enable retroactions between scales, communities, and data types.
  • Through illustrative use cases, the paper outlines which interaction schemes are most relevant and highlights under-explored research directions for building more effective generic and thematic geospatial solutions.

Abstract

We face a unprecedented amount of geospatial data, describing directly or indirectly the Earth Surface at multiple spatial, temporal, and semantic scales, and stemming from numerous contributors, from satellites to citizens. The main challenge in all the geospatial-related communities lies in suitably leveraging a combination of some of the sources for either a generic or a thematic application. Certain data fusion schemes are predominantly exploited: they correspond to popular tasks with mainstream data sources, e.g., free archives of Sentinel images coupled with OpenStreetMap data under an open and widespread deep-learning backbone for land-cover mapping purposes. Most of these approaches unfortunately operate under a "master-slave" paradigm, where one source is basically integrated to help processing the "main" source, without mutual advantages (e.g., large-scale estimation of a given biophysical variable using in-situ observations) and under a specific community bias. We argue that numerous key data fusion configurations, and in particular the effort in symmetrizing the exploitation of multiple data sources, are insufficiently addressed while being highly beneficial for generic or thematic applications. Bridges and retroactions between scales, communities and their respective sources are lacking, neglecting the utmost potential of such a "global-local loop". In this paper, we propose to establish the most relevant interaction schemes through illustrative use cases. We subsequently discuss under-explored research directions that could take advantage of leveraging available data through multiples extents and communities.