Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework powered by large language models

arXiv cs.AI / 2026/3/24

💬 オピニオンSignals & Early TrendsIdeas & Deep AnalysisModels & Research

要点

  • The paper addresses shortcomings of existing geospatial data catalogs, which largely rely on keyword search and struggle to capture user intent due to semantic inconsistencies across distributed metadata standards.
  • It proposes a knowledge-graph-driven multi-agent framework that uses a unified geospatial metadata ontology as a semantic mediation layer to align heterogeneous metadata.
  • The framework builds a geospatial metadata knowledge graph to represent datasets and their multidimensional relationships, enabling more structured retrieval.
  • A multi-agent architecture powered by large language models performs intent parsing, knowledge-graph retrieval, and answer synthesis to create an interpretable, closed-loop discovery workflow from query to results.
  • Experimental use cases indicate improved intent matching accuracy, ranking quality, recall, and discovery transparency versus traditional keyword-based systems, positioning the approach for next-generation Autonomous GIS.

Abstract

The rapid growth in the volume, variety, and velocity of geospatial data has created data ecosystems that are highly distributed, heterogeneous, and semantically inconsistent. Existing data catalogs, portals, and infrastructures still rely largely on keyword-based search with limited semantic support, which often fails to capture user intent and leads to weak retrieval performance. To address these challenges, this study proposes a knowledge graph-driven multi-agent framework for intelligent geospatial data discovery, powered by large language models. The framework introduces a unified geospatial metadata ontology as a semantic mediation layer to align heterogeneous metadata standards across platforms and constructs a geospatial metadata knowledge graph to explicitly model datasets and their multidimensional relationships. Building on the structured representation, the framework adopts a multi-agent collaborative architecture to perform intent parsing, knowledge graph retrieval, and answer synthesis, forming an interpretable and closed-loop discovery process from user queries to results. Results from representative use cases and performance evaluation show that the framework substantially improves intent matching accuracy, ranking quality, recall, and discovery transparency compared with traditional systems. This study advances geospatial data discovery toward a more semantic, intent-aware, and intelligent paradigm, providing a practical foundation for next-generation intelligent and autonomous spatial data infrastructures and contributing to the broader vision of Autonomous GIS.