GeoAI Agency Primitives

arXiv cs.CV / 4/3/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes “agency primitives” as an additional layer for GeoAI assistants, arguing that model advances alone have not delivered productivity gains for GIS practitioners.
  • It introduces a 9-item vocabulary of primitives (including navigation, perception, geo-referenced memory, and dual modeling) intended to support iterative, human-in-the-loop GIS workflows.
  • The work presents a benchmark designed to measure human productivity improvements from agentic GIS assistance rather than relying solely on model capability metrics.
  • The stated goal is to make agentic assistance in GIS more implementable, testable, and comparable across systems and evaluations.

Abstract

We present ongoing research on agency primitives for GeoAI assistants -- core capabilities that connect Foundation models to the artifact-centric, human-in-the-loop workflows where GIS practitioners actually work. Despite advances in satellite image captioning, visual question answering, and promptable segmentation, these capabilities have not translated into productivity gains for practitioners who spend most of their time producing vector layers, raster maps, and cartographic products. The gap is not model capability alone but the absence of an agency layer that supports iterative collaboration. We propose a vocabulary of 9 primitives for such a layer -- including navigation, perception, geo-referenced memory, and dual modeling -- along with a benchmark that measures human productivity. Our goal is a vocabulary that makes agentic assistance in GIS implementable, testable, and comparable.