TOL: Textual Localization with OpenStreetMap
arXiv cs.CV / 4/3/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces a new global localization task, “Textual Localization with OpenStreetMap (T2O),” aiming to estimate accurate 2-DoF urban positions from text descriptions without geometric inputs or GNSS initialization.
- It proposes TOL, a large-scale benchmark with ~121K text queries paired with OSM map tiles and coverage across Boston, Karlsruhe, and Singapore, totaling about 316 km of road trajectories.
- The authors develop TOLoc, a coarse-to-fine framework that uses direction-aware semantic features from both text and OSM tiles to retrieve candidate locations before regressing pose via a dedicated alignment module.
- Experimental results show TOLoc outperforms the best prior method by 6.53%, 9.93%, and 8.31% at 5m, 10m, and 25m thresholds, and it generalizes well to unseen environments.
- The dataset, code, and models are planned for public release via the provided GitHub repository.
Related Articles

Why I built an AI assistant that doesn't know who you are
Dev.to

DenseNet Paper Walkthrough: All Connected
Towards Data Science

Meta Adaptive Ranking Model: What Instagram Advertisers Gain in 2026 | MKDM
Dev.to

The Facebook insider building content moderation for the AI era
TechCrunch
Qwen3.5 vs Gemma 4: Benchmarks vs real world use?
Reddit r/LocalLLaMA