OMCL: Open-vocabulary Monte Carlo Localization
arXiv cs.RO / 4/3/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper presents OMCL (Open-vocabulary Monte Carlo Localization), an extension of Monte Carlo Localization that uses vision-language features to compute observation likelihoods from a camera pose and a 3D map.
- It targets cases where robot measurements and the map come from different sensor modalities, addressing limitations of prior closed-set, environment-specific localization methods.
- OMCL supports cross-modality association between visual observations and map elements, enabling global localization initialization directly from natural-language descriptions of nearby objects.
- Experiments on Matterport3D and Replica demonstrate indoor robustness, and results on SemanticKITTI show outdoor generalization.




