DRIVE-Nav: Directional Reasoning, Inspection, and Verification for Efficient Open-Vocabulary Navigation
arXiv cs.RO / 3/31/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- DRIVE-Nav is a structured framework for Open-Vocabulary Object Navigation that improves route stability by reasoning over persistent directional cues rather than dense, incomplete frontiers.
- It inspects and tracks directional candidates extracted from weighted Fast Marching Method (FMM) paths, then restricts future decisions to still-relevant directions within a forward 240° view range to cut redundant revisits and action overhead.
- DRIVE-Nav enhances grounding reliability by combining vision-language-guided prompt enrichment with cross-frame verification to better confirm target semantics across views.
- Experiments on HM3D-OVON, HM3Dv2, and MP3D show strong performance and efficiency gains, including 50.2% SR and 32.6% SPL on HM3D-OVON with improvements over prior bests.
- The method also performs well in transfer to a physical humanoid robot and in real-world deployment, indicating robustness beyond simulation.


