Explore Like Humans: Autonomous Exploration with Online SG-Memo Construction for Embodied Agents
arXiv cs.CV / 4/22/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that structured spatial memory is crucial for long-horizon embodied navigation, and criticizes existing two-stage, offline memory reconstruction approaches for being overly geometry-focused and missing semantic landmarks.
- It proposes ABot-Explorer, an online, RGB-only active exploration framework that unifies exploration and memory construction using Large Vision-Language Models (VLMs) to extract Semantic Navigational Affordances (SNA).
- ABot-Explorer incorporates SNAs into a hierarchical SG-Memo to emulate human-like exploration logic by prioritizing structural transit nodes for efficient coverage.
- The authors release a large-scale dataset extending InteriorGS with SNA and SG-Memo annotations, enabling research on semantic-aligned memory construction.
- Experiments show ABot-Explorer achieves significantly better exploration efficiency and environment coverage than prior state-of-the-art methods, and its SG-Memo supports multiple downstream tasks effectively.



