NaviMaster: Learning a Unified Policy for GUI and Embodied Navigation Tasks
arXiv cs.RO / 3/26/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that GUI navigation and embodied navigation can be framed as Markov Decision Processes (MDPs), enabling a unified learning approach instead of treating them as separate domains.
- It introduces NaviMaster, a unified agent that learns both GUI and embodied navigation using a shared visual-target trajectory collection pipeline and a single reinforcement learning framework trained on mixed data.
- The method includes a distance-aware reward function designed to make learning more efficient from the collected trajectories.
- Experiments on out-of-domain benchmarks reportedly show NaviMaster outperforming prior state-of-the-art agents across GUI navigation, spatial affordance prediction, and embodied navigation, with ablation studies supporting the contributions of the unified training, data mixing, and reward design.
- The authors provide code, data, and checkpoints via the project website to support replication and further research.
Related Articles
Regulating Prompt Markets: Securities Law, Intellectual Property, and the Trading of Prompt Assets
Dev.to
Mercor competitor Deccan AI raises $25M, sources experts from India
Dev.to
How We Got Local MCP Servers Working in Claude Cowork (The Missing Guide)
Dev.to
How Should Students Document AI Usage in Academic Work?
Dev.to

I asked my AI agent to design a product launch image. Here's what came back.
Dev.to