UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience
arXiv cs.LG / 3/26/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces UI-Voyager, a two-stage autonomous mobile GUI agent designed to learn efficiently from failures in long-horizon, sparse-reward Android GUI tasks.
- In the first stage, it uses Rejection Fine-Tuning (RFT) to continuously co-evolve data and models in an autonomous loop, reducing reliance on manual annotation.
- In the second stage, it applies Group Relative Self-Distillation (GRSD) to locate critical fork points across group rollouts and generate dense step-level supervision from successful trajectories.
- Experiments on AndroidWorld report that the 4B model reaches an 81.0% Pass@1 success rate, outperforming many recent baselines and surpassing human-level performance.
- Ablation studies and case analyses support GRSD’s effectiveness in improving learning signal quality and credit assignment under ambiguous outcomes.
Related Articles
Regulating Prompt Markets: Securities Law, Intellectual Property, and the Trading of Prompt Assets
Dev.to
Mercor competitor Deccan AI raises $25M, sources experts from India
Dev.to
How We Got Local MCP Servers Working in Claude Cowork (The Missing Guide)
Dev.to
How Should Students Document AI Usage in Academic Work?
Dev.to

I asked my AI agent to design a product launch image. Here's what came back.
Dev.to