AutoSurfer -- Teaching Web Agents through Comprehensive Surfing, Learning, and Modeling
arXiv cs.AI / 5/1/2026
📰 NewsSignals & Early TrendsModels & Research
Key Points
- AutoSurfer is introduced as a new web trajectory generator designed to improve the quality and coverage of training data for web agents built with multimodal LLMs.
- It uses a systematic breadth-first exploration approach (with a queue of discovered pages and action traces, knowledge propagation, and recursive expansion of multi-level UI elements) to better cover a website’s action space.
- AutoSurfer grounds task synthesis in real exploration trajectories to reduce hallucinations and ambiguity, and then reuses those trajectories as hints to steer more accurate trajectory refinement by a web agent.
- Experiments on the WebArena benchmark show that fine-tuning Qwen2.5-VL-7B-Instruct with AutoSurfer-generated data outperforms prior methods, reaching up to 24.23% overall task completion accuracy versus 19.59% for the best previous approach.
- The paper also reports that AutoSurfer produces more diverse synthesized tasks, supporting its usefulness for training more robust website-specific LLM agents.
Related Articles

Why Autonomous Coding Agents Keep Failing — And What Actually Works
Dev.to

Text-to-image is easy. Chaining LLMs to generate, critique, and iterate on images autonomously is a routing nightmare. AgentSwarms now supports Image generation playground and creative media workflows!
Reddit r/artificial

Announcing the NVIDIA Nemotron 3 Super Build Contest
Dev.to

75% of Sites Blocking AI Bots Still Get Cited. Here Is Why Blocking Does Not Work.
Dev.to

How to Fix OpenClaw Tool Calling Issues
Dev.to