The Tool Illusion: Rethinking Tool Use in Web Agents

arXiv cs.CL / 4/7/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisTools & Practical UsageModels & Research

Key Points

  • The paper argues that prior research on tool use in web agents is inconclusive due to small experimental scales and non-comparable evaluation setups.
  • It presents a large, carefully controlled study covering multiple tool sources, backbone models, tool-use frameworks, and evaluation benchmarks to reassess whether tools reliably improve web-agent performance.
  • The authors find that some earlier conclusions about tool benefits need revision while other findings are supported with broader evidence.
  • The study also aims to clarify practical design principles for effective tools and identify potential side effects introduced by tool use.
  • Overall, it provides a more robust empirical foundation intended to guide future research and design of tool-use web agents.

Abstract

As web agents rapidly evolve, an increasing body of work has moved beyond conventional atomic browser interactions and explored tool use as a higher-level action paradigm. Although prior studies have shown the promise of tools, their conclusions are often drawn from limited experimental scales and sometimes non-comparable settings. As a result, several fundamental questions remain unclear: i) whether tools provide consistent gains for web agents, ii) what practical design principles characterize effective tools, and iii) what side effects tool use may introduce. To establish a stronger empirical foundation for future research, we revisit tool use in web agents through an extensive and carefully controlled study across diverse tool sources, backbone models, tool-use frameworks, and evaluation benchmarks. Our findings both revise some prior conclusions and complement others with broader evidence. We hope this study provides a more reliable empirical basis and inspires future research on tool-use web agents.