Terminal Agents Suffice for Enterprise Automation

arXiv cs.AI / 4/2/2026

💬 OpinionIdeas & Deep AnalysisTools & Practical UsageModels & Research

Key Points

  • The paper argues that enterprise automation agents do not necessarily need complex GUI-based or protocol-layer architectures like web agents or MCP-style abstractions.
  • It proposes that a coding agent restricted to a terminal and filesystem can accomplish many enterprise tasks more effectively by calling platform APIs directly.
  • Experiments across multiple real-world systems found these “terminal-only” agents match or outperform more complex agent designs.
  • The authors conclude that combining simple programmatic interfaces with strong foundation models is sufficient for practical, cost-aware enterprise automation.

Abstract

There has been growing interest in building agents that can interact with digital platforms to execute meaningful enterprise tasks autonomously. Among the approaches explored are tool-augmented agents built on abstractions such as Model Context Protocol (MCP) and web agents that operate through graphical interfaces. Yet, it remains unclear whether such complex agentic systems are necessary given their cost and operational overhead. We argue that a coding agent equipped only with a terminal and a filesystem can solve many enterprise tasks more effectively by interacting directly with platform APIs. We evaluate this hypothesis across diverse real-world systems and show that these low-level terminal agents match or outperform more complex agent architectures. Our findings suggest that simple programmatic interfaces, combined with strong foundation models, are sufficient for practical enterprise automation.

Terminal Agents Suffice for Enterprise Automation | AI Navigate