AgentOpt v0.1 Technical Report: Client-Side Optimization for LLM-Based Agent

arXiv cs.LG / 2026/4/9

📰 ニュースDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

要点

  • The paper introduces AgentOpt, described as the first framework-agnostic Python package focused on optimizing LLM-based agent pipelines from the client side rather than relying solely on server-side efficiency techniques.
  • It formalizes client-side resource allocation across pipeline stages, including decisions about model choice, local tools, and API budget, under task-specific constraints for quality, cost, and latency.
  • A key finding is that model selection can produce very large cost differences at matched accuracy, with reported best-vs-worst model-combination gaps reaching 13–32× in experiments.
  • AgentOpt includes eight search algorithms (e.g., Arm Elimination, Epsilon-LUCB, Threshold Successive Elimination, Bayesian Optimization) to efficiently navigate exponentially large model-combination spaces.
  • Experiments on four benchmarks show Arm Elimination achieves near-optimal accuracy while cutting evaluation budget by 24–67% versus brute-force search on most tasks, with code and results published online.

Abstract

AI agents are increasingly deployed in real-world applications, including systems such as Manus, OpenClaw, and coding agents. Existing research has primarily focused on \emph{server-side} efficiency, proposing methods such as caching, speculative execution, traffic scheduling, and load balancing to reduce the cost of serving agentic workloads. However, as users increasingly construct agents by composing local tools, remote APIs, and diverse models, an equally important optimization problem arises on the client side. Client-side optimization asks how developers should allocate the resources available to them, including model choice, local tools, and API budget across pipeline stages, subject to application-specific quality, cost, and latency constraints. Because these objectives depend on the task and deployment setting, they cannot be determined by server-side systems alone. We introduce AgentOpt, the first framework-agnostic Python package for client-side agent optimization. We first study model selection, a high-impact optimization lever in multi-step agent pipelines. Given a pipeline and a small evaluation set, the goal is to find the most cost-effective assignment of models to pipeline roles. This problem is consequential in practice: at matched accuracy, the cost gap between the best and worst model combinations can reach 13--32\times in our experiments. To efficiently explore the exponentially growing combination space, AgentOpt implements eight search algorithms, including Arm Elimination, Epsilon-LUCB, Threshold Successive Elimination, and Bayesian Optimization. Across four benchmarks, Arm Elimination recovers near-optimal accuracy while reducing evaluation budget by 24--67\% relative to brute-force search on three of four tasks. Code and benchmark results available at https://agentoptimizer.github.io/agentopt/.