Leverage Laws: A Per-Task Framework for Human-Agent Collaboration

arXiv cs.CL / 4/29/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces a per-task “leverage ratio” to quantify human–agent collaboration by relating displaced human work to the human effort needed for task specification, handling mid-run interruptions, and reviewing outcomes.
  • It models how an information requirement must flow through three time-cost channels, establishing directional limits on information density between human-to-agent and agent-to-human interactions.
  • The authors analyze asymptotic leverage behavior and show it separates into two scaling factors—capability and memory—with a non-zero planning floor driven by irreducible task novelty constrained by human throughput.
  • They extend the approach to a windowed leverage metric that supports recurring tasks and spawned subtasks, incorporating amortized system-design investment, while keeping both task-level and window-level leverage bounded.
  • The framework unifies themes from supervisory control, common ground, and mixed-initiative interaction into a single normative ratio and ends with several testable empirical research questions.

Abstract

We propose a per-task leverage ratio for human-agent collaboration: human work displaced by an agent, divided by the human time required to specify the task, resolve mid-run interrupts, and review the result. The denominator decomposes into three channels through which a conserved per-task information requirement must flow, each with its own time-cost scalar. We show that information density itself is directional and bounded by separate ceilings on human-to-agent and agent-to-human flow, and that the asymptotic behavior of leverage decomposes into two scaling axes (capability and memory) with a non-zero floor on the planning term set by irreducible task novelty bounded by human throughput. We extend this per-task analysis to a windowed leverage measure that accommodates recurring tasks, spawned subtasks, and amortized system-design investment. The per-task ceiling does not bind the windowed measure, though both remain bounded: L_{\text{task}} by per-task novelty, L_{\text{window}} by the stock of accumulated planning investment that pays out within the window. The framework operationalizes aspects of earlier qualitative work on supervisory control (Sheridan, 1992), common ground (Clark & Brennan, 1991), and mixed-initiative interaction (Horvitz, 1999) within a single normative ratio, and produces a list of testable empirical questions that we leave as open problems.