Small Model as Master Orchestrator: Learning Unified Agent-Tool Orchestration with Parallel Subtask Decomposition
arXiv cs.AI / 4/21/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper addresses limitations of existing multi-agent orchestration approaches that use static workflows or serial scheduling and struggle with heterogeneous tool/agent interfaces.
- It introduces “Agent-as-Tool,” a unified parallel orchestration framework that normalizes protocols and uses explicit state feedback, treating both agents and tools as elements in a standardized, learnable action space.
- Based on this paradigm, the authors train a lightweight orchestrator called ParaManager that separates planning from subtask solving and supports state-aware parallel decomposition, delegation, and asynchronous execution.
- The training uses a two-stage pipeline combining supervised fine-tuning with recovery mechanisms and reinforcement learning to balance task success, protocol compliance, diversity, and reasoning efficiency.
- Experiments indicate that ParaManager performs strongly on multiple benchmarks and generalizes robustly to previously unseen model pools.
Related Articles

¿Hasta qué punto podría la IA reemplazarnos en nuestros trabajos? A veces creo que la gente exagera un poco.
Reddit r/artificial

Magnificent irony as Meta staff unhappy about running surveillance software on work PCs
The Register

ETHENEA (ETHENEA Americas LLC) Analyst View: Asset Allocation Resilience in the 2026 Global Macro Cycle
Dev.to

DEEPX and Hyundai Are Building Generative AI Robots
Dev.to

Stop Paying OpenAI to Read Garbage: The Two-Stage Agent Pipeline
Dev.to