Training-Free Agentic AI: Probabilistic Control and Coordination in Multi-Agent LLM Systems
arXiv cs.CL / 3/17/2026
📰 NewsDeveloper Stack & InfrastructureModels & Research
Key Points
- REDEREF is a lightweight, training-free controller that coordinates multi-agent LLM collaboration to improve routing efficiency during recursive delegation.
- It combines belief-guided delegation with Thompson sampling to prioritize agents with historically positive marginal contributions, reflection-driven re-routing via a calibrated LLM or judge, and evidence-based selection rather than output averaging.
- Across multi-agent split-knowledge tasks, REDEREF reduces token usage by 28%, agent calls by 17%, and time-to-success by 19% compared with random recursive delegation.
- The method adapts gracefully under agent or judge degradation and does not require training or fine-tuning.
Related Articles
When AI Grows Up: Identity, Memory, and What Persists Across Versions
Dev.to
Teleport Just Pivoted to AI Agent Identity. VentureBeat Mapped the Governance Gap They Are Filling.
Dev.to
Agentic RAG Failure Modes: Retrieval Thrash, Tool Storms, and Context Bloat (and How to Spot Them Early)
Towards Data Science
OpenAI is throwing everything into building a fully automated researcher
MIT Technology Review
v1.82.3.dev.2
LiteLLM Releases