Training-Free Agentic AI: Probabilistic Control and Coordination in Multi-Agent LLM Systems
arXiv cs.CL / 3/17/2026
Key Points
- REDEREF is a lightweight, training-free controller that coordinates multi-agent LLM collaboration to improve routing efficiency during recursive delegation.
- It combines three mechanisms: belief-guided delegation, which uses Thompson sampling to prioritize agents with historically positive marginal contributions; reflection-driven re-routing via a calibrated LLM or judge; and evidence-based selection of a single output rather than output averaging.
- Across multi-agent split-knowledge tasks, REDEREF reduces token usage by 28%, agent calls by 17%, and time-to-success by 19% compared with random recursive delegation.
- The method adapts gracefully under agent or judge degradation and does not require training or fine-tuning.
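
To make the delegation loop described above concrete, here is a minimal sketch, not the paper's implementation. It assumes a Beta-Bernoulli belief per agent, a binary judge verdict as the reward signal, and hypothetical names (`BetaBelief`, `delegate`, `agents`, `judge`); the summary does not specify REDEREF's actual belief model, how marginal contribution is measured, or how the judge is calibrated.

```python
import random
from typing import Callable, Dict, List, Optional, Tuple


class BetaBelief:
    """Beta(alpha, beta) belief over one agent's success rate (assumed model)."""

    def __init__(self, alpha: float = 1.0, beta: float = 1.0) -> None:
        self.alpha = alpha
        self.beta = beta

    def sample(self, rng: random.Random) -> float:
        # Thompson sampling: draw one plausible success rate from the posterior.
        return rng.betavariate(self.alpha, self.beta)

    def update(self, success: bool) -> None:
        # Bernoulli update from the judge's accept/reject verdict.
        if success:
            self.alpha += 1.0
        else:
            self.beta += 1.0


def delegate(
    task: str,
    agents: Dict[str, Callable[[str], str]],
    judge: Callable[[str, str], bool],
    beliefs: Dict[str, BetaBelief],
    rng: random.Random,
    max_calls: Optional[int] = None,
) -> Tuple[Optional[str], List[str]]:
    """Route a task to agents one at a time until the judge accepts an answer.

    Returns the accepted answer (or None) and the ordered list of agents tried.
    """
    remaining = set(agents)
    tried: List[str] = []
    budget = len(agents) if max_calls is None else max_calls
    while remaining and len(tried) < budget:
        # Pick the untried agent with the highest sampled success rate.
        name = max(sorted(remaining), key=lambda n: beliefs[n].sample(rng))
        remaining.discard(name)
        tried.append(name)
        answer = agents[name](task)
        accepted = judge(task, answer)
        beliefs[name].update(accepted)
        if accepted:
            # Selection by evidence: return the one accepted answer
            # instead of averaging or merging several outputs.
            return answer, tried
        # Rejected: loop again, re-routing to another agent.
    return None, tried
```

Because beliefs persist across tasks, agents that the judge keeps rejecting are sampled first less and less often, which is the intuition behind the reported savings in agent calls; no model weights are updated at any point.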