MAC: Multi-Agent Constitution Learning
arXiv cs.AI / 3/18/2026
📰 NewsIdeas & Deep AnalysisTools & Practical UsageModels & Research
Key Points
- MAC introduces Multi-Agent Constitutional Learning, a framework to optimize over structured prompts as rule sets using a network of agents that can accept, edit, or reject rule updates.
- MAC+ further improves performance by training agents on successful trajectories to reinforce reward-maximizing updates.
- The approach yields human-readable, auditable rule sets and generalizes to other agentic tasks such as tool calling, with performance comparable to supervised fine-tuning and GRPO without parameter updates.
- It outperforms recent prompt-optimization methods by over 50%, addressing limitations of prior methods that require many labeled examples and lack structured prompts.




