Code-Space Response Oracles: Generating Interpretable Multi-Agent Policies with Large Language Models
arXiv cs.AI / 3/12/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- CSRO replaces black-box RL oracles with LLMs by generating policies as human-readable code, improving interpretability and trust in multi-agent settings.
- It reframes best-response computation as a code-generation task and explores zero-shot prompting, iterative refinement, and AlphaEvolve (a distributed LLM-based evolutionary system).
- The approach achieves competitive performance with baselines while yielding a diverse, explainable set of policies, shifting focus from opaque policy parameters to interpretable algorithmic behavior.
- By leveraging pretrained LLM knowledge, CSRO can discover complex, human-like strategies that are easier to inspect, debug and reason about.




