Best Agent Identification for General Game Playing
arXiv stat.ML / 4/22/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces a general procedure to identify the best (or near-best) performing algorithm for each sub-task in multi-problem domains by modeling it as multi-armed bandit best-arm identification.
- Each bandit represents a task and each arm represents an agent/algorithm, and the method uses an optimistic confidence-interval-based selection strategy to rank arms by their potential impact on simple regret.
- Experiments on General Video Game AI (GVGAI) and Ludii show substantial improvements over prior best-arm identification methods, reducing average simple regret and increasing the probability of correct identification.
- The approach is positioned as a way to improve agent evaluation quality and accuracy for general game playing frameworks and other multi-task settings where algorithm runtime is high.



