GamED.AI: A Hierarchical Multi-Agent Framework for Automated Educational Game Generation

arXiv cs.AI / 4/28/2026

📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

Key Points

  • GamED.AI is a hierarchical multi-agent framework that converts instructor-provided questions into fully playable, pedagogically grounded educational games using formal “mechanic contracts.”
  • The system is built with phase-based LangGraph sub-graphs, deterministic Quality Gates, and structured Pydantic schemas to ensure reliable game generation and validation.
  • It supports two template families covering 15 interaction mechanics, targeting spatial reasoning, procedural execution, and higher-order Bloom’s Taxonomy objectives.
  • In an evaluation on 200 questions across five subjects, GamED.AI reports a 90% validation pass rate and 98.3% schema compliance, alongside a 73% token reduction versus ReAct agents.
  • The authors claim that, in this setup, phase-bounded architectural structure correlates more strongly with alignment quality than prompting strategy alone, and they demo Bloom-aligned generation in under 60 seconds.

Abstract

We introduce GamED.AI, a hierarchical multi-agent framework that transforms instructor-provided questions into fully playable, pedagogically grounded educational games validated through formal mechanic contracts. Built on phase-based LangGraph sub-graphs, deterministic Quality Gates, and structured Pydantic schemas, GamED.AI supports two template families encompassing 15 interaction mechanics across spatial reasoning, procedural execution, and higher-order Bloom's Taxonomy objectives. Evaluated on 200 questions spanning five subject domains, the system achieves a 90% validation pass rate, 98.3% schema compliance, and 73% token reduction over ReAct agents ({\sim}73,500 \rightarrow {\sim}19,900 tokens/game) at $0.46 per game. Within this model configuration, these results suggest that phase-bounded architectural structure correlates more strongly with alignment quality than prompting strategy alone. Our demonstration lets attendees generate Bloom's-aligned games from natural language in under 60 seconds, inspect Quality Gate outputs at each pipeline phase, and browse a curated library of 50 games spanning all 15 mechanic types.