AI Navigate

Scaling Laws for Educational AI Agents

arXiv cs.AI / 3/13/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The article introduces the Agent Scaling Law for LLM-based educational agents, emphasizing that scaling depends on structured dimensions like role clarity, skill depth, tool completeness, runtime capability, and educator expertise injection.
  • It defines AgentProfile as a JSON-based specification and presents EduClaw, a profile-driven platform that builds and deploys hundreds of educational agent profiles.
  • EduClaw demonstrates scalability with over 330 agent profiles and more than 1,100 skill modules across K-12 subjects, illustrating practical impact.
  • The authors identify Tool Scaling and Skill Scaling as complementary axes and argue that improvements in education AI come from richer, well-structured capability systems rather than solely larger models.
  • They report empirical observations that educational agent performance scales predictably with profile structural richness, suggesting a new direction for educational AI development.

Abstract

While scaling laws for Large Language Models (LLMs) have been extensively studied along dimensions of model parameters, training data, and compute, the scaling behavior of LLM-based educational agents remains unexplored. We propose that educational agent capability scales not merely with the underlying model size, but through structured dimensions that we collectively term the Agent Scaling Law: role definition clarity, skill depth, tool completeness, runtime capability, and educator expertise injection. Central to this framework is AgentProfile, a structured JSON-based specification that serves as the mechanism enabling systematic capability growth of educational agents. We present EduClaw, a profile-driven multi-agent platform that operationalizes this scaling law, demonstrating its effectiveness through the construction and deployment of 330+ educational agent profiles encompassing 1,100+ skill modules across K-12 subjects. Our empirical observations suggest that educational agent performance scales predictably with profile structural richness. We identify two complementary scaling axes -- Tool Scaling and Skill Scaling -- as future directions, arguing that the path to more capable educational AI lies not solely in larger models, but in stronger structured capability systems.