AI Navigate

GameUIAgent: An LLM-Powered Framework for Automated Game UI Design with Structured Intermediate Representation

arXiv cs.AI / 3/17/2026

📰 NewsIdeas & Deep AnalysisTools & Practical UsageModels & Research

Key Points

  • GameUIAgent translates natural language design descriptions into editable Figma assets using a Design Spec JSON intermediate representation within a six-stage neuro-symbolic pipeline.
  • The pipeline combines LLM generation, deterministic post-processing, and a Vision-Language Model-guided Reflection Controller to iteratively self-correct while guaranteeing non-regressive quality.
  • In evaluation across 110 test cases, three LLMs, and three UI templates, the study identifies a game-domain failure taxonomy (rarity-dependent degradation and visual emptiness) and two key empirical findings: a Quality Ceiling Effect and a Rendering-Evaluation Fidelity Principle.
  • These findings establish foundational principles for LLM-driven visual generation agents in game production and reveal limits on RC improvements and unintended effects of partial rendering on VLM evaluation.

Abstract

Game UI design requires consistent visual assets across rarity tiers yet remains a predominantly manual process. We present GameUIAgent, an LLM-powered agentic framework that translates natural language descriptions into editable Figma designs via a Design Spec JSON intermediate representation. A six-stage neuro-symbolic pipeline combines LLM generation, deterministic post-processing, and a Vision-Language Model (VLM)-guided Reflection Controller (RC) for iterative self-correction with guaranteed non-regressive quality. Evaluated across 110 test cases, three LLMs, and three UI templates, cross-model analysis establishes a game-domain failure taxonomy (rarity-dependent degradation; visual emptiness) and uncovers two key empirical findings. A Quality Ceiling Effect (Pearson r=-0.96, p<0.01) suggests that RC improvement is bounded by headroom below a quality threshold -- a visual-domain counterpart to test-time compute scaling laws. A Rendering-Evaluation Fidelity Principle reveals that partial rendering enhancements paradoxically degrade VLM evaluation by amplifying structural defects. Together, these results establish foundational principles for LLM-driven visual generation agents in game production.