FactorSmith: Agentic Simulation Generation via Markov Decision Process Decomposition with Planner-Designer-Critic Refinement
arXiv cs.AI / 2026/3/24
💬 オピニオンSignals & Early TrendsIdeas & Deep AnalysisModels & Research
要点
- FactorSmith is a framework for generating executable game simulations from natural-language specs by reducing LLM context needs through factored POMDP decomposition of the specification into modular steps.
- Each factored step uses a hierarchical planner–designer–critic agentic loop that iteratively refines generated code artifacts, guided by structured scoring and checkpoint rollback when quality degrades.
- The approach builds on FactorSim’s factored partially observable Markov decision process representation and applies an agentic trio workflow inspired by SceneSmith.
- The paper provides a formal mathematical basis for context selection and refinement mechanics and releases an open-source implementation.
- Experiments on the PyGame Learning Environment benchmark show improved prompt alignment, fewer runtime errors, and higher code quality versus non-agentic factored baselines.

