Learning to Theorize the World from Observation

arXiv cs.LG / 5/6/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that “understanding the world” should be modeled as building explicit internal theories, not only as making accurate future predictions from observations.
  • It introduces a new learning paradigm, Learning-to-Theorize, aimed at inferring explicit explanatory theories from raw, non-text observations.
  • The authors instantiate the paradigm with Neural Theorizer (NEO), a probabilistic neural model that induces latent programs as a learned Language of Thought and runs them via a shared transition model.
  • In NEO, theories are represented as executable compositional programs whose primitives can be recombined to explain unseen phenomena.
  • Experiments indicate that this approach supports explanation-driven generalization, where new observations are interpreted through the programs that generate them.

Abstract

What does it mean to understand the world? Contemporary world models often operationalize understanding as accurate future prediction in latent or observation space. Developmental cognitive science, however, suggests a different view: human understanding emerges through the construction of internal theories of how the world works, even before mature language is acquired. Inspired by this theory-building view of cognition, we introduce Learning-to-Theorize, a learning paradigm for inferring explicit explanatory theories of the world from raw, non-textual observations. We instantiate this paradigm with the Neural Theorizer (NEO), a probabilistic neural model that induces latent programs as a learned Language of Thought and executes them through a shared transition model. In NEO, a theory is represented as an executable, compositional program whose learned primitives can be systematically recombined to explain novel phenomena. Experiments show that this formulation enables explanation-driven generalization, allowing observations to be understood in terms of the programs that generate them.