Working Paper: Towards Schema-based Learning from a Category-Theoretic Perspective

arXiv cs.AI / 4/14/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • スキーマベース学習(SBL)を、型理論・圏論に基づく階層的な枠組みとして4レベルに整理し、それぞれの役割(構文・表現実装・確率モデル・意味)を圏として定式化しています。
  • 構文スキーマから表現言語への実装写像や、Grothendieck構成による総圏の構成、さらにGir yモナドのKleisli圏へ写像して確率モデルを与える流れが示されています。
  • 行為主体(agent)レベルでは、duoidal構造を用いたワークフロー実行や、Mindカテゴリを通じて「記憶・予測モデル・認知カーネル」のモジュール化(成功条件や論理署名を含む)が提案されています。
  • 上位では、エージェントアーキテクチャカテゴリやWorldカテゴリにより、異種パラダイムとの比較やマルチエージェント/環境相互作用までを含む弱い階層n-カテゴリー構造として統合しています。

Abstract

We introduce a hierarchical categorical framework for Schema-Based Learning (SBL) structured across four interconnected levels. At the schema level, a free multicategory Sch_{syn} encodes fundamental schemas and transformations. An implementation functor \mathcal{I} maps syntactic schemas to representational languages, inducing via the Grothendieck construction the total category Sch_{impl}. Implemented schemas are mapped by a functor Model into the Kleisli category \mathbf{KL(G)} of the Giry monad, yielding probabilistic models, while an instances presheaf assigns evaluated instance spaces. A semantic category Sch_{sem}, defined as a full subcategory of \mathbf{KL(G)}, provides semantic grounding through an interpretation functor from Sch_{impl}. At the agent level, Sch_{impl} is equipped with a duoidal structure \mathcal{O}_{Sch} supporting schema-based workflows. A left duoidal action on the category Mind enables workflow execution over mental objects, whose components include mental spaces, predictive models, and a cognitive kernel composed of memory and cognitive modules. Each module is specified by schema-typed interfaces, duoidal workflows, a success condition, and a logical signature. Memory is formalized categorically via memory subsystems, a presheaf Data_M, a monoidal operation category Ops_M, and read/write natural transformations. Together with the Body category, Mind defines the embodied SBL agent. At higher levels, SBL is represented as an object of the agent architecture category ArchCat, enabling comparison with heterogeneous paradigms, while the World category models multi-agent and agent-environment interactions. Altogether, the framework forms a weak hierarchical n-categorical structure linking schema semantics, cognition, embodiment, architectural abstraction, and world-level interaction.