Abstract
Neural networks systematically fail at compositional generalization, i.e., at producing correct outputs for novel combinations of known parts. We show that this failure is architectural: compositional generalization is equivalent to functoriality of the decoder, and this perspective yields both guarantees and impossibility results. We compile Higher Inductive Type (HIT) specifications into neural architectures via a monoidal functor from the path groupoid of a target space to a category of parametric maps: path constructors become generator networks, composition becomes structural concatenation, and 2-cells witnessing group relations become learned natural transformations. We prove that decoders assembled by structural concatenation of independently generated segments are strict monoidal functors (compositional by construction), while softmax self-attention is not functorial for any non-trivial compositional task. Both results are formalized in Cubical Agda. Experiments on three spaces validate the full hierarchy: on the torus ($\mathbb{Z}^2$), functorial decoders outperform non-functorial ones by 2--2.7x; on $S^1 \vee S^1$ ($F_2$), the type-A/B gap widens to 5.5--10x; on the Klein bottle ($\mathbb{Z} \rtimes \mathbb{Z}$), a learned 2-cell closes a 46\% error gap on words exercising the group relation.