Semantic Interaction Information mediates compositional generalization in latent space

Key Points

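Learning how latent variables interact requires accurate inference, yet accurate inference depends on knowing those interactions. To break this loop, the paper proposes Representation Classification Chains (RCCs), a JEPA-style design that separates embedding learning from inference learning, and shows that RCCs improve compositional generalization to novel combinations of relevant variables.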

Abstract

Are there still barriers to generalization once all relevant variables are known? We address this question via a framework that casts compositional generalization as a variational inference problem over latent variables with parametric interactions. To explore this, we develop the Cognitive Gridworld, a stationary Partially Observable Markov Decision Process (POMDP) where observations are generated jointly by multiple latent variables, yet feedback is provided for only a single goal variable. This setting allows us to define Semantic Interaction Information (SII): a metric measuring the contribution of latent variable interactions to task performance. Using SII, we analyze Recurrent Neural Networks (RNNs) provided with these interactions, finding that SII explains the accuracy gap between Echo State and Fully Trained networks. Our analysis also uncovers a theoretically predicted failure mode where confidence decouples from accuracy, suggesting that utilizing interactions between relevant variables is a non-trivial capability. We then address a harder regime where the interactions must be learned by an embedding model. Learning how latent variables interact requires accurate inference, yet accurate inference depends on knowing those interactions. The Cognitive Gridworld reveals this circular dependence as a core challenge for continual meta-learning. We approach this dilemma via Representation Classification Chains (RCCs), a JEPA-style architecture that disentangles these processes: variable inference and variable embeddings are learned by separate modules through Reinforcement Learning and self-supervised learning, respectively. Lastly, we demonstrate that RCCs facilitate compositional generalization to novel combinations of relevant variables. Together, these results establish a grounded setting for evaluating goal-directed generalist agents.
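The abstract does not give the formal definition of SII, so the following is only a minimal sketch of the classical information-theoretic quantity that the name alludes to, interaction information, computed on a toy example. Everything in it is an illustrative assumption rather than the paper's method: the XOR generative model, the variable roles (goal, context, observation), the function names, and the sign convention I(goal; obs | context) - I(goal; obs). It shows a case where an observation generated jointly by two latent variables says nothing about the goal variable on its own, and becomes fully informative only once the interaction with the second variable is taken into account.

```python
import itertools
import math
from collections import defaultdict


def entropy(joint, keep):
    """Shannon entropy in bits of the marginal over the tuple positions in `keep`."""
    marginal = defaultdict(float)
    for outcome, p in joint.items():
        marginal[tuple(outcome[i] for i in keep)] += p
    return -sum(p * math.log2(p) for p in marginal.values() if p > 0)


def mutual_information(joint, x, y):
    """I(X; Y) = H(X) + H(Y) - H(X, Y)."""
    return entropy(joint, [x]) + entropy(joint, [y]) - entropy(joint, [x, y])


def conditional_mutual_information(joint, x, y, z):
    """I(X; Y | Z) = H(X, Z) + H(Y, Z) - H(X, Y, Z) - H(Z)."""
    return (
        entropy(joint, [x, z])
        + entropy(joint, [y, z])
        - entropy(joint, [x, y, z])
        - entropy(joint, [z])
    )


def interaction_information(joint, x, y, z):
    """Assumed sign convention: I(X; Y | Z) - I(X; Y)."""
    return conditional_mutual_information(joint, x, y, z) - mutual_information(joint, x, y)


# Hypothetical toy model (not the Cognitive Gridworld): two uniform binary
# latent variables, and an observation that is their XOR.
# Each outcome is a tuple (goal, context, observation).
xor_joint = {
    (goal, context, goal ^ context): 0.25
    for goal, context in itertools.product([0, 1], repeat=2)
}

GOAL, CONTEXT, OBS = 0, 1, 2
mi = mutual_information(xor_joint, GOAL, OBS)
cmi = conditional_mutual_information(xor_joint, GOAL, OBS, CONTEXT)
ii = interaction_information(xor_joint, GOAL, OBS, CONTEXT)

print(f"I(goal; obs)           = {mi:.3f} bits")
print(f"I(goal; obs | context) = {cmi:.3f} bits")
print(f"interaction information = {ii:.3f} bits")
```

In this toy case the observation carries zero bits about the goal variable in isolation and one full bit once the context variable is known, so all of the usable information lives in the interaction. An agent that ignores the interaction cannot do better than chance here, which is the kind of dependence a metric like SII is described as measuring.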
