AI Navigate

Hierarchical Latent Structure Learning through Online Inference

arXiv cs.LG / 3/20/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • HOLMES introduces a hierarchical online learning framework for multiscale latent structure, using a nested Chinese Restaurant Process prior and sequential Monte Carlo to enable trial-by-trial inference.
  • The approach tackles the limitations of online latent-cause models with flat partitions and offline hierarchical models by enabling online inference over hierarchical latent representations.
  • In simulations, HOLMES matches the predictive performance of flat models while learning more compact representations that enable one-shot transfer to higher-level latent categories.
  • In a context-dependent task with nested temporal structure, HOLMES improved outcome prediction relative to flat models, demonstrating its ability to discover hierarchical structure in sequential data.

Abstract

Learning systems must balance generalization across experiences with discrimination of task-relevant details. Effective learning therefore requires representations that support both. Online latent-cause models support incremental inference but assume flat partitions, whereas hierarchical Bayesian models capture multilevel structure but typically require offline inference. We introduce the Hierarchical Online Learning of Multiscale Experience Structure (HOLMES) model, a computational framework for hierarchical latent structure learning through online inference. HOLMES combines a variation on the nested Chinese Restaurant Process prior with sequential Monte Carlo inference to perform tractable trial-by-trial inference over hierarchical latent representations without explicit supervision over the latent structure. In simulations, HOLMES matched the predictive performance of flat models while learning more compact representations that supported one-shot transfer to higher-level latent categories. In a context-dependent task with nested temporal structure, HOLMES also improved outcome prediction relative to flat models. These results provide a tractable computational framework for discovering hierarchical structure in sequential data.