A Predictive View on Streaming Hidden Markov Models

arXiv stat.ML / 4/13/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a predictive-first optimization framework for streaming Hidden Markov Models that targets accurate step-ahead predictive distributions rather than full posterior recovery under a fully specified generative model.
  • It assumes online learning of regime-specific predictive model parameters while keeping a fixed transition prior over latent regimes, aiming to sequentially identify hidden regimes.
  • Because the space of regime paths grows exponentially, the authors replace infeasible exact filtering with a constrained projection problem in predictive-distribution space using a fixed hypothesis (beam) budget.
  • The method yields a closed-form, fully recursive deterministic algorithm equivalent to a renormalized top-$S$ posterior-weighted mixture, providing a principled derivation of beam search for HMMs without requiring EM or sampling.
  • Experiments show competitive prequential performance compared with Online EM and Sequential Monte Carlo when computational budgets are matched.

Abstract

We develop a predictive-first optimisation framework for streaming hidden Markov models. Unlike classical approaches that prioritise full posterior recovery under a fully specified generative model, we assume access to regime-specific predictive models whose parameters are learned online while maintaining a fixed transition prior over regimes. Our objective is to sequentially identify latent regimes while maintaining accurate step-ahead predictive distributions. Because the number of possible regime paths grows exponentially, exact filtering is infeasible. We therefore formulate streaming inference as a constrained projection problem in predictive-distribution space: under a fixed hypothesis budget, we approximate the full posterior predictive by the forward-KL optimal mixture supported on S paths. The solution is the renormalised top-S posterior-weighted mixture, providing a principled derivation of beam search for HMMs. The resulting algorithm is fully recursive and deterministic, performing beam-style truncation with closed-form predictive updates and requiring neither EM nor sampling. Empirical comparisons against Online EM and Sequential Monte Carlo under matched computational budgets demonstrate competitive prequential performance.