Robust volatility updates for Hierarchical Gaussian Filtering

arXiv cs.LG / 5/5/2026

📰 NewsModels & Research

Key Points

  • Hierarchical Gaussian Filtering (HGF) networks update an agent’s beliefs about hidden states using one-step mean and precision (inverse-variance) update equations across parent and child nodes.
  • For volatility-targeting (variance-targeting) HGF parent nodes, the original variance update formulation can yield negative posterior precision in parts of the parameter space, causing the updating algorithm to fail.
  • The report proposes a modified quadratic approximation to the variational energy for volatility-coupled nodes that prevents negative posterior precision.
  • The method interpolates between two quadratic expansions—anchored at the prior prediction and at a second mode computed in closed form using the Lambert W function—yielding robust update equations.
  • The resulting updates are claimed to remain valid throughout the full parameter space and to track the variational posterior accurately even under large prediction errors.

Abstract

Hierarchical Gaussian Filtering (HGF) networks allow for efficient updating of posterior distributions (beliefs) about hidden states of an agent's environment. HGF parent nodes can target the mean or variance of their children. New information entering at input nodes leads to a cascade of belief updates across the network according to one-step update equations for each node's mean and precision (inverse variance). However, the original form of the update equations for variance-targeting parents(volatility coupling) can in some regions of parameter space lead to negative posterior precision, a logical impossibility which causes the updating algorithm to terminate with an error. In this report, we introduce a modified quadratic approximation to the variational energy of volatility-coupled nodes that avoids negative posterior precision. The key idea is to interpolate between two quadratic expansions of the variational energy: one at the prior prediction and one at a second mode whose location is obtained in closed form via the Lambert W function. The resulting update equations are robust across the entire parameter space and faithfully track the variational posterior even for large prediction errors.