StrEBM: A Structured Latent Energy-Based Model for Blind Source Separation

arXiv stat.ML / 4/21/2026

📰 NewsModels & Research

Key Points

  • The paper introduces StrEBM, a structured latent energy-based model designed to learn source-wise latent organization by giving different latent dimensions their own learnable structural biases.
  • It uses blind source separation as a testbed to directly observe whether latent dimensions evolve toward distinct source-like components during training.
  • StrEBM jointly optimizes latent trajectories, an observation-generation map, and source-wise structural parameters, with each latent dimension using its own energy-based formulation.
  • The study instantiates the source-wise energies using Gaussian-process-inspired energies with learnable length-scales, demonstrating that the overall framework is more general than this specific choice.
  • Experiments on synthetic multichannel data show effective source recovery under linear and nonlinear mixing, while also highlighting optimization challenges such as slow late-stage convergence and reduced stability with nonlinear observation mappings.

Abstract

This paper proposes StrEBM, a structured latent energy-based model for source-wise structured representation learning. The framework is motivated by a broader goal of promoting identifiable and decoupled latent organization by assigning different latent dimensions their own learnable structural biases, rather than constraining the entire latent representation with a single shared energy. In this sense, blind source separation is adopted here as a concrete and verifiable testbed, through which the evolution of latent dimensions toward distinct underlying components can be directly examined. In the proposed framework, latent trajectories are optimized directly together with an observation-generation map and source-wise structural parameters. Each latent dimension is associated with its own energy-based formulation, allowing different latent components to gradually evolve toward distinct source-like roles during training. In the present study, this source-wise energy design is instantiated using Gaussian-process-inspired energies with learnable length-scales, but the framework itself is not restricted to Gaussian processes and is intended as a more general structured latent EBM formulation. Experiments on synthetic multichannel signals under linear and nonlinear mixing settings show that the proposed model can recover source components effectively, providing an initial empirical validation of the framework. At the same time, the study reveals important optimization characteristics, including slow late-stage convergence and reduced stability under nonlinear observation mappings. These findings not only clarify the practical behavior of the current GP-based instantiation, but also establish a basis for future investigation of richer source-wise energy families and more robust nonlinear optimization strategies.