Jeffreys Flow: Robust Boltzmann Generators for Rare Event Sampling via Parallel Tempering Distillation

arXiv cs.LG / 4/8/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces “Jeffreys Flow,” a generative modeling framework for rare-event sampling that targets multimodal distributions where existing Boltzmann generators can suffer from catastrophic mode collapse due to reverse KL training.
  • Jeffreys Flow distills empirical data obtained from Parallel Tempering trajectories using the symmetric Jeffreys divergence to better balance local energy/target fidelity with global coverage of modes.
  • The authors claim that minimizing Jeffreys divergence suppresses mode collapse and corrects structural inaccuracies by distilling from the empirical reference sampling rather than relying solely on analytic objectives.
  • They report scalability and accuracy on challenging, non-convex multidimensional benchmarks and highlight applications including improved bias correction in Replica Exchange SGLD and large accelerations of importance sampling in Path Integral Monte Carlo for quantum thermal states.

Abstract

Sampling physical systems with rough energy landscapes is hindered by rare events and metastable trapping. While Boltzmann generators already offer a solution, their reliance on the reverse Kullback--Leibler divergence frequently induces catastrophic mode collapse, missing specific modes in multi-modal distributions. Here, we introduce the Jeffreys Flow, a robust generative framework that mitigates this failure by distilling empirical sampling data from Parallel Tempering trajectories using the symmetric Jeffreys divergence. This formulation effectively balances local target-seeking precision with global modes coverage. We show that minimizing Jeffreys divergence suppresses mode collapse and structurally corrects inherent inaccuracies via distillation of the empirical reference data. We demonstrate the framework's scalability and accuracy on highly non-convex multidimensional benchmarks, including the systematic correction of stochastic gradient biases in Replica Exchange Stochastic Gradient Langevin Dynamics and the massive acceleration of exact importance sampling in Path Integral Monte Carlo for quantum thermal states.