A Hybridizable Neural Time Integrator for Stable Autoregressive Forecasting

arXiv cs.LG / 4/24/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a hybrid method that embeds an autoregressive transformer into a shooting-based mixed finite element scheme to expose topological structure and enable provable stability for chaotic dynamical systems.
  • It provides theoretical guarantees that forward inference preserves discrete energy and that training gradients are uniformly bounded, explicitly addressing exploding-gradient instability.
  • By combining the approach with a vision transformer, the method learns latent tokens whose dynamics are structure-preserving.
  • Experiments report substantial efficiency gains, including outperforming modern foundation models with a 65× reduction in parameters and achieving long-horizon forecasting improvements on chaotic systems.
  • A “mini-foundation” fusion component is shown to require only 12 simulations to train a real-time surrogate, delivering a 9,000× speedup over particle-in-cell simulation.

Abstract

For autoregressive modeling of chaotic dynamical systems over long time horizons, the stability of both training and inference is a major challenge in building scientific foundation models. We present a hybrid technique in which an autoregressive transformer is embedded within a novel shooting-based mixed finite element scheme, exposing topological structure that enables provable stability. For forward problems, we prove preservation of discrete energies, while for training we prove uniform bounds on gradients, provably avoiding the exploding gradient problem. Combined with a vision transformer, this yields latent tokens admitting structure-preserving dynamics. We outperform modern foundation models with a 65\times reduction in model parameters and long-horizon forecasting of chaotic systems. A "mini-foundation" model of a fusion component shows that 12 simulations suffice to train a real-time surrogate, achieving a 9{,}000\times speedup over particle-in-cell simulation.