A Hybridizable Neural Time Integrator for Stable Autoregressive Forecasting

arXiv cs.LG / 4/24/2026

📰 NewsIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper proposes a hybrid method that embeds an autoregressive transformer into a shooting-based mixed finite element scheme to expose topological structure and enable provable stability for chaotic dynamical systems.
It provides theoretical guarantees that forward inference preserves discrete energy and that training gradients are uniformly bounded, explicitly addressing exploding-gradient instability.
By combining the approach with a vision transformer, the method learns latent tokens whose dynamics are structure-preserving.
Experiments report substantial efficiency gains, including outperforming modern foundation models with a 65× reduction in parameters and achieving long-horizon forecasting improvements on chaotic systems.
A “mini-foundation” fusion component is shown to require only 12 simulations to train a real-time surrogate, delivering a 9,000× speedup over particle-in-cell simulation.

Abstract

For autoregressive modeling of chaotic dynamical systems over long time horizons, the stability of both training and inference is a major challenge in building scientific foundation models. We present a hybrid technique in which an autoregressive transformer is embedded within a novel shooting-based mixed finite element scheme, exposing topological structure that enables provable stability. For forward problems, we prove preservation of discrete energies, while for training we prove uniform bounds on gradients, provably avoiding the exploding gradient problem. Combined with a vision transformer, this yields latent tokens admitting structure-preserving dynamics. We outperform modern foundation models with a

65\times

reduction in model parameters and long-horizon forecasting of chaotic systems. A "mini-foundation" model of a fusion component shows that 12 simulations suffice to train a real-time surrogate, achieving a

9{,}000\times

speedup over particle-in-cell simulation.

The 67th Attempt: When Your "Knowledge Management" System Becomes a Self-Fulfilling Prophecy of Excellence

Dev.to

Context Engineering for Developers: A Practical Guide (2026)

Dev.to

GPT-5.5 is here. So is DeepSeek V4. And honestly, I am tired of version numbers.

Dev.to

I Built an AI Image Workflow with GPT Image 2.0 (+ Fixing Its Biggest Flaw)

Dev.to

Max-and-Omnis/Nemotron-3-Super-64B-A12B-Math-REAP-GGUF

Reddit r/LocalLLaMA

A Hybridizable Neural Time Integrator for Stable Autoregressive Forecasting

Key Points

Abstract

Related Articles

The 67th Attempt: When Your "Knowledge Management" System Becomes a Self-Fulfilling Prophecy of Excellence

Context Engineering for Developers: A Practical Guide (2026)

GPT-5.5 is here. So is DeepSeek V4. And honestly, I am tired of version numbers.

I Built an AI Image Workflow with GPT Image 2.0 (+ Fixing Its Biggest Flaw)

Max-and-Omnis/Nemotron-3-Super-64B-A12B-Math-REAP-GGUF

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer