Multi-Head Residual-Gated DeepONet for Coherent Nonlinear Wave Dynamics

arXiv cs.LG / 4/15/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

共有:

Key Points

本論文は、初期状態を特徴づける物理的に意味のあるコンパクトな記述子を、DeepONetの入力-出力学習に“残差モジュレーション”として組み込む新しい枠組みMH-RGを提案している。
MH-RGは、物理記述子を並列の条件付け経路で扱い、それを状態予測側に対する残差変調因子として作用させることで、黒箱的な高次回帰に留まらない学習を目指している。
構造としては、pre-branch residual modulator、branch residual gate、trunk residual gateに加え、低ランクのmulti-head機構を用いて複数の補完的な応答パターンを捉えつつパラメータ増大を抑えている。
ベンチマークとして非線形の保守的波動ダイナミクスや散逸的な閉じ込めダイナミクスで評価し、従来の特徴追加ベースラインより誤差が一貫して低く、位相コヒーレンスや物理的に重要な量の忠実度をより良く保持できると報告している。

Abstract

Coherent nonlinear wave dynamics are often strongly shaped by a compact set of physically meaningful descriptors of the initial state. Traditional neural operators typically treat the input-output mapping as a largely black-box high-dimensional regression problem, without explicitly exploiting this structured physical context. Common feature-integration strategies usually rely on direct concatenation or FiLM-style affine modulation in hidden latent spaces. Here we introduce a different paradigm, loosely inspired by the complementary roles of state evolution and physically meaningful observables in quantum mechanics: the wave field is learned through a standard DeepONet state pathway, while compact physical descriptors follow a parallel conditioning pathway and act as residual modulation factors on the state prediction. Based on this idea, we develop a Multi-Head Residual-Gated DeepONet (MH-RG), which combines a pre-branch residual modulator, a branch residual gate, and a trunk residual gate with a low-rank multi-head mechanism to capture multiple complementary conditioned response patterns without prohibitive parameter growth. We evaluate the framework on representative benchmarks including highly nonlinear conservative wave dynamics and dissipative trapped dynamics and further perform detailed mechanistic analyses of the learned multi-head gating behavior. Compared with feature-augmented baselines, MH-RG DeepONet achieves consistently lower error while better preserving phase coherence and the fidelity of physically relevant dynamical quantities.