A Multi-head Attention Fusion Network for Industrial Prognostics under Discrete Operational Conditions

arXiv cs.LG / 4/14/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces a multi-head attention fusion neural network to improve industrial prognostics when systems operate under discrete, dynamically changing conditions.
  • It explicitly models three components: a monotonic degradation trend, discrete operational states encoded as dense embeddings (via clustering), and residual random noise for unexplained sensor variation.
  • The architecture combines BiLSTM with attention to better capture complex temporal dependencies by adaptively weighting time steps and sensor signals.
  • A fusion module integrates the degradation-trend branch with operating-state embeddings to capture interactions between underlying degradation and operational context.
  • Experiments on a NASA repository dataset show the approach is effective for prognostic prediction under varying operational conditions.

Abstract

Complex systems such as aircraft engines, turbines, and industrial machinery often operate under dynamically changing conditions. These varying operating conditions can substantially influence degradation behavior and make prognostic modeling more challenging, as accurate prediction requires explicit consideration of operational effects. To address this issue, this paper proposes a novel multi-head attention-based fusion neural network. The proposed framework explicitly models and integrates three signal components: (1) the monotonic degradation trend, which reflects the underlying deterioration of the system; (2) discrete operating states, identified through clustering and encoded into dense embeddings; and (3) residual random noise, which captures unexplained variation in sensor measurements. The core strength of the framework lies in its architecture, which combines BiLSTM networks with attention mechanisms to better capture complex temporal dependencies. The attention mechanism allows the model to adaptively weight different time steps and sensor signals, improving its ability to extract prognostically relevant information. In addition, a fusion module is designed to integrate the outputs from the degradation-trend branch and the operating-state embeddings, enabling the model to capture their interactions more effectively. The proposed method is validated using a dataset from the NASA repository, and the results demonstrate its effectiveness.