Disentangling Damage from Operational Variability: A Label-Free Self-Supervised Representation Learning Framework for Output-Only Structural Damage Identification

arXiv cs.LG / 4/22/2026

📰 NewsModels & Research

Key Points

  • The paper targets structural health monitoring reliability issues by addressing “nuisance” operational and environmental variations that can mask or mimic damage effects in vibration data.
  • It proposes a label-free, self-supervised representation learning framework using an autoencoder with disentangled latent representations trained directly from raw vibration acceleration signals.
  • A VICReg-based self-supervised invariance regularization is applied using baseline data assumed to keep structural damage constant while operational/environmental conditions vary, helping the model separate damage-related factors from variability.
  • The method also adds a frequency-domain constraint to align power spectral density reconstructed from the latent representation with the PSD computed from the input time series.
  • Experiments on real-world bridge and gearbox vibration datasets show robustness to operational variability, strong generalization, and good performance for both damage detection and quantification.

Abstract

Damage identification is a core task in structural health monitoring. In practice, however, its reliability is often compromised by confounding non-damage effects, such as variations in excitation and environmental conditions, which can induce changes comparable to or larger than those caused by structural damage. To address this challenge, this study proposes a self-supervised label-free disentangled representation learning framework for robust vibration-based structural damage identification. The proposed framework employs an autoencoder with two latent representations to learn directly from raw vibration acceleration signals. A self-supervised invariance regularization, implemented via Variance-Invariance-Covariance Regularization (VICReg), is imposed on one latent representation using baseline data where structural damage is assumed constant but operational and environmental conditions vary. In addition, a frequency-domain constraint is introduced to enforce agreement between the power spectral density reconstructed from the latent representation and that computed from the corresponding input time series. Together, these mechanisms promote disentanglement, enabling the learned representation to be sensitive to damage-related characteristics while remaining invariant to nuisance variability. The framework is trained in a fully end-to-end and label-free manner, requiring no prior information on damage, excitation, or environmental conditions, making it well-suited for real-world applications. Its effectiveness is validated on two distinct real-world vibration datasets, including a bridge and a gearbox. The results demonstrate robustness to operational variability, strong generalization capability, and good performance in both damage detection and quantification.