Unsupervised Anomaly Detection in Process-Complex Industrial Time Series: A Real-World Case Study

arXiv cs.LG / 4/16/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that real industrial time-series are more complex than benchmark datasets because of heterogeneous, multi-stage process dynamics that change across operation modes.
  • It introduces an empirical study using a dataset collected from fully operational industrial machinery, designed to reflect process-induced variability.
  • Model evaluations compare a classical Isolation Forest baseline against several autoencoder architectures, finding Isolation Forest insufficient for the non-periodic, multi-scale behavior.
  • Among autoencoders, temporal convolutional autoencoders show the most robust anomaly detection performance, while recurrent and variational autoencoders are effective but demand more careful hyperparameter tuning.

Abstract

Industrial time-series data from real production environments exhibits substantially higher complexity than commonly used benchmark datasets, primarily due to heterogeneous, multi-stage operational processes. As a result, anomaly detection methods validated under simplified conditions often fail to generalize to industrial settings. This work presents an empirical study on a unique dataset collected from fully operational industrial machinery, explicitly capturing pronounced process-induced variability. We evaluate which model classes are capable of capturing this complexity, starting with a classical Isolation Forest baseline and extending to multiple autoencoder architectures. Experimental results show that Isolation Forest is insufficient for modeling the non-periodic, multi-scale dynamics present in the data, whereas autoencoders consistently perform better. Among them, temporal convolutional autoencoders achieve the most robust performance, while recurrent and variational variants require more careful tuning.