Anomaly Detection in IEC-61850 GOOSE Networks: Evaluating Unsupervised and Temporal Learning for Real-Time Intrusion Detection

arXiv cs.LG / 4/17/2026

💬 OpinionDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research

Key Points

  • IEC-61850 GOOSE, which is critical for time-sensitive substation communications, is vulnerable to replay, masquerade, and data injection because it lacks built-in security features.
  • The paper studies intrusion detection under strict sub-4ms latency requirements and limited labeled attack data, testing five models on the ERENO IEC-61850 dataset.
  • While a supervised Random Forest achieves the best overall F1 score (0.9516), it is too slow for real-time deployment (21.8ms per prediction), whereas all unsupervised temporal models meet the 4ms constraint.
  • Among the unsupervised approaches, a GRU-based recurrent sequence autoencoder provides the best accuracy-latency tradeoff (F1=0.8737 at 1.118ms).
  • Cross-environment tests on an independent dataset show performance drops under distribution shift for all models, but recurrent unsupervised models degrade less relative to the supervised baseline, indicating temporal modeling generalizes better than labeled attack fitting.
  • Thresholds for unsupervised models are chosen using a held-out validation split to prevent test-set leakage and ensure the reported results are reliable.

Abstract

The IEC-61850 GOOSE protocol underpins time-critical communication in modern digital substations but lacks native security mechanisms, leaving it vulnerable to replay, masquerade, and data injection attacks. Intrusion detection in this setting is challenging due to strict latency constraints (sub-4ms) and limited availability of labeled attack data. This paper evaluates whether unsupervised temporal modeling can provide effective and deployable anomaly detection for GOOSE networks. Five models are compared on the ERENO IEC-61850 dataset: a supervised Random Forest baseline, a feedforward Autoencoder, and three recurrent sequence autoencoders (RNN, LSTM, and GRU). The supervised Random Forest achieves the highest detection performance (F1=0.9516) but fails to meet real-time constraints at 21.8ms per prediction. All four unsupervised models satisfy the 4ms requirement, with the GRU achieving the best accuracy to latency tradeoff among them (F1=0.8737 at 1.118ms). A cross-environment evaluation on an independent dataset shows that all models degrade under distribution shift. However, recurrent models retain substantially higher relative performance than the supervised baseline, suggesting that temporal sequence modeling generalizes better than fitting labeled attack distributions. Anomaly thresholds for the unsupervised models are selected on a held out validation partition to avoid test set leakage. These results support unsupervised temporal models as a practical choice for real-time GOOSE intrusion detection, particularly in environments where labeled training data may be unavailable or where large-scale deployment across diverse substations is required.