Scalable Context-Aware Graph Attention for Unsupervised Anomaly Detection in Large-Scale Mobile Networks

arXiv cs.AI / 5/4/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses the challenge of detecting anomalies in mobile networks using high-dimensional KPI time series across many network elements, where labeling incidents at scale is too costly for supervised methods.
  • It introduces C-MTAD-GAT, an unsupervised framework that uses context-aware multivariate time-series modeling with graph attention, lightweight context conditioning, and a dual-head decoder for reconstruction and multi-step forecasting.
  • The approach yields per-element, per-feature anomaly scores and converts them into alert decisions using fully unsupervised thresholds derived from validation residuals.
  • Experiments on the TELCO dataset show improved event-level affiliation and pointwise F1 with fewer alarms than previous graph-attention and VAE-based baselines.
  • The same method is deployed on nation-scale mobile network control-plane data, where operator feedback suggests the alerts are actionable and support scalable daily monitoring without labeled incidents.

Abstract

Mobile network operators must monitor thousands of heterogeneous network elements across the radio access network and the packet core, each exposing high-dimensional KPI time series. The scale and cost of incident labelling make supervised approaches impractical, motivating unsupervised anomaly detection robust to context shifts and nonstationarity. We propose \textbf{C-MTAD-GAT} (\emph{Context-aware Multivariate Time-series Anomaly Detection with Graph Attention}), an anomaly detection framework designed to operate as a single shared model across large populations of network elements. The model combines temporal and feature-wise graph attention with lightweight static and dynamic context conditioning and a dual-head decoder for reconstruction and multi-step forecasting. It produces per-element, per-feature anomaly scores, converted to alerts via fully unsupervised thresholds calibrated from validation residuals. On the TELCO dataset released with DC-VAE \cite{garcia2023onemodel}, C-MTAD-GAT improves event-level affiliation and pointwise F1 while generating fewer alarms than prior graph-attention and VAE-based baselines. We then apply the same system to nation-scale radio access and evolved packet core control-plane counter data from a mobile network operator, where it is deployed. Operator feedback indicates the alerts are actionable and support daily monitoring, showing scalability across domains without relying on labelled incidents.