GMA-SAWGAN-GP: A Novel Data Generative Framework to Enhance IDS Detection Performance

arXiv cs.AI / 4/1/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces GMA-SAWGAN-GP, a data generative augmentation framework designed to improve Intrusion Detection System (IDS) generalization from known to unknown attacks.
  • It combines a self-attention-enhanced WGAN-GP with Gumbel-Softmax regularization for discrete/categorical feature handling and an MLP-based autoencoder manifold regularizer to stabilize training.
  • A lightweight entropy-regularized gating network adaptively balances adversarial versus reconstruction losses to reduce mode collapse and improve robustness.
  • Experiments on NSL-KDD, UNSW-NB15, and CICIDS2017 show average accuracy gains of 5.3% (binary) and 2.2% (multi-class), with notable improvements for unknown attacks under LOAO evaluation (AUROC +3.9%, TPR@5%FPR +4.8%).
  • Ablation studies confirm the performance contributions of individual components, supporting the overall effectiveness of the framework for mixed-type network traffic.

Abstract

Intrusion Detection System (IDS) is often calibrated to known attacks and generalizes poorly to unknown threats. This paper proposes GMA-SAWGAN-GP, a novel generative augmentation framework built on a Self-Attention-enhanced Wasserstein GAN with Gradient Penalty (WGAN-GP). The generator employs Gumbel-Softmax regularization to model discrete fields, while a Multilayer Perceptron (MLP)-based AutoEncoder acts as a manifold regularizer. A lightweight gating network adaptively balances adversarial and reconstruction losses via entropy regularization, improving stability and mitigating mode collapse. The self-attention mechanism enables the generator to capture both short- and long-range dependencies among features within each record while preserving categorical semantics through Gumbel-Softmax heads. Extensive experiments on NSL-KDD, UNSW-NB15, and CICIDS2017 using five representative IDS models demonstrate that GMA-SAWGAN-GP significantly improves detection performance on known attacks and enhances generalization to unknown attacks. Leave-One-Attack-type-Out (LOAO) evaluations using Area Under the Receiver Operating Characteristic (AUROC) and True Positive Rate at a 5 percent False Positive Rate confirm that IDS models trained on augmented datasets achieve higher robustness under unseen attack scenarios. Ablation studies validate the contribution of each component to performance gains. Compared with baseline models, the proposed framework improves binary classification accuracy by an average of 5.3 percent and multi-classification accuracy by 2.2 percent, while AUROC and True Positive Rate at a 5 percent False Positive Rate for unknown attacks increase by 3.9 percent and 4.8 percent, respectively, across the three datasets. Overall, GMA-SAWGAN-GP provides an effective approach to generative augmentation for mixed-type network traffic, improving IDS accuracy and resilience.