AI Navigate

Deep Convolutional Architectures for EEG Classification: A Comparative Study with Temporal Augmentation and Confidence-Based Voting

arXiv cs.AI / 3/17/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper compares three deep CNN architectures for EEG ERP classification: a CSP-based 2D CNN, a second 2D CNN trained on raw data, and a 3D CNN that jointly models spatiotemporal representations.
  • It introduces a temporal shift augmentation strategy during training to address ERP latency variability.
  • At inference, a confidence-based test-time voting mechanism is used to stabilize predictions across shifted trials.
  • Experimental evaluation with stratified five-fold cross-validation shows the 3D CNN significantly outperforms the 2D variants in AUC and balanced accuracy, while CSP provides a boost to the 2D model.
  • The results highlight the value of temporal-aware architectures and augmentation strategies for robust EEG signal classification.

Abstract

Electroencephalography (EEG) classification plays a key role in brain-computer interface (BCI) systems, yet it remains challenging due to the low signal-to-noise ratio, temporal variability of neural responses, and limited data availability. In this paper, we present a comparative study of deep learning architectures for classifying event-related potentials (ERPs) in EEG signals. The preprocessing pipeline includes bandpass filtering, spatial filtering, and normalization. We design and compare three main pipelines: a 2D convolutional neural network (CNN) using Common Spatial Pattern (CSP), a second 2D CNN trained directly on raw data for a fair comparison, and a 3D CNN that jointly models spatiotemporal representations. To address ERP latency variations, we introduce a temporal shift augmentation strategy during training. At inference time, we employ a confidence-based test-time voting mechanism to improve prediction stability across shifted trials. An experimental evaluation on a stratified five-fold cross-validation protocol demonstrates that while CSP provides a benefit to the 2D architecture, the proposed 3D CNN significantly outperforms both 2D variants in terms of AUC and balanced accuracy. These findings highlight the effectiveness of temporal-aware architectures and augmentation strategies for robust EEG signal classification.