Task-guided Spatiotemporal Network with Diffusion Augmentation for EEG-based Dementia Diagnosis and MMSE Prediction

arXiv cs.LG / 4/28/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper presents a task-guided spatiotemporal neural network (TGSN) for diagnosing dementia from EEG signals and predicting MMSE scores, motivated by the link between neurophysiology and cognitive impairment.
  • To reduce negative transfer in conventional multi-task learning, it uses a task-guided query module for task-specific feature extraction and a gated spatiotemporal attention mechanism to model long-range spatial dependencies and temporal dynamics.
  • The method also improves representation quality via a multi-band EEG feature fusion module and a pre-trained diffusion-process data augmentation module to increase sample diversity.
  • Experiments on the XY02 dataset show strong gains over prior methods, including 97.78% classification accuracy for AD/FTD and 83.93% for AD/FTD/VCI, plus lower MMSE prediction RMSE (1.93 and 2.38).
  • The authors report that evaluation on the DS004504 dataset supports good cross-dataset generalization, suggesting the approach may transfer beyond the original training data.

Abstract

Patients with dementia typically exhibit cognitive impairment, which is routinely assessed using the Mini-Mental State Examination (MMSE). Concurrently, their underlying neurophysiological abnormalities are reflected in Electroencephalography (EEG), providing a basis for joint modeling. However, traditional multi-task approaches suffer from feature entanglement, which leads to inter-task interference when handling heterogeneous objectives.To address this challenge, we propose a task-guided spatiotemporal network (TGSN) with diffusion augmentation for EEG-based dementia diagnosis and MMSE prediction. Specifically, TGSN integrates a multi-band feature fusion module to capture complementary spectral information from EEG. Meanwhile, a pre-trained data augmentation module utilizing a diffusion process is introduced toincrease sample diversity. To model the complex spatiotemporal patterns of EEG, we propose a gated spatiotemporal attention module that captures long-range spatial dependencies and temporal dynamics. Moreover, we design a task-guided query module to achieve task-specific feature extraction, thereby mitigating task interference. The effectiveness of TGSN is evaluated on the XY02 dataset. Experimental results demonstrate that the proposed network outperforms several state-of-the-art methods, achieving classification accuracies of 97.78\% for Alzheimer's Disease (AD)/Frontotemporal Dementia (FTD) and 83.93\% for AD/FTD/Vascular Cognitive Impairment (VCI), which exceed the best baselines by 16.39\% and 8.28\%, respectively. In parallel, it reduces the RMSE for MMSE prediction to 1.93 and 2.38, achieving significant error reductions of 1.44 and 1.43 compared to the best baselines. Additionally, validation on the DS004504 dataset demonstrates strong cross-dataset generalization...

Continue reading this article on the original site.

Read original →