AI Navigate

3D MRI-Based Alzheimer's Disease Classification Using Multi-Modal 3D CNN with Leakage-Aware Subject-Level Evaluation

arXiv cs.CV / 3/19/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The work proposes a multimodal 3D convolutional neural network that fuses structural T1 MRI with gray matter, white matter, and CSF probability maps from FAST segmentation to capture complementary neuroanatomical information for Alzheimer's disease classification.
  • It evaluates on the OASIS-1 dataset using 5-fold subject-level cross-validation, reporting a mean accuracy of 72.34% ± 4.66% and a ROC AUC of 0.7781 ± 0.0365.
  • The study emphasizes leakage-aware subject-level evaluation and compares volumetric results against slice-based approaches to show how data representation and evaluation strategy affect performance reporting.
  • GradCAM visualizations suggest the model attends to anatomically meaningful regions such as the medial temporal lobe and ventricular areas known to be associated with Alzheimer's structural changes, supporting interpretability.
  • The work establishes a reproducible subject-level benchmark and highlights the potential advantages of volumetric MRI analysis for AD classification.

Abstract

Deep learning has become an important tool for Alzheimer's disease (AD) classification from structural MRI. Many existing studies analyze individual 2D slices extracted from MRI volumes, while clinical neuroimaging practice typically relies on the full three dimensional structure of the brain. From this perspective, volumetric analysis may better capture spatial relationships among brain regions that are relevant to disease progression. Motivated by this idea, this work proposes a multimodal 3D convolutional neural network for AD classification using raw OASIS 1 MRI volumes. The model combines structural T1 information with gray matter, white matter, and cerebrospinal fluid probability maps obtained through FSL FAST segmentation in order to capture complementary neuroanatomical information. The proposed approach is evaluated on the clinically labelled OASIS 1 cohort using 5 fold subject level cross validation, achieving a mean accuracy of 72.34% plus or minus 4.66% and a ROC AUC of 0.7781 plus or minus 0.0365. GradCAM visualizations further indicate that the model focuses on anatomically meaningful regions, including the medial temporal lobe and ventricular areas that are known to be associated with Alzheimer's related structural changes. To better understand how data representation and evaluation strategies may influence reported performance, additional diagnostic experiments were conducted on a slice based version of the dataset under both slice level and subject level protocols. These observations help provide context for the volumetric results. Overall, the proposed multimodal 3D framework establishes a reproducible subject level benchmark and highlights the potential benefits of volumetric MRI analysis for Alzheimer's disease classification.