Improved Multiscale Structural Mapping with Supervertex Vision Transformer for the Detection of Alzheimer's Disease Neurodegeneration

arXiv cs.CV / 4/17/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The study proposes MSSM+, an enhanced multiscale structural mapping framework that improves MRI-based Alzheimer’s disease (AD) screening by adding vertex-level sulcal depth and cortical curvature to the existing MSSM approach.
  • It introduces SSVM to partition the cortical surface into supervertices (surface patches) and uses a Supervertex Vision Transformer (SV-ViT) to learn anatomically informed patterns from these mesh-based representations.
  • Using 3D T1-weighted MRI data from AD patients and cognitively normal controls, MSSM+ finds more extensive and statistically significant structural differences than MSSM.
  • For AD vs. CN classification, MSSM+ improves performance, achieving a 3 percentage-point higher area under the precision-recall curve than MSSM, and shows better robustness across different MRI vendors.
  • The authors conclude that MSSM+ with SV-ViT could serve as a promising non-invasive imaging marker to detect AD neurodegeneration before confirmatory CSF/PET testing.

Abstract

Alzheimer's disease (AD) confirmation often relies on positron emission tomography (PET) or cerebrospinal fluid (CSF) analysis, which are costly and invasive. Consequently, structural MRI biomarkers such as cortical thickness (CT) are widely used for non-invasive AD screening. Multiscale structural mapping (MSSM) was recently proposed to integrate gray-white matter contrasts (GWCs) with CT from a single T1-weighted MRI (T1w) scan. Building on this framework, we propose MSSM+, together with surface supervertex mapping (SSVM) and a Supervertex Vision Transformer (SV-ViT). 3D T1w images from individuals with AD and cognitively normal (CN) controls were analyzed. MSSM+ extends MSSM by incorporating sulcal depth and cortical curvature at the vertex level. SSVM partitions the cortical surface into supervertices (surface patches) that effectively represent inter- and intra-regional spatial relationships. SV-ViT is a Vision Transformer architecture operating on these supervertices, enabling anatomically informed learning from surface mesh representations. Compared with MSSM, MSSM+ identified more spatially extensive and statistically significant group differences between AD and CN. In AD vs. CN classification, MSSM+ achieved a 3%p higher area under the precision-recall curve than MSSM. Vendor-specific analyses further demonstrated reduced signal variability and consistently improved classification performance across MR manufacturers relative to CT, GWCs, and MSSM. These findings suggest that MSSM+ combined with SV-ViT is a promising MRI-based imaging marker for AD detection prior to CSF/PET confirmation.