BrainDINO: A Brain MRI Foundation Model for Generalizable Clinical Representation Learning

arXiv cs.LG / 5/1/2026

📰 NewsTools & Practical UsageModels & Research

Key Points

  • The paper introduces BrainDINO, a self-distilled foundation model for learning generalizable representations from unlabeled brain MRI data.
  • BrainDINO was trained on about 6.6 million axial slices from 20 heterogeneous datasets, enabling a single learned representation to transfer across many different clinical and neuroimaging endpoints.
  • Using a frozen encoder with lightweight task-specific heads, the model supports diverse downstream tasks including tumor segmentation, disease classification, brain age estimation, post-stroke prediction, molecular status prediction, MRI sequence classification, and survival modeling.
  • Experiments show BrainDINO matches or outperforms existing natural-image and MRI-specific self-supervised baselines across tasks, with notable gains when labeled data are scarce.
  • Representation analyses indicate the learned features are anatomically organized and sensitive to pathology even without task-specific supervision, suggesting a scalable route to data-efficient brain MRI analysis without volumetric pretraining or full-network fine-tuning.

Abstract

Brain MRI underpins a wide range of neuroscientific and clinical applications, yet most learning-based methods remain task-specific and require substantial labeled data. Here we show that a single self-supervised representation can generalize across heterogeneous brain MRI endpoints. We trained BrainDINO, a self-distilled foundation model, on approximately 6.6 million unlabeled axial slices from 20 datasets encompassing broad variation in population, disease, and acquisition setting. Using a frozen encoder with lightweight task heads, BrainDINO supported transfer across tumor segmentation, neurodegenerative and neurodevelopmental conditions classification, brain age estimation, post-stroke temporal prediction, molecular status prediction, MRI sequence classification, and survival modeling. Across tasks and supervision regimes, BrainDINO consistently equaled or exceeded natural-image and MRI-specific self-supervised baselines, with particularly strong advantages under label scarcity. Representation analyses further showed anatomically organized and pathology-sensitive feature structure in the absence of task-specific supervision. Our findings indicate that large-scale slice-wise self-supervised learning can yield a unified brain MRI representation that supports diverse neuroimaging tasks without volumetric pretraining or full-network fine-tuning, establishing a scalable foundation for robust and data-efficient brain imaging analysis.