Paper Title: LoV3D: Grounding Cognitive Prognosis Reasoning in Longitudinal 3D Brain MRI via Regional Volume Assessments
arXiv cs.CV / 3/13/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- LoV3D introduces a pipeline for training 3D vision-language models to read longitudinal T1-weighted brain MRI and output region-level anatomical assessments with longitudinal comparisons, ultimately providing a three-class cognitive diagnosis (Cognitively Normal, Mild Cognitive Impairment, or Dementia) and a synthesized diagnostic summary.
- The approach grounds the final diagnosis by enforcing label consistency, longitudinal coherence, and biological plausibility to reduce the risk of hallucinations.
- It trains a clinically-weighted Verifier that scores candidate outputs against normative references from standardized volume metrics, enabling Direct Preference Optimization without any human annotation.
- On a subject-level held-out ADNI test set, LoV3D achieves 93.7% three-class diagnostic accuracy (a +34.8% improvement over a no-grounding baseline), 97.2% two-class accuracy, and strong zero-shot transfer performance across MIRIAD and AIBL, with code available at the provided GitHub repo.
Related Articles
How to Build an AI Team: The Solopreneur Playbook
Dev.to
CrewAI vs AutoGen vs LangGraph: Which Agent Framework to Use
Dev.to

14 Best Self-Hosted Claude Alternatives for AI and Coding in 2026
Dev.to
[P] Finetuned small LMs to VLM adapters locally and wrote a short article about it
Reddit r/MachineLearning
Experiment: How far can a 28M model go in business email generation?
Reddit r/LocalLLaMA