A Comprehensive Benchmark of Histopathology Foundation Models for Kidney Histopathology
arXiv cs.CV / 3/18/2026
Key Points
- The study systematically evaluates 11 publicly available Histopathology Foundation Models (HFMs) across 11 kidney-specific downstream tasks, covering multiple stains, spatial scales, and clinical objectives.
- It uses tile-level repeated stratified group cross-validation and slide-level repeated nested stratified cross-validation, and assesses statistical significance with a Friedman test and pairwise Wilcoxon tests under Holm-Bonferroni correction.
- Results show moderate to strong performance on coarse meso-scale tasks such as diagnostic classification and detection of prominent structural alterations, but performance declines for fine-grained microstructural discrimination and prognosis-related signals, largely independent of stain type.
- The authors release kidney-hfm-eval, an open-source Python package, to reproduce the evaluation pipelines, and conclude that kidney-specific, multi-stain, and multimodal HFMs are needed for clinically reliable nephrology decision-making.
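
The Holm-Bonferroni step mentioned in the key points can be sketched in a few lines of plain Python. This is a minimal illustration, not code from kidney-hfm-eval: the function name `holm_adjust`, the model labels, and the p-values are hypothetical and are not results from the paper. In practice the raw p-values would come from the pairwise Wilcoxon tests (for example via `scipy.stats.wilcoxon`).

```python
def holm_adjust(p_values):
    """Return Holm-Bonferroni adjusted p-values, in the input order.

    Hypothetical helper for illustration; not part of kidney-hfm-eval.
    """
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        # The k-th smallest p-value (k = rank + 1) is scaled by (m - k + 1).
        candidate = min(1.0, (m - rank) * p_values[idx])
        # Enforce monotonicity: an adjusted value never drops below
        # the adjusted value of any smaller raw p-value.
        running_max = max(running_max, candidate)
        adjusted[idx] = running_max
    return adjusted


# Illustrative raw p-values for four hypothetical model pairs.
raw_p = {
    ("model_A", "model_B"): 0.01,
    ("model_A", "model_C"): 0.04,
    ("model_B", "model_C"): 0.03,
    ("model_A", "model_D"): 0.005,
}

alpha = 0.05
adjusted_p = holm_adjust(list(raw_p.values()))
for pair, p_adj in zip(raw_p, adjusted_p):
    verdict = "significant" if p_adj < alpha else "not significant"
    print(f"{pair[0]} vs {pair[1]}: adjusted p = {p_adj:.3f} ({verdict})")
```

Comparing adjusted p-values against a fixed threshold is equivalent to the usual step-down rejection rule, and it keeps the family-wise error rate controlled across all model pairs without assuming the comparisons are independent.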