A Comprehensive Benchmark of Histopathology Foundation Models for Kidney Histopathology
arXiv cs.CV / 3/18/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The study systematically evaluates 11 publicly available Histopathology Foundation Models (HFMs) across 11 kidney-specific downstream tasks, covering multiple stains, spatial scales, and clinical objectives.
- It employs tile-level repeated stratified group cross-validation and slide-level repeated nested stratified cross-validation, with Friedman test and pairwise Wilcoxon tests with Holm-Bonferroni correction to assess statistical significance.
- Results show moderate to strong performance on coarse meso-scale tasks such as diagnostic classification and detection of prominent structural alterations, but performance declines for fine-grained microstructural discrimination and prognosis-related signals, largely independent of stain type.
- The authors release kidney-hfm-eval, an open-source Python package, to reproduce the evaluation pipelines, and conclude that kidney-specific, multi-stain, and multimodal HFMs are needed for clinically reliable nephrology decision-making.
Related Articles
I Was Wrong About AI Coding Assistants. Here's What Changed My Mind (and What I Built About It).
Dev.to

Interesting loop
Reddit r/LocalLLaMA
Qwen3.5-122B-A10B Uncensored (Aggressive) — GGUF Release + new K_P Quants
Reddit r/LocalLLaMA
A supervisor or "manager" Al agent is the wrong way to control Al
Reddit r/artificial
FeatherOps: Fast fp8 matmul on RDNA3 without native fp8
Reddit r/LocalLLaMA