AI Navigate

Domain Adaptation Without the Compute Burden for Efficient Whole Slide Image Analysis

arXiv cs.CV / 3/18/2026

📰 NewsModels & Research

Key Points

  • The paper introduces EfficientWSI (eWSI), a method that combines Parameter-Efficient-Fine-Tuning (PEFT) with MIL to enable end-to-end training on whole slide images without heavy domain-specific pretraining.
  • eWSI achieves competitive or superior performance compared to MIL using in-domain feature extractors, even when paired with ImageNet-based features, reducing computational costs.
  • When used with in-domain feature extractors, eWSI further improves performance, showing its ability to capture task-specific information within histopathology.
  • The approach is evaluated on seven WSI-level tasks across Camelyon16, TCGA, and BRACS datasets, demonstrating broad applicability.
  • The work highlights a task-targeted, computationally efficient path for computational pathology, potentially enabling more scalable WSI analysis.

Abstract

Computational methods on analyzing Whole Slide Images (WSIs) enable early diagnosis and treatments by supporting pathologists in detection and classification of tumors. However, the extremely high resolution of WSIs makes end-to-end training impractical compared to typical image analysis tasks. To address this, most approaches use pre-trained feature extractors to obtain fixed representations of whole slides, which are then combined with Multiple Instance Learning (MIL) for downstream tasks. These feature extractors are typically pre-trained on natural image datasets such as ImageNet, which fail to capture domain-specific characteristics. Although domain-specific pre-training on histopathology data yields more relevant feature representations, it remains computationally expensive and fail to capture task-specific characteristics within the domain. To address the computational cost and lack of task-specificity in domain-specific pre-training, we propose EfficientWSI (eWSI), a careful integration of Parameter-Efficient-Fine-Tuning (PEFT) and Multiple Instance Learning (MIL) that enables end-to-end training on WSI tasks. We evaluate eWSI on seven WSI-level tasks over Camelyon16, TCGA and BRACS datasets. Our results show that eWSI when applied with ImageNet feature extractors yields strong classification performance, matching or outperforming MILs with in-domain feature extractors, alleviating the need for extensive in-domain pre-training. Furthermore, when eWSI is applied with in-domain feature extractors, it further improves classification performance in most cases, demonstrating its ability to capture task-specific information where beneficial. Our findings suggest that eWSI provides a task-targeted, computationally efficient path for WSI tasks, offering a promising direction for task-specific learning in computational pathology.