A Digital Pathology Resource for Liver Cancer Quantification with Datasets, Benchmarks, and Tools

arXiv cs.CV / 4/28/2026

📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

Key Points

  • The paper introduces HepatoBench, a patch-level histopathology image database for liver cancer that includes annotations across seven key tissue categories to fill a gap in fine-grained labeled data.
  • Using HepatoBench, the authors train and release an open-source deep learning tissue recognition (classification) model to identify tissue types on histology patches.
  • They also release a WSI-level tumor/non-tumor segmentation model that can localize lesion regions across whole-slide images.
  • By combining the patch-level tissue classifier with the WSI-level segmentation, the authors build HepatoQuant, an end-to-end tool for regional quantification that outputs tissue composition parsing and quantitative statistics.
  • The work further open-sources HepatoBench, the benchmarking protocol, and supporting tools to enable reproducible model development and fair comparisons of liver cancer quantification methods.

Abstract

Liver cancer, especially hepatocellular carcinoma (HCC), imposes a substantial global disease burden. Accurate diagnosis and prognostic assessment directly influence treatment selection and patient survival, and pathological examination remains the gold standard for liver cancer diagnosis. Identifying diverse tissue components and pathological subtypes on histopathology slides is crucial for estimating postoperative recurrence risk and overall prognosis. However, most publicly available resources are still provided at the whole-slide image (WSI) level, and well-annotated datasets for fine-grained tissue component identification in liver cancer are scarce, which hinders reproducible model development and the deployment of quantitative analysis tools. To address this gap, we release HepatoBench, a patch-level image database for liver cancer with annotations for seven key tissue categories. Based on HepatoBench, we train and open-source a deep learning classification model as a tissue recognition tool. Furthermore, we train a WSI-level tumor/non-tumor segmentation model to automatically localize lesion regions across entire slides. By integrating the patch-level tissue classifier with the WSI-level segmentation model, we build HepatoQuant, an end-to-end, disease-specific regional quantification tool for liver cancer, enabling a unified workflow from WSIs to tissue composition parsing and quantitative statistics. We also open-source HepatoBench, the benchmarking protocol, and supporting tools, providing a solid foundation for automated regional quantification and fair method comparison in liver cancer pathology.