CRC-SAM: SAM-Based Multi-Modal Segmentation and Quantification of Colorectal Cancer in CT, Colonoscopy, and Histology Images

arXiv cs.CV / 4/29/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces CRC-SAM, a unified model that segments colorectal cancer consistently across colonoscopy, CT, and histopathology images rather than relying on single-modality inputs.
  • CRC-SAM is built on MedSAM and uses LoRA (low-rank adaptation) layers added to a frozen encoder to enable efficient domain transfer with very few trainable parameters.
  • The approach targets underrepresented modalities by adapting the foundation-model-based segmentation pipeline in a lightweight way, improving portability across the clinical workflow.
  • Experiments on MSD-Colon, CVC-ClinicDB, and EBHI-Seg report better cross-modality performance than prior state-of-the-art baselines.
  • The results suggest that lightweight LoRA adaptation is an effective strategy for multi-modal colorectal cancer analysis using medical foundation models.

Abstract

We present CRC-SAM, a unified framework for colorectal cancer segmentation across colonoscopy, CT, and histopathology images. Unlike prior single-modality methods, CRC-SAM provides consistent, modality-agnostic segmentation throughout the clinical workflow. Built on MedSAM, it incorporates low-rank adaptation (LoRA) layers into a frozen encoder, enabling efficient domain transfer to underrepresented modalities with minimal trainable parameters. Experiments on MSD-Colon, CVC-ClinicDB, and EBHI-Seg demonstrate superior performance across modalities, outperforming state-of-the-art baselines and highlighting the effectiveness of lightweight LoRA adaptation for foundation-model-based colorectal cancer analysis.