PanGuide3D: Cohort-Robust Pancreas Tumor Segmentation via Probabilistic Pancreas Conditioning and a Transformer Bottleneck

arXiv cs.LG / 4/24/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • PanGuide3D targets a key challenge in pancreas tumor segmentation from contrast-enhanced CT: models often fail when trained on one dataset cohort and tested on another due to cohort shift.
  • The method uses a probabilistic pancreas map produced by a dedicated pancreas decoder, then conditions a tumor decoder on this probability at multiple scales using differentiable soft gating.
  • To better capture long-range context under distribution shift without making the architecture complex, PanGuide3D adds a lightweight Transformer bottleneck in the U-Net bottleneck representation.
  • Experiments train on the PanTS dataset and test both in-cohort and out-of-cohort on MSD Task07, evaluating voxel metrics, patient-level tumor detection, subgroup performance (e.g., small tumors and difficult locations), volume-conditioned behavior, and calibration.
  • Results indicate PanGuide3D delivers the best overall tumor performance, improves cross-cohort generalization (especially for small tumors), and reduces anatomically implausible false positives, suggesting practical value for clinical workflows and multi-institutional studies.

Abstract

Pancreatic tumor segmentation in contrast-enhanced computed tomography (CT) is clinically important yet technically challenging: lesions are often small, heterogeneous, and easily confused with surrounding soft tissue, and models that perform well on one cohort frequently degrade under cohort shift. Our goal is to improve cross-cohort generalization while keeping the model architecture simple, efficient, and practical for 3D CT segmentation. We introduce PanGuide3D, a cohort-robust architecture with a shared 3D encoder, a pancreas decoder that predicts a probabilistic pancreas map, and a tumor decoder that is explicitly conditioned on this pancreas probability at multiple scales via differentiable soft gating. To capture long-range context under distribution shift, we further add a lightweight Transformer bottleneck in the U-Net bottleneck representation. We evaluate cohort transfer by training on the PanTS (Pancreatic Tumor Segmentation) cohort and testing both in-cohort (PanTS) and out-of-cohort on MSD (Medical Segmentation Decathlon) Task07 Pancreas, using matched preprocessing and training protocols across strong baselines. We collect voxel-level segmentation metrics, patient-level tumor detection, subgroup analyses by tumor size and anatomical location, volume-conditioned performance analyses, and calibration measurements to assess reliability. Across the evaluated models, PanGuide3D achieves the best overall tumor performance and shows improved cross-cohort generalization, particularly for small tumors and challenging anatomical locations, while reducing anatomically implausible false positives. These findings support probabilistic anatomical conditioning as a practical strategy for improving cross-cohort robustness in an end-to-end model and suggest potential utility for contouring support, treatment planning, and multi-institutional studies.