Detection and Classification of (Pre)Cancerous Cells in Pap Smears: An Ensemble Strategy for the RIVA Cervical Cytology Challenge

arXiv cs.CV / 3/26/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses automated detection and multi-class classification of eight Bethesda categories from Pap smear images, targeting improved cervical cancer screening performance at scale despite class imbalance and nuclear overlap.
  • It evaluates three mitigation strategies—loss reweighting, data resampling, and transfer learning—using YOLOv11m as a base detector architecture.
  • The authors create an ensemble that combines independently trained models and merges their outputs via Weighted Boxes Fusion (WBF) to leverage complementary detection behavior.
  • On the RIVA Cervical Cytology Challenge (ISBI 2026) preliminary and final test sets, the ensemble reaches mAP50-95 values of 0.201 and 0.147 respectively, including a reported 29% improvement over the best single model on the final test set.

Abstract

Automated detection and classification of cervical cells in conventional Pap smear images can strengthen cervical cancer screening at scale by reducing manual workload, improving triage, and increasing consistency across readers. However, it is challenged by severe class imbalance and frequent nuclear overlap. We present our approach to the RIVA Cervical Cytology Challenge (ISBI 2026), which requires multi-class detection of eight Bethesda cell categories under these conditions. Using YOLOv11m as the base architecture, we systematically evaluate three strategies to improve detection performance: loss reweighting, data resampling and transfer learning. We build an ensemble by combining models trained under each strategy, promoting complementary detection behavior and combining them through Weighted Boxes Fusion (WBF). The ensemble achieves a mAP50-95 of 0.201 on the preliminary test set and 0.147 on the final test set, representing a 29% improvement over the best individual model on the final test set and demonstrating the effectiveness of combining complementary imbalance mitigation strategies.