CTFS : Collaborative Teacher Framework for Forward-Looking Sonar Image Semantic Segmentation with Extremely Limited Labels

arXiv cs.CV / 3/24/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes CTFS, a collaborative teacher semantic segmentation framework tailored to forward-looking sonar images, which suffer from speckle noise, low texture contrast, shadows, and geometric distortions.
  • It uses one general teacher plus multiple sonar-specific teachers and a multi-teacher alternating guidance strategy to help the student learn both general semantics and sonar-specific features.
  • To reduce harm from teachers producing noisy pseudo-labels, CTFS adds a cross-teacher reliability assessment that estimates pseudo-label reliability using prediction consistency and stability across views and teachers.
  • Experiments on the FLSMD dataset show that with only 2% labeled data, the method improves mIoU by 5.08% over other state-of-the-art approaches, indicating strong gains under extremely limited supervision.

Abstract

As one of the most important underwater sensing technologies, forward-looking sonar exhibits unique imaging characteristics. Sonar images are often affected by severe speckle noise, low texture contrast, acoustic shadows, and geometric distortions. These factors make it difficult for traditional teacher-student frameworks to achieve satisfactory performance in sonar semantic segmentation tasks under extremely limited labeled data conditions. To address this issue, we propose a Collaborative Teacher Semantic Segmentation Framework for forward-looking sonar images. This framework introduces a multi-teacher collaborative mechanism composed of one general teacher and multiple sonar-specific teachers. By adopting a multi-teacher alternating guidance strategy, the student model can learn general semantic representations while simultaneously capturing the unique characteristics of sonar images, thereby achieving more comprehensive and robust feature modeling. Considering the challenges of sonar images, which can lead teachers to generate a large number of noisy pseudo-labels, we further design a cross-teacher reliability assessment mechanism. This mechanism dynamically quantifies the reliability of pseudo-labels by evaluating the consistency and stability of predictions across multiple views and multiple teachers, thereby mitigating the negative impact caused by noisy pseudo-labels. Notably, on the FLSMD dataset, when only 2% of the data is labeled, our method achieves a 5.08% improvement in mIoU compared to other state-of-the-art approaches.