CCDNet: Learning to Detect Camouflage against Distractors in Infrared Small Target Detection

arXiv cs.CV / 4/1/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes CCDNet for infrared small target detection, focusing on two key problems: camouflage that blends targets into complex backgrounds and distractors that trigger false alarms.
  • CCDNet uses a Weighted Multi-branch Perceptron (WMP) backbone to aggregate self-conditioned multi-level features for better target/background representation.
  • It introduces an Aggregation-and-Refinement Fusion Neck (ARFN) that refines shallow/deep feature maps and bidirectionally reconstructs target–background relations to suppress complex backgrounds.
  • A new Contrastive-aided Distractor Discriminator (CaDD) is added to improve discrimination between true targets and similar-looking distractors via adaptive local/global similarity computation.
  • Experiments on infrared image datasets reportedly show CCDNet outperforming existing state-of-the-art methods.

Abstract

Infrared target detection (IRSTD) tasks have critical applications in areas like wilderness rescue and maritime search. However, detecting infrared targets is challenging due to their low contrast and tendency to blend into complex backgrounds, effectively camouflaging themselves. Additionally, other objects with similar features (distractors) can cause false alarms, further degrading detection performance. To address these issues, we propose a novel \textbf{C}amouflage-aware \textbf{C}ounter-\textbf{D}istraction \textbf{Net}work (CCDNet) in this paper. We design a backbone with Weighted Multi-branch Perceptrons (WMPs), which aggregates self-conditioned multi-level features to accurately represent the target and background. Based on these rich features, we then propose a novel Aggregation-and-Refinement Fusion Neck (ARFN) to refine structures/semantics from shallow/deep features maps, and bidirectionally reconstruct the relations between the targets and the backgrounds, highlighting the targets while suppressing the complex backgrounds to improve detection accuracy. Furthermore, we present a new Contrastive-aided Distractor Discriminator (CaDD), enforcing adaptive similarity computation both locally and globally between the real targets and the backgrounds to more precisely discriminate distractors, so as to reduce the false alarm rate. Extensive experiments on infrared image datasets confirm that CCDNet outperforms other state-of-the-art methods.