FCL-COD: Weakly Supervised Camouflaged Object Detection with Frequency-aware and Contrastive Learning

arXiv cs.CV / 3/25/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces FCL-COD, a weakly supervised camouflaged object detection framework aimed at reducing reliance on labor-intensive mask annotations compared with fully supervised COD.
  • It proposes FoRA (Frequency-aware Low-rank Adaptation) to inject frequency-aware camouflage-scene knowledge into SAM, targeting failures such as non-camouflage object responses.
  • To handle local/extreme response issues and improve foreground-background separation, it uses gradient-aware contrastive learning for more precise boundary delineation.
  • It further adds a multi-scale frequency-aware representation learning strategy to strengthen refined boundary awareness.
  • Experiments on three COD benchmarks show that FCL-COD outperforms existing weakly supervised methods and even some fully supervised approaches, according to the authors.

Abstract

Existing camouflage object detection (COD) methods typically rely on fully-supervised learning guided by mask annotations. However, obtaining mask annotations is time-consuming and labor-intensive. Compared to fully-supervised methods, existing weakly-supervised COD methods exhibit significantly poorer performance. Even for the Segment Anything Model (SAM), there are still challenges in handling weakly-supervised camouflage object detection (WSCOD), such as: a. non-camouflage target responses, b. local responses, c. extreme responses, and d. lack of refined boundary awareness, which leads to unsatisfactory results in camouflage scenes. To alleviate these issues, we propose a frequency-aware and contrastive learning-based WSCOD framework in this paper, named FCL-COD. To mitigate the problem of non-camouflaged object responses, we propose the Frequency-aware Low-rank Adaptation (FoRA) method, which incorporates frequency-aware camouflage scene knowledge into SAM. To overcome the challenges of local and extreme responses, we introduce a gradient-aware contrastive learning approach that effectively delineates precise foreground-background boundaries. Additionally, to address the lack of refined boundary perception, we present a multi-scale frequency-aware representation learning strategy that facilitates the modeling of more refined boundaries. We validate the effectiveness of our approach through extensive empirical experiments on three widely recognized COD benchmarks. The results confirm that our method surpasses both state-of-the-art weakly supervised and even fully supervised techniques.

FCL-COD: Weakly Supervised Camouflaged Object Detection with Frequency-aware and Contrastive Learning | AI Navigate