AI Navigate

CD-FKD: Cross-Domain Feature Knowledge Distillation for Robust Single-Domain Generalization in Object Detection

arXiv cs.CV / 3/18/2026

📰 NewsModels & Research

Key Points

  • The paper introduces CD-FKD, a cross-domain feature distillation framework that improves single-domain generalization for object detection by leveraging global and instance-wise feature distillation.
  • The training strategy uses diversified data (downscaling and corruption) for the student while the teacher operates on original source-domain data to guide learning.
  • The student learns to mimic teacher features to extract object-centric representations, improving detection performance under challenging domain shifts, including corrupted scenarios.
  • Experiments demonstrate that CD-FKD outperforms state-of-the-art methods in both target-domain generalization and source-domain performance, with implications for real-world applications like autonomous driving and surveillance.

Abstract

Single-domain generalization is essential for object detection, particularly when training models on a single source domain and evaluating them on unseen target domains. Domain shifts, such as changes in weather, lighting, or scene conditions, pose significant challenges to the generalization ability of existing models. To address this, we propose Cross-Domain Feature Knowledge Distillation (CD-FKD), which enhances the generalization capability of the student network by leveraging both global and instance-wise feature distillation. The proposed method uses diversified data through downscaling and corruption to train the student network, whereas the teacher network receives the original source domain data. The student network mimics the features of the teacher through both global and instance-wise distillation, enabling it to extract object-centric features effectively, even for objects that are difficult to detect owing to corruption. Extensive experiments on challenging scenes demonstrate that CD-FKD outperforms state-of-the-art methods in both target domain generalization and source domain performance, validating its effectiveness in improving object detection robustness to domain shifts. This approach is valuable in real-world applications, like autonomous driving and surveillance, where robust object detection in diverse environments is crucial.