Generalization Under Scrutiny: Cross-Domain Detection Progresses, Pitfalls, and Persistent Challenges

arXiv cs.CV / 4/10/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The article surveys cross-domain object detection (CDOD), focusing on why models trained on one domain degrade sharply in unseen target domains due to shifts in sensing conditions, environments, and data distributions.
  • It proposes a unified problem formulation that treats detection under domain shift as a multi-stage process and organizes existing approaches via a taxonomy based on adaptation paradigms, assumptions, and pipeline components.
  • It explains how domain shift propagates across object-detection stages and argues that adaptation is more complex for detection than for classification because multiple components (e.g., localization and classification) are affected.
  • The survey reviews datasets, evaluation protocols, and benchmarking practices, highlighting fragmentation in the literature and the lack of unified perspectives on structural challenges.
  • It concludes by identifying persistent challenges and outlining promising future research directions to enable more robust cross-domain detection systems.

Abstract

Object detection models trained on a source domain often exhibit significant performance degradation when deployed in unseen target domains, due to various kinds of variations, such as sensing conditions, environments and data distributions. Hence, regardless the recent breakthrough advances in deep learning-based detection technology, cross-domain object detection (CDOD) remains a critical research area. Moreover, the existing literature remains fragmented, lacking a unified perspective on the structural challenges underlying domain shift and the effectiveness of adaptation strategies. This survey provides a comprehensive and systematic analysis of CDOD. We start upon a problem formulation that highlights the multi-stage nature of object detection under domain shift. Then, we organize the existing methods through a conceptual taxonomy that categorizes approaches based on adaptation paradigms, modeling assumptions, and pipeline components. Furthermore, we analyze how domain shift propagates across detection stages and discuss why adaptation in object detection is inherently more complex than in classification. In addition, we review commonly used datasets, evaluation protocols, and benchmarking practices. Finally, we identify the key challenges and outline promising future research directions. Cohesively, this survey aims to provide a unified framework for understanding CDOD and to guide the development of more robust detection systems.