Granular Ball Guided Stable Latent Domain Discovery for Domain-General Crowd Counting

arXiv cs.CV / 3/26/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper tackles single-source domain generalization for crowd counting, focusing on the hard problem of stable latent domain discovery when source features are heterogeneous and shift at test time.
  • It proposes a granular-ball guided approach that clusters compact local “granular balls” first and then clusters their centers, producing more stable and semantically consistent pseudo-domain assignments than flat sample-level clustering.
  • Using the discovered latent domains, it introduces a two-branch learning framework: a semantic codebook re-encoding branch to improve transferable semantic representations and a style branch to capture appearance variations.
  • The method is evaluated under a strict no-adaptation protocol on ShanghaiTech A/B, UCF-QNRF, and NWPU-Crowd, where it consistently outperforms strong baselines, especially when domain gaps are large.

Abstract

Single-source domain generalization for crowd counting remains highly challenging because a single labeled source domain often contains heterogeneous latent domains, while test data may exhibit severe distribution shifts. A fundamental difficulty lies in stable latent domain discovery: directly performing flat clustering on evolving sample-level latent features is easily affected by feature noise, outliers, and representation drift, leading to unreliable pseudo-domain assignments and weakened domain-structured learning. To address this issue, we propose a granular ball guided stable latent domain discovery framework for domain-general crowd counting. Specifically, the proposed method first organizes samples into compact local granular balls and then clusters granular ball centers as representatives to obtain pseudo-domains, transforming direct sample-level clustering into a hierarchical representative-based clustering process. This design yields more stable and semantically consistent pseudo-domain assignments. Built upon the discovered latent domains, we further develop a two-branch learning framework that enhances transferable semantic representations via semantic codebook re-encoding while modeling domain-specific appearance variations through a style branch, thereby reducing semantic--style entanglement and improving generalization under domain shifts. Extensive experiments on ShanghaiTech A/B, UCF\_QNRF, and NWPU-Crowd under a strict no-adaptation protocol demonstrate that the proposed method consistently outperforms strong baselines, especially under large domain gaps.

Granular Ball Guided Stable Latent Domain Discovery for Domain-General Crowd Counting | AI Navigate