AI Navigate

FedBPrompt: Federated Domain Generalization Person Re-Identification via Body Distribution Aware Visual Prompts

arXiv cs.CV / 3/16/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • FedBPrompt introduces a federated domain generalization method for person re-identification that uses body distribution aware visual prompts to guide Transformer attention toward pedestrian-centric regions across decentralized clients.
  • The Body Distribution Aware Visual Prompts Mechanism (BAPM) combines Holistic Full Body Prompts to suppress background noise and Body Part Alignment Prompts to capture pose- and viewpoint-robust details.
  • A Prompt-based Fine-Tuning Strategy (PFTS) freezes the ViT backbone and updates only lightweight prompts to significantly reduce communication overhead while maintaining adaptability.
  • Experimental results show that BAPM improves feature discrimination and cross-domain generalization, with PFTS achieving gains in only a few aggregation rounds and easy integration into existing ViT-based FedDG-ReID frameworks, with code available at https://github.com/leavlong/FedBPrompt.

Abstract

Federated Domain Generalization for Person Re-Identification (FedDG-ReID) learns domain-invariant representations from decentralized data. While Vision Transformer (ViT) is widely adopted, its global attention often fails to distinguish pedestrians from high similarity backgrounds or diverse viewpoints -- a challenge amplified by cross-client distribution shifts in FedDG-ReID. To address this, we propose Federated Body Distribution Aware Visual Prompt (FedBPrompt), introducing learnable visual prompts to guide Transformer attention toward pedestrian-centric regions. FedBPrompt employs a Body Distribution Aware Visual Prompts Mechanism (BAPM) comprising: Holistic Full Body Prompts to suppress cross-client background noise, and Body Part Alignment Prompts to capture fine-grained details robust to pose and viewpoint variations. To mitigate high communication costs, we design a Prompt-based Fine-Tuning Strategy (PFTS) that freezes the ViT backbone and updates only lightweight prompts, significantly reducing communication overhead while maintaining adaptability. Extensive experiments demonstrate that BAPM effectively enhances feature discrimination and cross-domain generalization, while PFTS achieves notable performance gains within only a few aggregation rounds. Moreover, both BAPM and PFTS can be easily integrated into existing ViT-based FedDG-ReID frameworks, making FedBPrompt a flexible and effective solution for federated person re-identification. The code is available at https://github.com/leavlong/FedBPrompt.