Fed3D: Federated 3D Object Detection

arXiv cs.CV / 4/20/2026

📰 NewsDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces Fed3D, a federated learning framework for 3D object detection that targets privacy preservation in multi-robot perception networks.
  • It addresses 3D data and category distribution heterogeneity by using a local-global, class-aware loss to balance gradient updates across different object categories.
  • To cope with limited bandwidth, Fed3D reduces per-round communication by training and sharing a lightweight “federated 3D prompt module” with only a few learnable parameters.
  • Experiments on federated 3D object detection report that Fed3D achieves significantly better performance than state-of-the-art methods while using lower communication cost under limited local training data.

Abstract

3D object detection models trained in one server plays an important role in autonomous driving, robotics manipulation, and augmented reality scenarios. However, most existing methods face severe privacy concern when deployed on a multi-robot perception network to explore large-scale 3D scene. Meanwhile, it is highly challenging to employ conventional federated learning methods on 3D object detection scenes, due to the 3D data heterogeneity and limited communication bandwidth. In this paper, we take the first attempt to propose a novel Federated 3D object detection framework (i.e., Fed3D), to enable distributed learning for 3D object detection with privacy preservation. Specifically, considering the irregular input 3D object in local robot and various category distribution between robots could cause local heterogeneity and global heterogeneity, respectively. We then propose a local-global class-aware loss for the 3D data heterogeneity issue, which could balance gradient back-propagation rate of different 3D categories from local and global aspects. To reduce communication cost on each round, we develop a federated 3D prompt module, which could only learn and communicate the prompts with few learnable parameters. To the end, several extensive experiments on federated 3D object detection show that our Fed3D model significantly outperforms state-of-the-art algorithms with lower communication cost when providing the limited local training data.