AI Navigate

WaveComm: Lightweight Communication for Collaborative Perception via Wavelet Feature Distillation

arXiv cs.CV / 3/17/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The article addresses scalability and real-time performance challenges in bandwidth-limited multi-agent sensing by proposing WaveComm.
  • WaveComm uses Discrete Wavelet Transform to decompose feature maps and transmits only low-frequency components, with high-frequency details reconstructed at the receiver by a lightweight generator.
  • A Multi-Scale Distillation (MSD) loss is employed to optimize reconstruction quality across pixel, structural, semantic, and distributional levels.
  • Experimental results on OPV2V and DAIR-V2X show that WaveComm maintains state-of-the-art perception performance while reducing communication volume to approximately 86-87% of the original.
  • Ablation studies validate the effectiveness of the key components and demonstrate competitive improvements in both communication efficiency and perception accuracy compared to existing approaches.

Abstract

In multi-agent collaborative sensing systems, substantial communication overhead from information exchange significantly limits scalability and real-time performance, especially in bandwidth-constrained environments. This often results in degraded performance and reduced reliability. To address this challenge, we propose WaveComm, a wavelet-based communication framework that drastically reduces transmission loads while preserving sensing performance in low-bandwidth scenarios. The core innovation of WaveComm lies in decomposing feature maps using Discrete Wavelet Transform (DWT), transmitting only compact low-frequency components to minimize communication overhead. High-frequency details are omitted, and their effects are reconstructed at the receiver side using a lightweight generator. A Multi-Scale Distillation (MSD) Loss is employed to optimize the reconstruction quality across pixel, structural, semantic, and distributional levels. Experiments on the OPV2V and DAIR-V2X datasets for LiDAR-based and camera-based perception tasks demonstrate that WaveComm maintains state-of-the-art performance even when the communication volume is reduced to 86.3% and 87.0% of the original, respectively. Compared to existing approaches, WaveComm achieves competitive improvements in both communication efficiency and perception accuracy. Ablation studies further validate the effectiveness of its key components.