Multi-AUV Ad-hoc Networks-Based Multi-Target Tracking Based on Scene-Adaptive Embodied Intelligence

arXiv cs.RO / 3/31/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses the challenge of reliable multi-target tracking with multiple autonomous underwater vehicles (AUVs) operating over ad-hoc acoustic networks that suffer from dynamic topology changes and very limited bandwidth.
  • It proposes a scene-adaptive embodied intelligence (EI) architecture that unifies perception, decision-making, and physical execution in a closed cognitive loop, treating the communications channel as a dynamic constraint during control.
  • The authors introduce a beacon-based communication and control model to bridge high-level policy inference and decentralized physical actuation under constrained, fluctuating links.
  • A three-layer functional framework is presented, along with a Scene-Adaptive MARL (SA-MARL) algorithm that uses a dual-path critic with weight-based dynamic fusion to separate specialized tracking from global safety constraints.
  • Experiments indicate faster policy convergence and improved tracking accuracy versus mainstream MARL baselines, with robustness under strong environmental interference and fluid topological shifts.

Abstract

With the rapid advancement of underwater net-working and multi-agent coordination technologies, autonomous underwater vehicle (AUV) ad-hoc networks have emerged as a pivotal framework for executing complex maritime missions, such as multi-target tracking. However, traditional data-centricarchitectures struggle to maintain operational consistency under highly dynamic topological fluctuations and severely constrained acoustic communication bandwidth. This article proposes a scene-adaptive embodied intelligence (EI) architecture for multi-AUV ad-hoc networks, which re-envisions AUVs as embodied entities by integrating perception, decision-making, and physical execution into a unified cognitive loop. To materialize the functional interaction between these layers, we define a beacon-based communication and control model that treats the communication link as a dynamic constraint-aware channel, effectively bridging the gap between high-level policy inference and decentralized physical actuation. Specifically, the proposed architecture employs a three-layer functional framework and introduces a Scene-Adaptive MARL (SA-MARL) algorithm featuring a dual-path critic mechanism. By integrating a scene critic network and a general critic network through a weight-based dynamic fusion process, SA-MARL effectively decouples specialized tracking tasks from global safety constraints, facilitating autonomous policy evolution. Evaluation results demonstrate that the proposedscheme significantly accelerates policy convergence and achieves superior tracking accuracy compared to mainstream MARL approaches, maintaining robust performance even under intense environmental interference and fluid topological shifts.