Train-Small Deploy-Large: Leveraging Diffusion-Based Multi-Robot Planning

arXiv cs.RO / 4/9/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • 既存の学習ベースのマルチロボット経路計画は、訓練時のロボット数に固定されがちで、配備時にロボット数が増える状況で一般化できない課題がある。
  • 本論文では、ダイナミックに変化するエージェント数に対応可能な拡散モデルベースのプランナーを提案し、少数エージェントで学習して大規模エージェントへ展開できる「train small deploy large」方針を実現する。
  • 共有の拡散モデルに、相互注意(inter-agent attention)計算と時間的畳み込み(temporal convolution)を組み合わせることで、精度を維持しつつスケール性能を高める。
  • 複数のシナリオで検証し、従来のマルチエージェント強化学習手法やヒューリスティック制御と比較して有効性を示している。

Abstract

Learning based multi-robot path planning methods struggle to scale or generalize to changes, particularly variations in the number of robots during deployment. Most existing methods are trained on a fixed number of robots and may tolerate a reduced number during testing, but typically fail when the number increases. Additionally, training such methods for a larger number of agents can be both time consuming and computationally expensive. However, analytical methods can struggle to scale computationally or handle dynamic changes in the environment. In this work, we propose to leverage a diffusion model based planner capable of handling dynamically varying number of agents. Our approach is trained on a limited number of agents and generalizes effectively to larger numbers of agents during deployment. Results show that integrating a single shared diffusion model based planner with dedicated inter-agent attention computation and temporal convolution enables a train small deploy-large paradigm with good accuracy. We validate our method across multiple scenarios and compare the performance with existing multi-agent reinforcement learning techniques and heuristic control based methods.