DeepFleet: Multi-Agent Foundation Models for Mobile Robots
arXiv cs.RO / 4/14/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- DeepFleet proposes a suite of foundation models for coordinating and planning large-scale mobile robot fleets, trained on warehouse fleet movement data from hundreds of thousands of robots at Amazon.
- The work explores four model architectures with different inductive biases: robot-centric decision transformer neighborhoods, robot-floor cross-attention to the warehouse floor, image-floor convolutional encoding of fleet state as multi-channel images, and graph-floor temporal attention combined with graph neural networks.
- Evaluation examines how architectural design choices affect prediction performance across tasks, showing robot-centric and graph-floor approaches as the most promising due to asynchronous updates and localized interaction structure.
- Scaling experiments indicate that the robot-centric and graph-floor models benefit from larger warehouse operation datasets, improving effectiveness as data and model size increase.
💡 Insights using this article
This article is featured in our daily AI news digest — key takeaways and action items at a glance.
Related Articles

Black Hat Asia
AI Business

From Hype to Hyperproductivity: How Boomi Agentstudio Turns Experimental AI Agents into Real-World Powerhouses
Dev.to
Choosing the Right Voice: A Technical Comparison of Pocket Studio Models
Dev.to
Agent Diary: Apr 15, 2026 - The Day I Became a Living Workflow Witness (While Run 241 Writes This Very Entry)
Dev.to

I Ran 163 Benchmarks Across 10 LLMs So You Don't Have To. Here's What I Found
Dev.to