Sumo: Dynamic and Generalizable Whole-Body Loco-Manipulation

arXiv cs.RO / 4/10/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces a sim-to-real framework for legged robots to dynamically perform whole-body manipulation of large, heavy objects by steering a pre-trained whole-body control policy at test time with a sample-based planner.
  • It claims strong generalization across multiple dynamic loco-manipulation tasks and object types without additional tuning or retraining, with further performance improvements possible by adjusting the planner’s cost function during testing.
  • Experiments on a Spot quadruped in real-world settings include uprighting a tire heavier than the robot’s nominal lifting capacity and dragging a barrier larger and taller than the robot.
  • The approach is also demonstrated in simulation for humanoid-style loco-manipulation tasks such as opening a door and pushing a table.
  • The authors provide project code and video materials via the project website to support reproduction and further evaluation.

Abstract

This paper presents a sim-to-real approach that enables legged robots to dynamically manipulate large and heavy objects with whole-body dexterity. Our key insight is that by performing test-time steering of a pre-trained whole-body control policy with a sample-based planner, we can enable these robots to solve a variety of dynamic loco-manipulation tasks. Interestingly, we find our method generalizes to a diverse set of objects and tasks with no additional tuning or training, and can be further enhanced by flexibly adjusting the cost function at test time. We demonstrate the capabilities of our approach through a variety of challenging loco-manipulation tasks on a Spot quadruped robot in the real world, including uprighting a tire heavier than the robot's nominal lifting capacity and dragging a crowd-control barrier larger and taller than the robot itself. Additionally, we show that the same approach can be generalized to humanoid loco-manipulation tasks, such as opening a door and pushing a table, in simulation. Project code and videos are available at \href{https://sumo.rai-inst.com/}{https://sumo.rai-inst.com/}.