No More Marching: Learning Humanoid Locomotion for Short-Range SE(2) Targets
arXiv cs.RO / 4/23/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper addresses how humanoid robots can efficiently reach short-range target poses in the SE(2) space with transitions that are fast, robust, and energy efficient.
- It criticizes prior learning-based locomotion methods for focusing on velocity tracking rather than direct pose reaching, which can cause inefficient “marching” behavior in short tasks.
- The authors propose a reinforcement learning framework with a new constellation-based reward function that explicitly optimizes motion toward SE(2) target poses.
- They introduce a benchmarking setup that evaluates energy use, time-to-target, and footstep count across a range of SE(2) goals.
- Experiments indicate the method outperforms standard baselines and transfers successfully from simulation to real hardware, underscoring the value of task-targeted reward design.
Related Articles

Just what the doctor ordered: how AI could help China bridge the medical resources gap
SCMP Tech
Why don't Automatic speech Recognition models use prompting? [D]
Reddit r/MachineLearning

Automating Advanced Customization in Your Music Studio
Dev.to

CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos
Dev.to

My AI Agent Over-Corrected Itself — So I Built Metabolic Regulation
Dev.to