SMP: Reusable Score-Matching Motion Priors for Physics-Based Character Control
arXiv cs.RO / 4/28/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces Score-Matching Motion Priors (SMP), a method for learning reusable, task-agnostic motion reward priors for physics-based character control.
- Unlike prior adversarial imitation learning approaches that typically require retraining per controller and retaining reference motion data, SMP can be pre-trained once on motion data and then reused without changing the model.
- SMP leverages pre-trained motion diffusion models and score distillation sampling (SDS) to create reward functions that can remain frozen while training new control policies for downstream tasks.
- Experiments on physically simulated humanoids show SMP can be adapted into style-specific priors from a general large-scale motion prior and can even compose multiple styles to generate new ones not present in the original dataset.
- The authors report that the motions produced by SMP are competitive with state-of-the-art adversarial imitation learning methods across a broad set of control tasks.


