ScoRe-Flow: Complete Distributional Control via Score-Based Reinforcement Learning for Flow Matching
arXiv cs.RO / 4/14/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces ScoRe-Flow, a score-based reinforcement learning approach for fine-tuning flow matching (FM) policies used in robotic control.
- It addresses a key limitation of prior FM RL methods by modulating the SDE drift using the score (the gradient of log-density) to steer exploration toward high-probability regions and improve stability.
- ScoRe-Flow leverages a closed-form score computation from the velocity field, avoiding auxiliary networks, and additionally predicts variance to decouple control over the mean and variance of stochastic transitions.
- Experimental results show 2.4× faster convergence versus prior flow-based state-of-the-art on D4RL locomotion tasks.
- The method also reports up to 5.4% higher success rates on Robomimic and Franka Kitchen manipulation benchmarks.
Related Articles

Black Hat Asia
AI Business

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Don't forget, there is more than forgetting: new metrics for Continual Learning
Dev.to

Microsoft MAI-Image-2-Efficient Review 2026: The AI Image Model Built for Production Scale
Dev.to
Bit of a strange question?
Reddit r/artificial