Unified Generation-Refinement Planning: Bridging Guided Flow Matching and Sampling-Based MPC for Social Navigation

arXiv cs.RO / 3/24/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses robust robot planning in human-centric dynamic environments by unifying a learning-based trajectory generator with an optimization-based controller under safety and real-time constraints.
  • It proposes a bidirectional loop where reward-guided conditional flow matching (CFM) generates diverse trajectory priors for model predictive path integral (MPPI) refinement, and the resulting MPPI plans warm-start subsequent CFM generation.
  • Using autonomous social navigation as the main application, the authors report improved trade-offs among safety, task performance, and computation time while maintaining real-time adaptability.
  • The work is framed as a way to mitigate common weaknesses of optimization planners (initialization sensitivity in dynamic settings) and learning-based planners (less reliable constraint satisfaction).

Abstract

Robust robot planning in dynamic, human-centric environments remains challenging due to multimodal uncertainty, the need for real-time adaptation, and safety requirements. Optimization-based planners enable explicit constraint handling but can be sensitive to initialization and struggle in dynamic settings. Learning-based planners capture multimodal solution spaces more naturally, but often lack reliable constraint satisfaction. In this paper, we introduce a unified generation-refinement framework that combines reward-guided conditional flow matching (CFM) with model predictive path integral (MPPI) control. Our key idea is a bidirectional information exchange between generation and optimization: reward-guided CFM produces diverse, informed trajectory priors for MPPI refinement, while the optimized MPPI trajectory warm-starts the next CFM generation step. Using autonomous social navigation as a motivating application, we demonstrate that the proposed approach improves the trade-off between safety, task performance, and computation time, while adapting to dynamic environments in real-time. The source code is publicly available at https://cfm-mppi.github.io.