Refining Compositional Diffusion for Reliable Long-Horizon Planning

arXiv cs.RO / 5/6/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • Compositional diffusion planning can produce long-horizon trajectories by stitching short-horizon segments, but it often fails under multimodal local distributions due to mode-averaging that yields infeasible or incoherent plans.
  • The paper introduces Refining Compositional Diffusion (RCD), a training-free guidance approach that steers compositional sampling toward globally coherent, high-density trajectories.
  • RCD uses a pretrained diffusion model’s self-reconstruction error as a proxy for the log-density of composed plans and adds an overlap consistency term to enforce agreement at segment boundaries.
  • Experiments on difficult long-horizon benchmarks from OGBench (locomotion, object manipulation, and pixel-based observations) show that RCD outperforms existing compositional methods and reduces mode-averaging effects.

Abstract

Compositional diffusion planning generates long-horizon trajectories by stitching together overlapping short-horizon segments through score composition. However, when local plan distributions are multimodal, existing compositional methods suffer from mode-averaging, where averaging incompatible local modes leads to plans that are neither locally feasible nor globally coherent. We propose Refining Compositional Diffusion (RCD), a training-free guidance method that steers compositional sampling toward high-density, globally coherent plans. RCD leverages the self-reconstruction error of a pretrained diffusion model as a proxy for the log-density of composed plans, combined with an overlap consistency term that enforces consistency at segment boundaries. We show that the combined guidance concentrates sampling on high-density plans that mitigate mode-averaging. Experiments on challenging long-horizon tasks from OGBench, including locomotion, object manipulation, and pixel-based observations, demonstrate that RCD consistently outperforms existing methods.