Behavioral Score Diffusion: Model-Free Trajectory Planning via Kernel-Based Score Estimation from Data

arXiv cs.RO / 4/2/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces Behavioral Score Diffusion (BSD), a diffusion-based trajectory planner that is training-free and model-free by estimating the diffusion score directly from a pre-collected trajectory dataset.
  • BSD performs multi-scale, kernel-weighted trajectory retrieval using three factors—diffusion proximity, state context, and goal relevance—and applies Nadaraya-Watson estimation at each denoising step to produce denoised trajectories.
  • The diffusion noise schedule is used to control kernel bandwidths, yielding a coarse-to-fine regression behavior that captures global-to-local nonlinear dynamics without dynamics model linearization or parametric assumptions.
  • Safety is maintained through shielded rollouts on the kernel-estimated state trajectories, using an approach aligned with existing model-based safety mechanisms.
  • On four robotic parking tasks spanning 3D to 6D state spaces, BSD with fixed bandwidth reaches about 98.5% of a model-based baseline average reward using only 1,000 trajectories and significantly improves over nearest-neighbor retrieval, highlighting the importance of diffusion denoising.

Abstract

Diffusion-based trajectory optimization has emerged as a powerful planning paradigm, but existing methods require either learned score networks trained on large datasets or analytical dynamics models for score computation. We introduce \emph{Behavioral Score Diffusion} (BSD), a training-free and model-free trajectory planner that computes the diffusion score function directly from a library of trajectory data via kernel-weighted estimation. At each denoising step, BSD retrieves relevant trajectories using a triple-kernel weighting scheme -- diffusion proximity, state context, and goal relevance -- and computes a Nadaraya-Watson estimate of the denoised trajectory. The diffusion noise schedule naturally controls kernel bandwidths, creating a multi-scale nonparametric regression: broad averaging of global behavioral patterns at high noise, fine-grained local interpolation at low noise. This coarse-to-fine structure handles nonlinear dynamics without linearization or parametric assumptions. Safety is preserved by applying shielded rollout on kernel-estimated state trajectories, identical to existing model-based approaches. We evaluate BSD on four robotic systems of increasing complexity (3D--6D state spaces) in a parking scenario. BSD with fixed bandwidth achieves 98.5\% of the model-based baseline's average reward across systems while requiring no dynamics model, using only 1{,}000 pre-collected trajectories. BSD substantially outperforms nearest-neighbor retrieval (18--63\% improvement), confirming that the diffusion denoising mechanism is essential for effective data-driven planning.