Dynamic Neural Potential Field: Online Trajectory Optimization in the Presence of Moving Obstacles

arXiv cs.RO / 3/26/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces Dynamic Neural Potential Field (NPField-GPT), a learning-enhanced MPC framework for real-time, safe robot trajectory optimization in human environments with moving obstacles.
  • NPField-GPT couples a Transformer-based predictor of footprint-aware repulsive potentials with a classical sequential quadratic MPC solver (via L4CasADi), enabling differentiable potential forecasts over a planning horizon.
  • Experiments in dynamic indoor benchmarks (BenchMR) and on a Husky UGV in office corridors show NPField-GPT generates more efficient and safer trajectories when obstacle motion changes.
  • The authors compare against baselines that treat dynamic scenes as static-map sequences (NPField-StaticMLP) or predict potential sequences with an MLP (NPField-DynamicMLP), noting latency trade-offs where simpler models can be faster.
  • Code and trained models are released publicly, and results are also contrasted with CIAO* and MPPI baselines, highlighting a Transformer+MPC “synergy” that preserves model-based planning stability while learning collision-risk spatiotemporal patterns.

Abstract

Generalist robot policies must operate safely and reliably in everyday human environments such as homes, offices, and warehouses, where people and objects move unpredictably. We present Dynamic Neural Potential Field (NPField-GPT), a learning-enhanced model predictive control (MPC) framework that couples classical optimization with a Transformer-based predictor of footprint-aware repulsive potentials. Given an occupancy sub-map, robot footprint, and optional dynamic-obstacle cues, our NPField-GPT model forecasts a horizon of differentiable potentials that are injected into a sequential quadratic MPC program via L4CasADi, yielding real-time, constraint-aware trajectory optimization. We additionally study two baselines: NPField-StaticMLP, where a dynamic scene is treated as a sequence of static maps; and NPField-DynamicMLP, which predicts the future potential sequence in parallel with an MLP. In dynamic indoor scenarios from BenchMR and on a Husky UGV in office corridors, NPField-GPT produces more efficient and safer trajectories under motion changes, while StaticMLP/DynamicMLP offer lower latency. We also compare with the CIAO* and MPPI baselines. Across methods, the Transformer+MPC synergy preserves the transparency and stability of model-based planning while learning only the part that benefits from data: spatiotemporal collision risk. Code and trained models are available at https://github.com/CognitiveAISystems/Dynamic-Neural-Potential-Field