Dynamic Neural Potential Field: Online Trajectory Optimization in the Presence of Moving Obstacles

arXiv cs.RO / 3/26/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper introduces Dynamic Neural Potential Field (NPField-GPT), a learning-enhanced MPC framework for real-time, safe robot trajectory optimization in human environments with moving obstacles.
NPField-GPT couples a Transformer-based predictor of footprint-aware repulsive potentials with a classical sequential quadratic MPC solver (via L4CasADi), enabling differentiable potential forecasts over a planning horizon.
Experiments in dynamic indoor benchmarks (BenchMR) and on a Husky UGV in office corridors show NPField-GPT generates more efficient and safer trajectories when obstacle motion changes.
The authors compare against baselines that treat dynamic scenes as static-map sequences (NPField-StaticMLP) or predict potential sequences with an MLP (NPField-DynamicMLP), noting latency trade-offs where simpler models can be faster.
Code and trained models are released publicly, and results are also contrasted with CIAO* and MPPI baselines, highlighting a Transformer+MPC “synergy” that preserves model-based planning stability while learning collision-risk spatiotemporal patterns.

Abstract

Generalist robot policies must operate safely and reliably in everyday human environments such as homes, offices, and warehouses, where people and objects move unpredictably. We present Dynamic Neural Potential Field (NPField-GPT), a learning-enhanced model predictive control (MPC) framework that couples classical optimization with a Transformer-based predictor of footprint-aware repulsive potentials. Given an occupancy sub-map, robot footprint, and optional dynamic-obstacle cues, our NPField-GPT model forecasts a horizon of differentiable potentials that are injected into a sequential quadratic MPC program via L4CasADi, yielding real-time, constraint-aware trajectory optimization. We additionally study two baselines: NPField-StaticMLP, where a dynamic scene is treated as a sequence of static maps; and NPField-DynamicMLP, which predicts the future potential sequence in parallel with an MLP. In dynamic indoor scenarios from BenchMR and on a Husky UGV in office corridors, NPField-GPT produces more efficient and safer trajectories under motion changes, while StaticMLP/DynamicMLP offer lower latency. We also compare with the CIAO* and MPPI baselines. Across methods, the Transformer+MPC synergy preserves the transparency and stability of model-based planning while learning only the part that benefits from data: spatiotemporal collision risk. Code and trained models are available at https://github.com/CognitiveAISystems/Dynamic-Neural-Potential-Field

Speaking of VoxtralResearchVoxtral TTS: A frontier, open-weights text-to-speech model that’s fast, instantly adaptable, and produces lifelike speech for voice agents.

Mistral AI Blog

Anyone who has any common sense knows that AI agents in marketing just don’t exist.

Dev.to

How to Use MiMo V2 API for Free in 2026: Complete Guide

Dev.to

The Agent Memory Problem Nobody Solves: A Practical Architecture for Persistent Context

Dev.to

From Chaos to Compliance: AI Automation for the Mobile Kitchen

Dev.to

Dynamic Neural Potential Field: Online Trajectory Optimization in the Presence of Moving Obstacles

Key Points

Abstract

Related Articles

Speaking of VoxtralResearchVoxtral TTS: A frontier, open-weights text-to-speech model that’s fast, instantly adaptable, and produces lifelike speech for voice agents.

Anyone who has any common sense knows that AI agents in marketing just don’t exist.

How to Use MiMo V2 API for Free in 2026: Complete Guide

The Agent Memory Problem Nobody Solves: A Practical Architecture for Persistent Context

From Chaos to Compliance: AI Automation for the Mobile Kitchen

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer