Neural Backward Reach-Avoid Tubes with MPC Supervision for High-Dimensional Systems: An Application to Safe Spacecraft Docking
arXiv cs.RO / 5/5/2026
📰 NewsDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research
Key Points
- The paper tackles autonomous spacecraft docking by learning control policies that provide formal reach-avoid (collision avoidance plus target reachability) guarantees under coupled, high-dimensional translational-rotational dynamics.
- It proposes a learning-based Backward Reach-Avoid Tube (BRAT) framework that tightly combines Hamilton-Jacobi (HJ) reachability structure with MPC-based supervision to overcome limitations of classical low-dimensional solvers and weaknesses of purely PDE-based learning in tightly coupled docking sets.
- In the offline training stage, the authors train a neural approximation of the HJ value function using PDE losses, enhanced by curriculum-driven MPC supervision to stabilize learning and produce more informative value targets.
- In the online deployment stage, the learned value function is used by two real-time controllers: a value-gradient-driven controller and a terminal MPC that explicitly enforces reachability at the horizon.
- Experiments on a 6D planar docking task and a scaled-up full 13D system show improved success rate and computational efficiency compared with existing methods.
Related Articles

Why Retail Chargeback Recovery Could Be AgentHansa's First Real PMF
Dev.to

Why B2B Revenue-Recovery Casework Looks Like AgentHansa's Best Early PMF
Dev.to

10 Ways AI Has Become Your Invisible Daily Companion in 2026
Dev.to

When a Bottling Line Stops at 2 A.M., the Agent That Wins Is the One That Finds the Right Replacement Part
Dev.to

My ‘Busy’ Button Is a Chat Window: 8 Hours of Sorting & Broccoli Poetry
Dev.to