Learning-Based Sparsification of Dynamic Graphs in Robotic Exploration Algorithms

arXiv cs.RO / 4/21/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • Robotic exploration and dynamic path-planning methods often build growing graph structures that accumulate redundant information and degrade performance, motivating graph sparsification during runtime.
  • The paper proposes a transformer-based framework trained with Proximal Policy Optimization (PPO) to prune dynamic graphs while the robot explores.
  • In simulations using an RRT-based frontier exploration setup, the learned policy reduced graph size by up to 96%.
  • The authors report preliminary evidence that the framework can link pruning decisions to exploration outcomes even with sparse and delayed reward signals.
  • Although intelligent pruning can reduce exploration rate versus baselines, it produces the lowest variability in results, indicating more consistent exploration across different environments.

Abstract

Many robotic exploration algorithms rely on graph structures for frontier-based exploration and dynamic path planning. However, these graphs grow rapidly, accumulating redundant information and impacting performance. We present a transformer-based framework trained with Proximal Policy Optimization (PPO) to prune these graphs during exploration, limiting their growth and reducing the accumulation of excess information. The framework was evaluated on simulations of a robotic agent using Rapidly Exploring Random Trees (RRT) to carry out frontier-based exploration, where the learned policy reduces graph size by up to 96%. We find preliminary evidence that our framework learns to associate pruning decisions with exploration outcomes despite sparse, delayed reward signals. We also observe that while intelligent pruning achieves a lower rate of exploration compared to baselines, it yields the lowest standard deviation, producing the most consistent exploration across varied environments. To the best of our knowledge, these results are the first suggesting the viability of RL in sparsification of dynamic graphs used in robotic exploration algorithms.