AME-2: Agile and Generalized Legged Locomotion via Attention-Based Neural Map Encoding

arXiv cs.RO / 3/25/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • 本研究は、障害物の隠蔽(occlusion)や足場が乏しい環境でも、知覚と制御の密な統合により俊敏かつ汎用的な脚ロボット移動を実現するためのAME-2を提案している。
  • AME-2は強化学習(RL)に、注意機構つきのニューラルマップエンコーダを組み込むことで、局所・大域の地形特徴を抽出し、サリент領域に注目する「解釈可能で汎用性のある埋め込み」を制御ポリシーに与える。
  • さらに、深度観測から局所的な地形(高低)と不確実性を推定する学習ベースのマッピング・パイプラインを導入し、オドメトリと融合してノイズや隠蔽に頑健な地形表現を高速に生成する。
  • 並列シミュレーションを活用してオンライン地図作成を行い、シミュレーションから実機への移行(sim-to-real)を支援する学習手法を組み合わせている。
  • クアドロプッドおよびバイペッドの実機・シミュレーション実験で、見たことのない地形への一般化と強い俊敏性が示された。

Abstract

Achieving agile and generalized legged locomotion across terrains requires tight integration of perception and control, especially under occlusions and sparse footholds. Existing methods have demonstrated agility on parkour courses but often rely on end-to-end sensorimotor models with limited generalization and interpretability. By contrast, methods targeting generalized locomotion typically exhibit limited agility and struggle with visual occlusions. We introduce AME-2, a unified reinforcement learning (RL) framework for agile and generalized locomotion that incorporates a novel attention-based map encoder in the control policy. This encoder extracts local and global mapping features and uses attention mechanisms to focus on salient regions, producing an interpretable and generalized embedding for RL-based control. We further propose a learning-based mapping pipeline that provides fast, uncertainty-aware terrain representations robust to noise and occlusions, serving as policy inputs. It uses neural networks to convert depth observations into local elevations with uncertainties, and fuses them with odometry. The pipeline also integrates with parallel simulation so that we can train controllers with online mapping, aiding sim-to-real transfer. We validate AME-2 with the proposed mapping pipeline on a quadruped and a biped robot, and the resulting controllers demonstrate strong agility and generalization to unseen terrains in simulation and in real-world experiments.