StableTracker: Learning to Stably Track Target via Differentiable Simulation

arXiv cs.RO / 3/24/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • StableTracker は、従来手法のような手作りモジュール型パイプラインが抱える計算負荷や累積誤差、また学習ベースの“高レベル軌道推定”に起因する制御の疎結合問題を解決するための学習型制御ポリシーを提案しています。
  • differentiable simulation を用い、バックプロパゲーション・スルー・タイムで学習することで、クアッドロータは任意視点からでも目標を追従し、相対距離を保ちつつ画像内で水平・垂直の双方で目標を視野中心に維持できるとしています。
  • シミュレーションでは、既存の従来手法および学習ベースのベースラインに対して精度・安定性・一般化性能(安全距離、軌道、目標速度の変化)で優れていることを示しています。
  • 実機実験として、オンボードコンピュータ搭載のクアッドロータで提案手法の実用性を検証したと報告しています。

Abstract

Existing FPV object tracking methods heavily rely on handcrafted modular pipelines, which incur high onboard computation and cumulative errors. While learning-based approaches have mitigated computational delays, most still generate only high-level trajectories (position and yaw). This loose coupling with a separate controller sacrifices precise attitude control; consequently, even if target is localized precisely, accurate target estimation does not ensure that the body-fixed camera is consistently oriented toward the target, it still probably degrades and loses target when tracking high-maneuvering target. To address these challenges, we present StableTracker, a learning-based control policy that enables quadrotors to robustly follow a moving target from arbitrary viewpoints. The policy is trained using backpropagation-through-time via differentiable simulation, allowing the quadrotor to keep a fixed relative distance while maintaining the target at the center of the visual field in both horizontal and vertical directions, thereby functioning as an autonomous aerial camera. We compare StableTracker against state-of-the-art traditional algorithms and learning baselines. Simulation results demonstrate superior accuracy, stability, and generalization across varying safe distances, trajectories, and target velocities. Furthermore, real-world experiments on a quadrotor with an onboard computer validate the practicality of the proposed approach.