E2E-Fly: An Integrated Training-to-Deployment System for End-to-End Quadrotor Autonomy

arXiv cs.RO / 4/15/2026

📰 NewsSignals & Early TrendsModels & Research

Key Points

  • 本論文は、シミュレーションで学習したquadrotor用のエンドツーエンド方策を現実へ転移する際の、レンダリング効率・物理モデリング誤差・センサ差・統合基盤不足といった課題に対処するための統合フレームワークE2E-Flyを提案しています。
  • E2E-Flyは高性能シミュレータ(強化学習と微分可能物理学学習に対応)と、一般的なタスク向けに設計された報酬設計を含むフルスタックの学習・検証ワークフローを備えています。
  • 検証は2段階(sim-to-sim転移→hardware-in-the-loop)で行い、さらに現実デプロイではシステム同定・ドメインランダム化・遅延補償・ノイズモデリングによるsim-to-real整合を実施します。
  • 実験では6つのエンドツーエンド制御タスクの学習と、2つの実機quadrotorへの実デプロイで有効性を示しています。

Abstract

Training and transferring learning-based policies for quadrotors from simulation to reality remains challenging due to inefficient visual rendering, physical modeling inaccuracies, unmodeled sensor discrepancies, and the absence of a unified platform integrating differentiable physics learning into end-to-end training. While recent work has demonstrated various end-to-end quadrotor control tasks, few systems provide a systematic, zero-shot transfer pipeline, hindering reproducibility and real-world deployment. To bridge this gap, we introduce E2E-Fly, an integrated framework featuring an agile quadrotor platform coupled with a full-stack training, validation, and deployment workflow. The training framework incorporates a high-performance simulator with support for differentiable physics learning and reinforcement learning, alongside structured reward design tailored to common quadrotor tasks. We further introduce a two-stage validation strategy using sim-to-sim transfer and hardware-in-the-loop testing, and deploy policies onto two physical quadrotor platforms via a dedicated low-level control interface and a comprehensive sim-to-real alignment methodology, encompassing system identification, domain randomization, latency compensation, and noise modeling. To the best of our knowledge, this is the first work to systematically unify differentiable physical learning with training, validation, and real-world deployment for quadrotors. Finally, we demonstrate the effectiveness of our framework for training six end-to-end control tasks and deploy them in the real world.