MVAdapt: Zero-Shot Multi-Vehicle Adaptation for End-to-End Autonomous Driving

arXiv cs.AI / 4/15/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • E2E 自動運転モデルは評価時の固定車両に強く依存し、車両サイズ・質量・駆動系の違いによって性能が落ちる「vehicle-domain gap」が問題になると指摘しています。
  • MVAdapt は TransFuser++ のシーンエンコーダを凍結し、車両物理特性を取り込む軽量な physics encoder と cross-attention でシーン特徴を車両特性に条件付けしてウェイポイントをデコードする適応フレームワークです。
  • CARLA Leaderboard 1.0 で、単純な転移や multi-embodiment のベースラインよりも in-distribution と未見車両の両方で改善し、複数の車両での強いゼロショット転移と、厳しい物理外れ値に対する少データの few-shot キャリブレーションの両方を示しています。
  • 物理に明示的に条件付けることが、自動運転 E2E モデルの転移性向上に有効だという結論で、コードも公開されています。

Abstract

End-to-End (E2E) autonomous driving models are usually trained and evaluated with a fixed ego-vehicle, even though their driving policy is implicitly tied to vehicle dynamics. When such a model is deployed on a vehicle with different size, mass, or drivetrain characteristics, its performance can degrade substantially; we refer to this problem as the vehicle-domain gap. To address it, we propose MVAdapt, a physics-conditioned adaptation framework for multi-vehicle E2E driving. MVAdapt combines a frozen TransFuser++ scene encoder with a lightweight physics encoder and a cross-attention module that conditions scene features on vehicle properties before waypoint decoding. In the CARLA Leaderboard 1.0 benchmark, MVAdapt improves over naive transfer and multi-embodiment adaptation baselines on both in-distribution and unseen vehicles. We further show two complementary behaviors: strong zero-shot transfer on many unseen vehicles, and data-efficient few-shot calibration for severe physical outliers. These results suggest that explicitly conditioning E2E driving policies on vehicle physics is an effective step toward more transferable autonomous driving models. All codes are available at https://github.com/hae-sung-oh/MVAdapt