Adaptor: Advancing Assistive Teleoperation with Few-Shot Learning and Cross-Operator Generalization

arXiv cs.RO / 4/13/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

共有:

Key Points

Adaptorは、支援的テレオペレーションにおける操作者ごとの癖や熟練度の違いによって起きる軌道分布の不均一性を背景に、クロスオペレータな意図認識の安定化を狙ったfew-shotフレームワークである。
まず前処理段階でノイズ注入による軌道摂動を合成して意図の不確実性をモデル化し、さらにジオメトリに配慮したキーフレーム抽出を行う。
次に政策学習段階では、処理済み軌道をIntention Expertで符号化し、事前学習済みの視覚言語モデルの文脈と融合してAction Expertの行動生成を条件付ける。
実環境およびシミュレーションのベンチマーク実験で、成功率と効率の双方でベースラインを上回り、操作者の熟練度が異なっても分散が小さい（頑健なクロスオペレータ汎化）ことを示している。

Abstract

Assistive teleoperation enhances efficiency via shared control, yet inter-operator variability, stemming from diverse habits and expertise, induces highly heterogeneous trajectory distributions that undermine intent recognition stability. We present Adaptor, a few-shot framework for robust cross-operator intent recognition. The Adaptor bridges the domain gap through two stages: (i) preprocessing, which models intent uncertainty by synthesizing trajectory perturbations via noise injection and performs geometry-aware keyframe extraction; and (ii) policy learning, which encodes the processed trajectories with an Intention Expert and fuses them with the pre-trained vision-language model context to condition an Action Expert for action generation. Experiments on real-world and simulated benchmarks demonstrate that Adaptor achieves state-of-the-art performance, improving success rates and efficiency over baselines. Moreover, the method exhibits low variance across operators with varying expertise, demonstrating robust cross-operator generalization.