RoboMatch: A Unified Mobile-Manipulation Teleoperation Platform with Auto-Matching Network Architecture for Long-Horizon Tasks

arXiv cs.RO / 3/27/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces RoboMatch, a unified teleoperation platform for mobile manipulation aimed at improving performance on long-horizon tasks in dynamic environments.
  • RoboMatch uses a cockpit-style interface to coordinate a mobile base and dual arms synchronously, improving control precision and data collection efficiency.
  • It proposes the Proprioceptive-Visual Enhanced Diffusion Policy (PVE-DP), which uses discrete wavelet transforms for multi-scale visual features and integrates high-precision end-effector IMU proprioception to boost fine manipulation.
  • The Auto-Matching Network (AMN) architecture decomposes long-horizon tasks into logical sub-sequences and dynamically routes lightweight, pre-trained models for distributed inference.
  • Reported experiments show data collection efficiency gains of 20%+, task success improvements of 20–30% with PVE-DP, and about 40% better long-horizon inference performance with AMN.

Abstract

This paper presents RoboMatch, a novel unified teleoperation platform for mobile manipulation with an auto-matching network architecture, designed to tackle long-horizon tasks in dynamic environments. Our system enhances teleoperation performance, data collection efficiency, task accuracy, and operational stability. The core of RoboMatch is a cockpit-style control interface that enables synchronous operation of the mobile base and dual arms, significantly improving control precision and data collection. Moreover, we introduce the Proprioceptive-Visual Enhanced Diffusion Policy (PVE-DP), which leverages Discrete Wavelet Transform (DWT) for multi-scale visual feature extraction and integrates high-precision IMUs at the end-effector to enrich proprioceptive feedback, substantially boosting fine manipulation performance. Furthermore, we propose an Auto-Matching Network (AMN) architecture that decomposes long-horizon tasks into logical sequences and dynamically assigns lightweight pre-trained models for distributed inference. Experimental results demonstrate that our approach improves data collection efficiency by over 20%, increases task success rates by 20-30% with PVE-DP, and enhances long-horizon inference performance by approximately 40% with AMN, offering a robust solution for complex manipulation tasks. Project website: https://robomatch.github.io