AI-Enabled Image-Based Hybrid Vision/Force Control of Tendon-Driven Aerial Continuum Manipulators

arXiv cs.RO / 4/22/2026

📰 NewsDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes an AI-enabled cascaded hybrid vision/force control framework for tendon-driven aerial continuum manipulators, using constant-strain modeling in SE(3) to couple motion and sensing.
  • It stabilizes image feature error during contact by combining fast fixed-time sliding mode control with a radial basis function neural network to handle uncertainties from a monocular eye-in-hand camera and force sensors.
  • The approach performs rapid online learning of vision/force uncertainties without offline training, improving adaptability to changing conditions.
  • Visual features are extracted using a graph neural network with line-based visual servoing, avoiding heuristic geometric line extractors while simultaneously regulating both desired interaction force and image feature error.
  • Simulations and experiments benchmark the controller against rigid-arm aerial manipulation baselines, showing robust performance across different scenarios and feature-extraction strategies.

Abstract

This paper presents an AI-enabled cascaded hybrid vision/force control framework for tendon-driven aerial continuum manipulators based on constant-strain modeling in SE(3) as a coupled system. The proposed controller is designed to enable autonomous, physical interaction with a static environment while stabilizing the image feature error. The developed strategy combines the cascaded fast fixed-time sliding mode control and a radial basis function neural network to cope with the uncertainties in the image acquired by the eye-in-hand monocular camera and the measurements from the force sensing apparatus. This ensures rapid, online learning of the vision- and force-related uncertainties without requiring offline training. Furthermore, the features are extracted via a state-of-the-art graph neural network architecture employed by a visual servoing framework using line features, rather than relying on heuristic geometric line extractors, to concurrently contribute to tracking the desired normal interaction force during contact and regulating the image feature error. A comparative study benchmarks the proposed controller against established rigid-arm aerial manipulation methods, evaluating robustness across diverse scenarios and feature extraction strategies. The simulation and experimental results showcase the effectiveness of the proposed methodology under various initial conditions and demonstrate robust performance in executing manipulation tasks.