AI Navigate

Accurate Open-Loop Control of a Soft Continuum Robot Through Visually Learned Latent Representations

arXiv cs.RO / 3/23/2026

📰 NewsTools & Practical UsageModels & Research

Key Points

  • The paper introduces open-loop control of a soft continuum robot using video-learned latent dynamics via Visual Oscillator Networks (VONs) and an attention broadcast decoder (ABCD) to produce interpretable 2D latents.
  • It performs single-shooting optimal control in latent space to track image-specified waypoints without camera feedback, using an interactive SCR live simulator to map targets to latent waypoints.
  • Among evaluated dynamics, ABCD-based models reduce image-space tracking error with the VON-ABCD Koopman variant achieving the lowest MSEs.
  • An ablation study and simulation stress tests show stable static holding, extrapolated equilibria, and relaxation to rest state, marking the first demonstration that interpretable, video-learned latent dynamics enable reliable long-horizon open-loop control of an SCR.

Abstract

This work addresses open-loop control of a soft continuum robot (SCR) from video-learned latent dynamics. Visual Oscillator Networks (VONs) from previous work are used, that provide mechanistically interpretable 2D oscillator latents through an attention broadcast decoder (ABCD). Open-loop, single-shooting optimal control is performed in latent space to track image-specified waypoints without camera feedback. An interactive SCR live simulator enables design of static, dynamic, and extrapolated targets and maps them to model-specific latent waypoints. On a two-segment pneumatic SCR, Koopman, MLP, and oscillator dynamics, each with and without ABCD, are evaluated on setpoint and dynamic trajectories. ABCD-based models consistently reduce image-space tracking error. The VON and ABCD-based Koopman models attains the lowest MSEs. Using an ablation study, we demonstrate that several architecture choices and training settings contribute to the open-loop control performance. Simulation stress tests further confirm static holding, stable extrapolated equilibria, and plausible relaxation to the rest state. To the best of our knowledge, this is the first demonstration that interpretable, video-learned latent dynamics enable reliable long-horizon open-loop control of an SCR.