OpenRC: An Open-Source Robotic Colonoscopy Framework for Multimodal Data Acquisition and Autonomy Research

arXiv cs.RO / 4/7/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • OpenRC is an open-source robotic colonoscopy framework designed to enable reproducible closed-loop research by capturing the coupled dynamics of operator control, instrument motion, and visual feedback.
  • It retrofits conventional scopes while preserving clinical workflow and supports synchronized recording of video, operator commands, actuation state, and distal tip pose.
  • The authors validate motion consistency and quantify cross-modal latency across sensing streams to support reliable multimodal learning and autonomy experiments.
  • They collected a multimodal dataset with 1,894 teleoperated episodes (~19 hours) across 10 structured task variations, including routine navigation and failure/recovery behaviors.
  • By combining open hardware with an aligned multimodal dataset, OpenRC aims to provide a standardized foundation for research in multimodal robotic colonoscopy and surgical autonomy/VLA-style learning.

Abstract

Colorectal cancer screening critically depends on colonoscopy, yet existing platforms offer limited support for systematically studying the coupled dynamics of operator control, instrument motion, and visual feedback. This gap restricts reproducible closed-loop research in robotic colonoscopy, medical imaging, and emerging vision-language-action (VLA) learning paradigms. To address this challenge, we present OpenRC, an open-source modular robotic colonoscopy framework that retrofits conventional scopes while preserving clinical workflow. The framework supports simultaneous recording of video, operator commands, actuation state, and distal tip pose. We experimentally validated motion consistency and quantified cross-modal latency across sensing streams. Using this platform, we collected a multimodal dataset comprising 1,894 teleoperated episodes ~19 hours across 10 structured task variations of routine navigation, failure events, and recovery behaviors. By unifying open hardware and an aligned multimodal dataset, OpenRC provides a reproducible foundation for research in multimodal robotic colonoscopy and surgical autonomy.