ROSClaw: An OpenClaw ROS 2 Framework for Agentic Robot Control and Interaction

arXiv cs.RO / 3/31/2026

📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces ROSClaw, a model-agnostic executive layer that connects foundation models to ROS 2 robots while decoupling reasoning/LLM backends from robot integration details.
  • ROSClaw provides dynamic capability discovery with standardized affordance injection, multimodal observation normalization, and structured audit logging so that different models or robots can be swapped via configuration.
  • It adds configurable pre-execution action validation within a safety envelope to reduce out-of-policy actions before commands are executed on physical hardware.
  • The authors deploy ROSClaw on three robot types (wheeled, quadruped, humanoid) with four foundation-model backends and find large backend-dependent differences in out-of-policy action proposal rates and distinct physical behaviors.
  • A cross-framework parity protocol against ROSA suggests the executive-layer design materially affects task completion and safety beyond prompt wording, positioning ROSClaw as both infrastructure and a reproducible embodied-AI measurement tool.

Abstract

Foundation models can endow robots with open-ended reasoning, language understanding, and adaptive planning, yet connecting a model to a physical robot today requires bespoke integration that couples perception, actuation, and safety to a single model and platform. We present ROSClaw, a model-agnostic executive layer that integrates the OpenClaw agent runtime with ROS 2, enabling any foundation model to perceive, reason about, and act on any ROS-enabled robot through (i) dynamic capability discovery with standardized affordance injection, (ii) multimodal observation normalization, (iii) pre-execution action validation within a configurable safety envelope, and (iv) structured audit logging. Swapping model backends or robot platforms is a configuration change; tool schemas, safety enforcement, and provenance logging remain invariant. We deploy ROSClaw on three platforms (wheeled, quadruped, humanoid) with four foundation-model backends. Under this controlled substrate, models exhibit up to 4.8 x differences in out-of-policy action proposal rates (3.4 x among frontier models alone) and produce qualitatively distinct physical behaviors from identical commands. A cross-framework parity protocol against ROSA confirms that executive-layer design, not just prompt wording, significantly affects both task completion and safety behavior, establishing ROSClaw as both practical agentic-robot infrastructure and a reproducible measurement instrument for embodied AI.