Learning-Based Strategy for Composite Robot Assembly Skill Adaptation

arXiv cs.RO / 4/9/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses challenges in contact-rich industrial robot tasks like peg-in-hole assembly caused by geometric tolerances, friction variability, and uncertain contact dynamics, especially with position-controlled arms.
  • It proposes a reusable, encapsulated, skill-based strategy that models assembly as composite skills with explicit pre-, post-, and invariant conditions to support modularity and consistent execution semantics.
  • Adaptation is performed via Residual Reinforcement Learning (RRL), which confines learning to residual refinements inside each skill while keeping the overall skill structure and control flow invariant.
  • The approach is evaluated in MuJoCo on a UR5e robot with a Robotiq gripper, using SAC with JAX, showing robust skill execution across variations.
  • The authors argue the method improves safety and sample efficiency by limiting where and how the policy can change during contact interactions, making it promising for industrial automation.

Abstract

Contact-rich robotic skills remain challenging for industrial robots due to tight geometric tolerances, frictional variability, and uncertain contact dynamics, particularly when using position-controlled manipulators. This paper presents a reusable and encapsulated skill-based strategy for peg-in-hole assembly, in which adaptation is achieved through Residual Reinforcement Learning (RRL). The assembly process is represented using composite skills with explicit pre-, post-, and invariant conditions, enabling modularity, reusability, and well-defined execution semantics across task variations. Safety and sample efficiency are promoted through RRL by restricting adaptation to residual refinements within each skill during contact-rich interactions, while the overall skill structure and execution flow remain invariant. The proposed approach is evaluated in MuJoCo simulation on a UR5e robot equipped with a Robotiq gripper and trained using SAC and JAX. Results demonstrate that the proposed formulation enables robust execution of assembly skills, highlighting its suitability for industrial automation.