Automating Manual Tasks through Intuitive Robot Programming and Cognitive Robotics

arXiv cs.RO / 4/8/2026

💬 Opinion

Key Points

  • The paper proposes an intuitive approach to end-user robot programming that maps natural language and human gestures to robot-executable programs using LLMs and computer vision.

Abstract

This paper presents a novel concept for intuitive end-user programming of robots, inspired by natural interaction between humans. Natural language and supportive gestures are translated into robot programs using large language models (LLMs) and computer vision (CV). Through equally natural system feedback in the form of clarification questions and visual representations, the generated program can be reviewed and adjusted, thereby ensuring safety, transparency, and user acceptance.
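
The flow described in the abstract (a spoken instruction plus pointing gestures goes in, and either a robot program step or a clarification question comes out) can be illustrated with a small sketch. This is not the authors' implementation. A rule-based parser stands in for the LLM, gestures are assumed to be already resolved to object names by a vision module, and all names here (`Gesture`, `Step`, `translate`, `pick_and_place`) are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Gesture:
    """A pointing gesture, assumed already resolved by a vision module (hypothetical)."""
    target: str  # name of the object or location being pointed at


@dataclass
class Step:
    """One step of a robot program (hypothetical representation)."""
    action: str
    obj: str
    destination: str


# Words that only make sense together with a gesture.
DEICTIC_OBJECT = {"this", "that", "it"}
DEICTIC_PLACE = {"here", "there"}


def translate(utterance: str, gestures: List[Gesture]) -> Tuple[Optional[Step], Optional[str]]:
    """Return (step, None) on success, or (None, clarification_question).

    A toy stand-in for the LLM: it only understands "<put|move> <object> <place>".
    """
    words = utterance.lower().strip(" .!").split()
    if len(words) != 3 or words[0] not in {"put", "move"}:
        return None, "I did not understand. Could you rephrase the instruction?"

    _, obj, place = words
    pending = list(gestures)  # gestures are consumed in the order they occurred

    if obj in DEICTIC_OBJECT:
        if not pending:
            return None, f"Which object do you mean by '{obj}'?"
        obj = pending.pop(0).target

    if place in DEICTIC_PLACE:
        if not pending:
            return None, f"Where do you mean by '{place}'?"
        place = pending.pop(0).target

    return Step("pick_and_place", obj, place), None


if __name__ == "__main__":
    # Both deictic words are backed by a gesture: a program step is produced.
    print(translate("Put this there.", [Gesture("red_cube"), Gesture("tray")]))
    # Only one gesture was seen: the system asks back instead of guessing.
    print(translate("Put this there.", [Gesture("red_cube")]))
```

Returning a question instead of guessing mirrors the review loop the abstract describes: the user can resolve the ambiguity before anything is executed.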