EBuddy: a workflow orchestrator for industrial human-machine collaboration

arXiv cs.RO / 3/31/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisTools & Practical UsageModels & Research

Key Points

  • EBuddy is presented as a voice-guided workflow orchestrator designed to support scalable, natural human–machine collaboration in industrial, tool-intensive tasks.
  • The system converts expert practice into a finite state machine (FSM) that constrains interpretation and execution using an interpretable runtime decision frame (current state and admissible actions).
  • EBuddy coordinates heterogeneous resources such as GUI-driven software and collaborative robots, using fully voice-based interaction via automatic speech recognition and intent understanding.
  • In an industrial pilot for impeller blade inspection and repair preparation for directed energy deposition (DED), human–robot collaboration reportedly reduced end-to-end process duration while maintaining repeatability and reducing operator burden.
  • The paper emphasizes modular workflow artifacts as a mechanism to prevent quality degradation caused by ad hoc procedure reconstruction across operators and sessions.

Abstract

This paper presents EBuddy, a voice-guided workflow orchestrator for natural human-machine collaboration in industrial environments. EBuddy targets a recurrent bottleneck in tool-intensive workflows: expert know-how is effective but difficult to scale, and execution quality degrades when procedures are reconstructed ad hoc across operators and sessions. EBuddy operationalizes expert practice as a finite state machine (FSM) driven application that provides an interpretable decision frame at runtime (current state and admissible actions), so that spoken requests are interpreted within state-grounded constraints, while the system executes and monitors the corresponding tool interactions. Through modular workflow artifacts, EBuddy coordinates heterogeneous resources, including GUI-driven software and a collaborative robot, leveraging fully voice-based interaction through automatic speech recognition and intent understanding. An industrial pilot on impeller blade inspection and repair preparation for directed energy deposition (DED), realized by human-robot collaboration, shows substantial reductions in end-to-end process duration across onboarding, 3D scanning and processing, and repair program generation, while preserving repeatability and low operator burden.