M2HRI: An LLM-Driven Multimodal Multi-Agent Framework for Personalized Human-Robot Interaction

arXiv cs.RO / 4/15/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces M2HRI, an LLM-driven multimodal multi-agent framework for multi-robot human-robot interaction that treats robots as distinct individuals rather than functionally identical units.
  • It equips each robot with LLM-based personality traits and long-term memory, and uses a coordination mechanism conditioned on those individual differences.
  • In a controlled user study with 105 participants, LLM-derived personality traits were found to be distinguishable and to improve interaction quality.
  • The study also shows that long-term memory boosts personalization and users’ preference awareness, while centralized coordination reduces behavioral overlap and improves overall interaction quality.
  • The results conclude that both agent individuality and structured coordination are key to coherent and socially appropriate multi-agent HRI.

Abstract

Multi-robot systems hold significant promise for social environments such as homes and hospitals, yet existing multi-robot works treat robots as functionally identical, overlooking how robots individual identity shape user perception and how coordination shapes multi-robot behavior when such individuality is present. To address this, we introduce M2HRI, a multimodal multi-agent framework built on large language models that equips each robot with distinct personality and long-term memory, alongside a coordination mechanism conditioned on these differences. In a controlled user study (n = 105) in a multi-agent human-robot interaction (HRI) scenario, we find that LLM-driven personality traits are significantly distinguishable and enhance interaction quality, long-term memory improves personalization and preference awareness, and centralized coordination significantly reduces overlap while improving overall interaction quality. Together, these results demonstrate that both agent individuality and structured coordination are essential for coherent and socially appropriate multi-agent HRI. Project website and code are available at https://project-m2hri.github.io/.