"Who Am I, and Who Else Is Here?" Behavioral Differentiation Without Role Assignment in Multi-Agent LLM Systems

arXiv cs.AI / 4/2/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper uses a controlled multi-agent LLM experimental platform (7 heterogeneous LLMs, 208 runs, 13,786 messages) to test whether agents differentiate socially without explicit role assignment.
  • It finds heterogeneous groups develop significantly richer and more diverse behavioral differentiation than homogeneous groups, with quantified increases in differentiation metrics.
  • The study shows group behavior is context- and scaffolding-dependent: agents converge toward more uniform behavior when real model names are revealed and when prompt scaffolding is removed.
  • It observes compensatory interaction patterns when an agent crashes, indicating coordination-like dynamics emerging from multi-agent conversation.
  • Reliability is supported via multi-judge LLM coding with reported Cohen’s kappa agreement and additional human validation on a stratified sample of messages.

Abstract

When multiple large language models interact in a shared conversation, do they develop differentiated social roles or converge toward uniform behavior? We present a controlled experimental platform that orchestrates simultaneous multi-agent discussions among 7 heterogeneous LLMs on a unified inference backend, systematically varying group composition, naming conventions, and prompt structure across 12 experimental series (208 runs, 13,786 coded messages). Each message is independently coded on six behavioral flags by two LLM judges from distinct model families (Gemini 3.1 Pro and Claude Sonnet 4.6), achieving mean Cohen's kappa = 0.78 with conservative intersection-based adjudication. Human validation on 609 randomly stratified messages confirmed coding reliability (mean kappa = 0.73 vs. Gemini). We find that (1) heterogeneous groups exhibit significantly richer behavioral differentiation than homogeneous groups (cosine similarity 0.56 vs. 0.85; p < 10^-5, r = 0.70); (2) groups spontaneously exhibit compensatory response patterns when an agent crashes; (3) revealing real model names significantly increases behavioral convergence (cosine 0.56 to 0.77, p = 0.001); and (4) removing all prompt scaffolding converges profiles to homogeneous-level similarity (p < 0.001). Critically, these behaviors are absent when agents operate in isolation, confirming that behavioral diversity is a structured, reproducible phenomenon driven by the interaction of architectural heterogeneity, group context, and prompt-level scaffolding.