A Survey on LLM-based Conversational User Simulation

arXiv cs.CL / 4/29/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper surveys recent advances in LLM-based conversational user simulation, highlighting how large language models enable higher-fidelity synthetic dialogue.
  • It proposes a new taxonomy that organizes simulation approaches by user granularity and by simulation objectives.
  • The authors systematically review core methods used in this area and the evaluation methodologies for measuring simulation quality.
  • The work also identifies open challenges to guide future research and provides a unified framework for organizing prior studies.
  • The survey is intended to keep the research community up to date on the latest developments in conversational user simulation.

Abstract

User simulation has long played a vital role in computer science due to its potential to support a wide range of applications. Language, as the primary medium of human communication, forms the foundation of social interaction and behavior. Consequently, simulating conversational behavior has become a key area of study. Recent advancements in large language models (LLMs) have significantly catalyzed progress in this domain by enabling high-fidelity generation of synthetic user conversation. In this paper, we survey recent advancements in LLM-based conversational user simulation. We introduce a novel taxonomy covering user granularity and simulation objectives. Additionally, we systematically analyze core techniques and evaluation methodologies. We aim to keep the research community informed of the latest advancements in conversational user simulation and to further facilitate future research by identifying open challenges and organizing existing work under a unified framework.