CHORUS: An Agentic Framework for Generating Realistic Deliberation Data

arXiv cs.AI / 4/23/2026

📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

Key Points

  • The paper introduces CHORUS, an agentic framework that uses LLM-powered actors with behaviorally consistent personas to generate realistic deliberation discussions.
  • It addresses scarcity of large-scale deliberation data by combining memory-driven agents with a Poisson-process-based timing model to mimic heterogeneous user engagement.
  • The framework supports structured tool use so actors can consult external resources, improving realism and enabling integration with interactive web platforms.
  • CHORUS was deployed on the Deliberate platform and evaluated by 30 expert participants, scoring positively on content realism, discussion coherence, and analytical utility.
  • Overall, the results suggest CHORUS can produce high-quality deliberation datasets suitable for online discourse analysis despite access and data-quality constraints elsewhere.

Abstract

Understanding the intricate dynamics of online discourse depends on large-scale deliberation data, a resource that remains scarce across interactive web platforms due to restrictive accessibility policies, ethical concerns and inconsistent data quality. In this paper, we propose Chorus, an agentic framework, which orchestrates LLM-powered actors with behaviorally consistent personas to generate realistic deliberation discussions. Each actor is governed by an autonomous agent equipped with memory of the evolving discussion, while participation timing is governed by a principled Poisson process-based temporal model, which approximates the heterogeneous engagement patterns of real users. The framework is further supported by structured tool usage, enabling actors to access external resources and facilitating integration with interactive web platforms. The framework was deployed on the \textsc{Deliberate} platform and evaluated by 30 expert participants across three dimensions: content realism, discussion coherence and analytical utility, confirming Chorus as a practical tool for generating high-quality deliberation data suitable for online discourse analysis