Graph2Counsel: Clinically Grounded Synthetic Counseling Dialogue Generation from Client Psychological Graphs

arXiv cs.CL / 4/23/2026

📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

Key Points

  • The paper proposes Graph2Counsel, a framework to generate synthetic mental-health counseling dialogues for LLMs using Client Psychological Graphs (CPGs) that capture relationships among clients’ thoughts, emotions, and behaviors.
  • It addresses limitations of prior synthetic-data approaches by using structured prompting tied to counselor strategies and explicit psychological structure, aiming to improve psychological consistency and realism.
  • Graph2Counsel generates 760 counseling sessions from 76 CPGs across diverse client profiles, and expert evaluations report improvements over prior datasets in specificity, competence, authenticity, conversational flow, and safety.
  • The authors also evaluate prompting methods such as Chain-of-Thought (CoT) and Multi-Agent Feedback, and demonstrate that fine-tuning an open-source model on the dataset improves results on CounselingBench and CounselBench.
  • The code and dataset are released publicly, enabling downstream researchers and practitioners to build and benchmark safer, more realistic counseling dialogue systems.

Abstract

Rising demand for mental health support has increased interest in using Large Language Models (LLMs) for counseling. However, adapting LLMs to this high-risk safety-critical domain is hindered by the scarcity of real-world counseling data due to privacy constraints. Synthetic datasets provide a promising alternative, but existing approaches often rely on unstructured or semi-structured text inputs and overlook structural dependencies between a client's cognitive, emotional, and behavioral states, often producing psychologically inconsistent interactions and reducing data realism and quality. We introduce Graph2Counsel, a framework for generating synthetic counseling sessions grounded in Client Psychological Graphs (CPGs) that encode relationships among clients' thoughts, emotions, and behaviors. Graph2Counsel employs a structured prompting pipeline guided by counselor strategies and CPG, and explores prompting strategies including CoT (Wei et al., 2022) and Multi-Agent Feedback (Li et al., 2025a). Graph2Counsel produces 760 sessions from 76 CPGs across diverse client profiles. In expert evaluation, our dataset outperforms prior datasets on specificity, counselor competence, authenticity, conversational flow, and safety, with substantial inter-annotator agreement (Krippendorff's \alpha = 0.70). Fine-tuning an open-source model on this dataset improves performance on CounselingBench (Nguyen et al., 2025) and CounselBench (Li et al., 2025b), showing downstream utility. We also make our code and data public.