PSY-STEP: Structuring Therapeutic Targets and Action Sequences for Proactive Counseling Dialogue Systems

arXiv cs.AI / 4/7/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces PSY-STEP (STEP), a dataset designed to represent CBT counseling by pairing automatic thoughts with fine-grained action-level dialogue sequences for therapeutic targeting.
  • It presents STEPPER, an agent trained on PSY-STEP to proactively elicit automatic thoughts and perform cognitively grounded interventions during counseling dialogues.
  • The authors further improve STEPPER using preference learning from simulated, synthesized counseling sessions to enhance both decision accuracy and empathetic responsiveness.
  • Evaluation results on CBT-aligned benchmarks indicate STEPPER produces more clinically grounded, coherent, and personalized counseling than strong baseline models, improving counselor competence without causing emotional disruption.

Abstract

Cognitive Behavioral Therapy (CBT) aims to identify and restructure automatic negative thoughts pertaining to involuntary interpretations of events, yet existing counseling agents struggle to identify and address them in dialogue settings. To bridge this gap, we introduce STEP, a dataset that models CBT counseling by explicitly reflecting automatic thoughts alongside dynamic, action-level counseling sequences. Using this dataset, we train STEPPER, a counseling agent that proactively elicits automatic thoughts and executes cognitively grounded interventions. To further enhance both decision accuracy and empathic responsiveness, we refine STEPPER through preference learning based on simulated, synthesized counseling sessions. Extensive CBT-aligned evaluations show that STEPPER delivers more clinically grounded, coherent, and personalized counseling compared to other strong baseline models, and achieves higher counselor competence without inducing emotional disruption.