Uncertainty as a Planning Signal: Multi-Turn Decision Making for Goal-Oriented Conversation

arXiv cs.CL / 4/7/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses goal-oriented conversational agents that must make multi-turn decisions under uncertainty about user intent, balancing information gathering with timely commitment to targets.
  • It identifies a gap in existing methods: schema-based structured planning can be rigid, while LLM-based approaches often struggle with long-horizon coordination between probing and committing.
  • The authors propose the Conversation Uncertainty-aware Planning (CUP) framework, combining a language model that proposes actions with a structured planner that evaluates long-term effects on uncertainty reduction.
  • Experiments on multiple conversational benchmarks report that CUP improves success rates and can do so with fewer interaction turns than prior approaches.
  • Additional analysis suggests the uncertainty-aware planning leads to more efficient information acquisition and earlier, more confident commitment to goals.

Abstract

Goal-oriented conversational systems require making sequential decisions under uncertainty about the user's intent, where the algorithm must balance information acquisition and target commitment over multiple turns. Existing approaches address this challenge from different perspectives: structured methods enable multi-step planning but rely on predefined schemas, while LLM-based approaches support flexible interactions but lack long-horizon decision making, resulting in poor coordination between information acquisition and target commitment. To address this limitation, we formulate goal-oriented conversation as an uncertainty-aware sequential decision problem, where uncertainty serves as a guiding signal for multi-turn decision making. We propose a Conversation Uncertainty-aware Planning framework (CUP) that integrates language models with structured planning: a language model proposes feasible actions, and a planner evaluates their long-term impact on uncertainty reduction. Experiments on multiple conversational benchmarks show that CUP consistently improves success rates while requiring fewer interaction turns. Further analysis demonstrates that uncertainty-aware planning contributes to more efficient information acquisition and earlier confident commitment.