Surprisal Minimisation over Goal-directed Alternatives Predicts Production Choice in Dialogue

arXiv cs.CL / 5/4/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper treats utterance production as a probabilistic, cost-sensitive choice among contextual alternatives defined in information-theoretic terms.
  • It distinguishes goal-directed alternatives (which realize a fixed communicative intent) from goal-agnostic, plausibility-based alternatives, enabling different speaker/listener interpretations of cost measures.
  • The authors propose a method to generate both kinds of alternative sets using language models, and then analyze production choices in open-ended dialogue under deterministic and probabilistic cost minimisation.
  • Their results show that surprisal minimisation relative to goal-directed alternatives best predicts production choices across both evaluation setups.
  • The study argues that optimizing over LM-generated alternatives conditioned on goals and context offers a principled framework for examining both speaker and listener pressures in natural language production.

Abstract

We model utterance production as probabilistic cost-sensitive choice over contextual alternatives, using information-theoretic notions of cost. We distinguish between goal-directed alternatives that realise a fixed communicative intent and goal-agnostic alternatives defined only by contextual plausibility, allowing us to derive speaker- and listener-oriented interpretations of different cost measures. We present a procedure to generate both types of alternative sets using language models. Analysing production choices in open-ended dialogue under both deterministic and probabilistic cost minimisation, we find that surprisal minimisation relative to goal-directed alternatives provides the strongest predictive account under both analyses. By contrast, uniform information density and length-based costs exhibit weaker and less consistent predictive power across conditions. More broadly, our study suggests that alternative-conditioned optimisation with LM-generated alternatives provides a principled framework for studying speaker and listener pressures in naturalistic language production.