AI Navigate

PMIScore: An Unsupervised Approach to Quantify Dialogue Engagement

arXiv cs.CL / 3/17/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • PMIScore is an unsupervised metric for quantifying dialogue engagement based on pointwise mutual information conditioned on conversation history.
  • To address the computational intractability of PMI in dialogues, the method uses a dual form of divergence and trains a small neural network guided by a mutual information loss.
  • The approach involves generating positive and negative dialogue pairs, extracting embeddings with large language models, and learning from those pairs.
  • The authors validate PMIScore on synthetic and real-world datasets, demonstrating its effectiveness for PMI estimation and supporting the PMI interpretation as an engagement metric.

Abstract

High dialogue engagement is a crucial indicator of an effective conversation. A reliable measure of engagement could help benchmark large language models, enhance the effectiveness of human-computer interactions, or improve personal communication skills. However, quantifying engagement is challenging, since it is subjective and lacks a "gold standard". This paper proposes PMIScore, an efficient unsupervised approach to quantify dialogue engagement. It uses pointwise mutual information (PMI), which is the probability of generating a response conditioning on the conversation history. Thus, PMIScore offers a clear interpretation of engagement. As directly computing PMI is intractable due to the complexity of dialogues, PMIScore learned it through a dual form of divergence. The algorithm includes generating positive and negative dialogue pairs, extracting embeddings by large language models (LLMs), and training a small neural network using a mutual information loss function. We validated PMIScore on both synthetic and real-world datasets. Our results demonstrate the effectiveness of PMIScore in PMI estimation and the reasonableness of the PMI metric itself.