What Makes a Good Response? An Empirical Analysis of Quality in Qualitative Interviews

arXiv cs.CL / 4/8/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes and empirically tests 10 measures of qualitative interview response quality to see which ones truly predict whether responses contribute to study findings.
  • It introduces the Qualitative Interview Corpus, a new dataset of 343 interview transcripts containing 16,940 participant responses drawn from 14 real research projects.
  • The strongest predictor of response quality is direct relevance to a key research question.
  • Two widely used NLP-style metrics—clarity and surprisal-based informativeness—are found not to be predictive of response quality.
  • The authors conclude with practical, scalable metrics and analytic guidance to improve the design of qualitative studies and the evaluation of automated interview systems.

Abstract

Qualitative interviews provide essential insights into human experiences when they elicit high-quality responses. While qualitative and NLP researchers have proposed various measures of interview quality, these measures lack validation that high-scoring responses actually contribute to the study's goals. In this work, we identify, implement, and evaluate 10 proposed measures of interview response quality to determine which are actually predictive of a response's contribution to the study findings. To conduct our analysis, we introduce the Qualitative Interview Corpus, a newly constructed dataset of 343 interview transcripts with 16,940 participant responses from 14 real research projects. We find that direct relevance to a key research question is the strongest predictor of response quality. We additionally find that two measures commonly used to evaluate NLP interview systems, clarity and surprisal-based informativeness, are not predictive of response quality. Our work provides analytic insights and grounded, scalable metrics to inform the design of qualitative studies and the evaluation of automated interview systems.

What Makes a Good Response? An Empirical Analysis of Quality in Qualitative Interviews | AI Navigate