AI generates well-liked but templatic empathic responses

arXiv cs.CL / 2026/4/10

💬 オピニオンSignals & Early TrendsIdeas & Deep AnalysisModels & Research

要点

  • The paper argues that LLMs’ “empathic” performance comes from reliably deploying a popular, well-liked template for expressing empathy rather than from deeper interpersonal understanding.
  • It introduces a taxonomy of 10 empathic language tactics (e.g., validating feelings and paraphrasing) and uses this framework to analyze both human- and AI-written empathic responses.
  • Across two studies totaling 4,555 responses, LLM outputs are found to be highly formulaic at the discourse-functional level, with a structured tactic-sequence template matching most AI responses.
  • Human-written empathic responses are more varied, while the AI template accounts for the majority of content overlap when tactic matches occur.
  • The authors conclude with implications for how AI-generated empathy may evolve and how such templatic language should be interpreted or evaluated in practice.

Abstract

Recent research shows that greater numbers of people are turning to Large Language Models (LLMs) for emotional support, and that people rate LLM responses as more empathic than human-written responses. We suggest a reason for this success: LLMs have learned and consistently deploy a well-liked template for expressing empathy. We develop a taxonomy of 10 empathic language "tactics" that include validating someone's feelings and paraphrasing, and apply this taxonomy to characterize the language that people and LLMs produce when writing empathic responses. Across a set of 2 studies comparing a total of n = 3,265 AI-generated (by six models) and n = 1,290 human-written responses, we find that LLM responses are highly formulaic at a discourse functional level. We discovered a template -- a structured sequence of tactics -- that matches between 83--90% of LLM responses (and 60--83\% in a held out sample), and when those are matched, covers 81--92% of the response. By contrast, human-written responses are more diverse. We end with a discussion of implications for the future of AI-generated empathy.