One Persona, Many Cues, Different Results: How Sociodemographic Cues Impact LLM Personalization

arXiv cs.CL / 4/27/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies how using “persona” cues (sociodemographic user attributes embedded in prompts) affects LLM personalization, noting that while it can improve UX it may also increase bias and unfair outcomes.
  • It challenges prior work that relied on a single cue to activate a persona, arguing this can ignore real-world differences in prompt phrasing and the rarity of certain cues, reducing external validity.
  • Across six commonly used persona cues tested on seven open and proprietary LLMs over multiple writing and advice tasks, the cues are often correlated, but they still lead to large response differences between personas.
  • The authors warn that conclusions about persona-induced differences and bias may change depending on which single cue is used, especially when the cue is overly explicit or unrealistic.
  • Overall, the study recommends caution in making strong claims from experiments that vary only one persona cue, because cue choice can materially alter results.

Abstract

Personalization of LLMs by sociodemographic subgroup often improves user experience, but can also introduce or amplify biases and unfair outcomes across groups. Prior work has employed so-called personas, sociodemographic user attributes conveyed to a model, to study bias in LLMs by relying on a single cue to prompt a persona, such as user names or explicit attribute mentions. This disregards LLM sensitivity to prompt variation and the rarity of some cues in real interactions (external validity). We compare six commonly used persona cues across seven open and proprietary LLMs on four writing and advice tasks. While cues are overall highly correlated, they produce substantial variance in responses across personas that can change findings on persona-induced differences and bias. We therefore caution against claims based on single persona cues, especially when they are overly explicit and have low external validity.