Contrastive Analysis of Linguistic Representations in Large Language Model Outputs through Structured Synthetic Data Generation and Abstracted N-gram Associations

arXiv cs.CL / 4/21/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a framework to uncover linguistic and discourse patterns linked to different social groups using contrastive synthetic text generation combined with statistical analysis.
  • Unlike word-list-based bias detection, it focuses on identifying subtle bias signals in contextualized productions rather than isolated words or sentences.
  • The method generates contextualized data by constructing controlled scenario-and-group-marker combinations to create minimal pairs that differ mainly by the referenced group while holding narrative conditions constant.
  • It generalizes linguistic forms and quantifies group-associated abstractions using a variant of pointwise mutual information, then applies a fragment-ranking strategy to surface segments with concentrated bias signals for expert review.
  • Overall, the approach aims to bridge quantitative measurement with qualitative assessment of harmful potential in context across narrative, task-oriented, and dialogue genres.

Abstract

We present a methodological framework to discover linguistic and discursive patterns associated to different social groups through contrastive synthetic text generation and statistical analysis. In contrast with previous approaches, we aim to characterize subtle expressions of bias, instead of diagnosing bias through a pre-determined list of words or expressions. We are also working with contextualized data instead of isolated words or sentences. Our methodology applies to textual productions in any genre, encompassing narrative, task-oriented or dialogic. Contextualized data are generated using controlled combinations of situational scenarios and group markers, creating minimal pairs of texts that differ only in the referenced group while maintaining comparable narrative conditions. To facilitate robust analysis, linguistic forms are generalized and associations between linguistic abstractions and groups are quantified using a variant of pointwise mutual information to detect expressions that appear disproportionately across groups. A fragment-ranking strategy then prioritizes text segments with a high concentration of biased linguistic signals, which allows for experts to assess the harmful potential of linguistic expressions in context, bridging quantitative analysis and qualitative interpretation.