Analysing LLM Persona Generation and Fairness Interpretation in Polarised Geopolitical Contexts

arXiv cs.CL / 3/25/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies how five popular LLMs generate persona attributes for Palestinian and Israeli identities under 640 experimental conditions, varying context (war vs non-war) and assigned roles.
  • Results show systematic distributional differences: Palestinian personas in war contexts are more often tied to lower socioeconomic status and survival-oriented roles, while Israeli personas more often preserve middle-class and specialized professional traits.
  • Even when models are instructed to avoid harmful assumptions, fairness-oriented prompting produces diverse changes (e.g., more non-binary gender inferences or shifts toward generic occupations like “student”), while socioeconomic separations largely persist.
  • Analysis of reasoning traces finds that fairness-related concepts appear in rationales, but these do not consistently translate into uniform or direct fairness-aligned changes in the final generated personas.
  • Overall, the study suggests LLMs interpret polarized geopolitical contexts in patterned ways, and that “fairness” signals may affect explanations and outputs through non-uniform mechanisms rather than a single reliable mapping.

Abstract

Large language models (LLMs) are increasingly utilised for social simulation and persona generation, necessitating an understanding of how they represent geopolitical identities. In this paper, we analyse personas generated for Palestinian and Israeli identities by five popular LLMs across 640 experimental conditions, varying context (war vs non-war) and assigned roles. We observe significant distributional patterns in the generated attributes: Palestinian profiles in war contexts are frequently associated with lower socioeconomic status and survival-oriented roles, whereas Israeli profiles predominantly retain middle-class status and specialised professional attributes. When prompted with explicit instructions to avoid harmful assumptions, models exhibit diverse distributional changes, e.g., marked increases in non-binary gender inferences or a convergence toward generic occupational roles (e.g., "student"), while the underlying socioeconomic distinctions often remain. Furthermore, analysis of reasoning traces reveals an interesting dynamics between model reasoning and generation: while rationales consistently mention fairness-related concepts, the final generated personas follow the aforementioned diverse distributional changes. These findings illustrate a picture of how models interpret geopolitical contexts, while suggesting that they process fairness and adjust in varied ways; there is no consistent, direct translation of fairness concepts into representative outcomes.