AI Navigate

Parametric Social Identity Injection and Diversification in Public Opinion Simulation

arXiv cs.CL / 3/18/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper identifies a Diversity Collapse phenomenon in LLM-based public opinion simulations where social identities become indistinguishable across layers.
  • It proposes Parametric Social Identity Injection (PSII), a framework that injects explicit demographic attributes and value orientations into intermediate hidden states of LLMs.
  • PSII enables fine-grained, controllable identity modulation at the representation level, unlike traditional prompt-based persona conditioning.
  • Experiments on the World Values Survey with multiple open-source LLMs show PSII improves distributional fidelity and diversity, reducing KL divergence to real-world data.
  • The work provides insights into representation-level control of LLM agents and includes code and data at the linked GitHub repository for reproducibility.

Abstract

Large language models (LLMs) have recently been adopted as synthetic agents for public opinion simulation, offering a promising alternative to costly and slow human surveys. Despite their scalability, current LLM-based simulation methods fail to capture social diversity, producing flattened inter-group differences and overly homogeneous responses within demographic groups. We identify this limitation as a Diversity Collapse phenomenon in LLM hidden representations, where distinct social identities become increasingly indistinguishable across layers. Motivated by this observation, we propose Parametric Social Identity Injection (PSII), a general framework that injects explicit, parametric representations of demographic attributes and value orientations directly into intermediate hidden states of LLMs. Unlike prompt-based persona conditioning, PSII enables fine-grained and controllable identity modulation at the representation level. Extensive experiments on the World Values Survey using multiple open-source LLMs show that PSII significantly improves distributional fidelity and diversity, reducing KL divergence to real-world survey data while enhancing overall diversity. This work provides new insights into representation-level control of LLM agents and advances scalable, diversity-aware public opinion simulation. Code and data are available at https://github.com/halsayxi/PSII.