Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior
arXiv cs.CL / 5/1/2026
📰 NewsDeveloper Stack & InfrastructureModels & Research
Key Points
- The paper introduces “Cognitive Digital Shadows (CDS),” a synthetic 190,000-record dataset designed to analyze how LLM-generated discourse changes under persona- and context-conditioned prompting.
- CDS is generated using 19 different LLMs, where each output is produced by prompting the model to “shadow” either a human persona or an AI-assistant role.
- The dataset covers four controversial societal topics—vaccines/healthcare, social media disinformation, the gender gap in science, and STEM stereotypes—and includes structured persona attributes (17 sociodemographic/psychological factors) to connect prompts, language, stances, and reasoning.
- Texts are validated for topic anchoring and the corpus can be used for emotional analysis using interpretable NLP methods such as textual “forma mentis” networks.
- A pooling platform with interactive dashboards supports group-level comparisons of emotional and semantic framing across personas, topics, and models, enabling future audits of bias, social sensitivity, and alignment.
Related Articles

Why Autonomous Coding Agents Keep Failing — And What Actually Works
Dev.to

Text-to-image is easy. Chaining LLMs to generate, critique, and iterate on images autonomously is a routing nightmare. AgentSwarms now supports Image generation playground and creative media workflows!
Reddit r/artificial

Automating FDA Compliance: AI for Specialty Food Producers
Dev.to

Mistral's new flagship Medium 3.5 folds chat, reasoning, and code into one model
THE DECODER
I hate this group but not literally
Reddit r/LocalLLaMA