Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

arXiv cs.CL / 5/1/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

共有:

Key Points

The paper introduces “Cognitive Digital Shadows (CDS),” a synthetic 190,000-record dataset designed to analyze how LLM-generated discourse changes under persona- and context-conditioned prompting.
CDS is generated using 19 different LLMs, where each output is produced by prompting the model to “shadow” either a human persona or an AI-assistant role.
The dataset covers four controversial societal topics—vaccines/healthcare, social media disinformation, the gender gap in science, and STEM stereotypes—and includes structured persona attributes (17 sociodemographic/psychological factors) to connect prompts, language, stances, and reasoning.
Texts are validated for topic anchoring and the corpus can be used for emotional analysis using interpretable NLP methods such as textual “forma mentis” networks.
A pooling platform with interactive dashboards supports group-level comparisons of emotional and semantic framing across personas, topics, and models, enabling future audits of bias, social sensitivity, and alignment.

Abstract

Large Language Models (LLMs) can strongly shape social discourse, yet datasets investigating how LLM outputs vary across controlled social and contextual prompting remain sparse. Cognitive Digital Shadows (CDS) is a 190,000-record synthetic corpus supporting analyses of LLM-generated discourse. Each CDS record is generated by one of 19 LLMs, prompted to shadow either a human persona or an AI-assistant role. CDS contains LLM responses on 4 controversial societal topics: vaccines/healthcare, social media disinformation, the gender gap in science, and STEM stereotypes. Persona-conditioned records encode 17 sociodemographic and psychological attributes, providing data linking LLMs' prompts, language, stances and reasoning. Texts are validated for topic anchoring and can support emotional analyses via interpretable NLP (e.g. textual forma mentis networks). CDS is enriched by a pooling platform with user-friendly dashboards, enabling easy, interactive group-level comparisons of emotional and semantic framing across personas, topics and models. The CDS prompting framework supports future audits of LLMs' bias, social sensitivity and alignment.

Why Autonomous Coding Agents Keep Failing — And What Actually Works

Dev.to

Text-to-image is easy. Chaining LLMs to generate, critique, and iterate on images autonomously is a routing nightmare. AgentSwarms now supports Image generation playground and creative media workflows!

Reddit r/artificial

Automating FDA Compliance: AI for Specialty Food Producers

Dev.to

Mistral's new flagship Medium 3.5 folds chat, reasoning, and code into one model

THE DECODER

I hate this group but not literally

Reddit r/LocalLLaMA

Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior

Key Points

Abstract

Related Articles

Why Autonomous Coding Agents Keep Failing — And What Actually Works

Text-to-image is easy. Chaining LLMs to generate, critique, and iterate on images autonomously is a routing nightmare. AgentSwarms now supports Image generation playground and creative media workflows!

Automating FDA Compliance: AI for Specialty Food Producers

Mistral's new flagship Medium 3.5 folds chat, reasoning, and code into one model

I hate this group but not literally

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer