The Pragmatic Persona: Discovering LLM Persona through Bridging Inference

arXiv cs.CL / 4/28/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces a new framework for discovering LLM personas using “bridging inference,” focusing on implicit conceptual relations that connect dialogue turns.
  • Instead of treating chat as a flat sequence of tokens or relying mainly on lexical/style cues, it models discourse relations as structured knowledge graphs to capture deeper coherence.
  • Experiments across multiple reasoning backbones and target LLMs (from small models to 80B-parameter systems) show improved semantic coherence and more stable persona identification versus frequency or style-based baselines.
  • The authors argue that persona traits are encoded in the structural organization of discourse rather than in isolated wording patterns, and they ground the approach in Cognitive Discourse Theory.
  • The work provides an implementation and links code via a public GitHub repository.

Abstract

Large Language Models (LLMs) reveal inherent and distinctive personas through dialogue. However, most existing persona discovery approaches rely on surface-level lexical or stylistic cues, treating dialogue as a flat sequence of tokens and failing to capture the deeper discourse-level structures that sustain persona consistency. To address this limitation, we propose a novel analytical framework that interprets LLM dialogue through bridging inference -- implicit conceptual relations that connect utterances via shared world knowledge and discourse coherence. By modeling these relations as structured knowledge graphs, our approach captures latent semantic links that govern how LLMs organize meaning across turns, enabling persona discovery at the level of discourse coherence rather than surface realizations. Experimental results across multiple reasoning backbones and target LLMs, ranging from small-scale models to 80B-parameter systems, demonstrate that bridging-inference graphs yield significantly stronger semantic coherence and more stable persona identification than frequency or style-based baselines. These results show that persona traits are consistently encoded in the structural organization of discourse rather than isolated lexical patterns. This work presents a systematic framework for probing, extracting, and visualizing latent LLM personas through the lens of Cognitive Discourse Theory, bridging computational linguistics, cognitive semantics, and persona reasoning in large language models. Codes are available at https://github.com/JiSoo-Yang/Persona_Bridging.git