Beyond Static Personas: Situational Personality Steering for Large Language Models

arXiv cs.CL / 4/16/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that current personalized LLM approaches are limited by low controllability, high compute costs, and reliance on static persona modeling that cannot adapt well to different situations.
  • It provides evidence that LLM personalities show situation-dependent but consistent behavior patterns, using multi-perspective analysis of “persona neurons.”
  • It introduces IRIS, a training-free, neuron-based Identify–Retrieve–Steer framework that identifies situation-relevant persona neurons, retrieves them in a situation-aware way, and applies similarity-weighted steering.
  • The method is evaluated on PersonalityBench and SPBench, a new benchmark covering situational personality scenarios, and is reported to outperform strong baseline methods.
  • Results indicate IRIS generalizes better and remains robust on complex, unseen situations as well as across different model architectures.

Abstract

Personalized Large Language Models (LLMs) facilitate more natural, human-like interactions in human-centric applications. However, existing personalization methods are constrained by limited controllability and high resource demands. Furthermore, their reliance on static personality modeling restricts adaptability across varying situations. To address these limitations, we first demonstrate the existence of situation-dependency and consistent situation-behavior patterns within LLM personalities through a multi-perspective analysis of persona neurons. Building on these insights, we propose IRIS, a training-free, neuron-based Identify-Retrieve-Steer framework for advanced situational personality steering. Our approach comprises situational persona neuron identification, situation-aware neuron retrieval, and similarity-weighted steering. We empirically validate our framework on PersonalityBench and our newly introduced SPBench, a comprehensive situational personality benchmark. Experimental results show that our method surpasses best-performing baselines, demonstrating IRIS's generalization and robustness to complex, unseen situations and different models architecture.