How Emotion Shapes the Behavior of LLMs and Agents: A Mechanistic Study

arXiv cs.AI / 4/2/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies how “emotion” signals can mechanistically influence the behavior of LLMs and AI agents, going beyond prior work that treated emotion as a surface style or output target.
  • It introduces E-STEER, an interpretable emotion steering framework that embeds emotion as a structured, controllable variable in model hidden states for representation-level intervention.
  • Experiments analyze how emotion affects objective reasoning, subjective generation, safety outcomes, and multi-step agent behavior across task settings.
  • Results show non-monotonic emotion–behavior relationships that align with psychological theories, and indicate that certain emotions can improve both capability and safety.
  • The findings suggest emotion can be used as a systematic control signal to shape agent trajectories across multiple steps, not just to alter text tone.

Abstract

Emotion plays an important role in human cognition and performance. Motivated by this, we investigate whether analogous emotional signals can shape the behavior of large language models (LLMs) and agents. Existing emotion-aware studies mainly treat emotion as a surface-level style factor or a perception target, overlooking its mechanistic role in task processing. To address this limitation, we propose E-STEER, an interpretable emotion steering framework that enables direct representation-level intervention in LLMs and agents. It embeds emotion as a structured, controllable variable in hidden states, and with it, we examine the impact of emotion on objective reasoning, subjective generation, safety, and multi-step agent behaviors. The results reveal non-monotonic emotion-behavior relations consistent with established psychological theories, and show that specific emotions not only enhance LLM capability but also improve safety, and systematically shape multi-step agent behaviors.