Are Emotion and Rhetoric Neurons in LLM? Neuron Recognition and Adaptive Masking for Emotion-Rhetoric Prediction Steering

arXiv cs.CL / 4/21/2026

📰 NewsDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that precise, neuron-level control of emotion and rhetoric in LLMs requires better understanding of internal neuron representations than prior work that mainly uses external optimization.
  • It studies neuron representations and relationships for 6 emotion categories and 4 rhetorical devices, including both emotion neurons and “rhetoric neurons,” which earlier studies often ignored.
  • The authors introduce a neuron identification framework using multi-dimensional screening to more reliably locate relevant neurons, and they propose adaptive masking (dynamic filtering, attenuation masking, and feedback optimization) to support causal validation.
  • By regulating identified neurons, the method can steer generation away from target sentences and improve performance on emotion-related tasks using rhetoric neurons.
  • Experiments across five datasets demonstrate that the approach is effective and offers a new paradigm for fine-grained steering of emotion and rhetoric expression in LLMs.

Abstract

Accurate comprehension and controllable generation of emotion and rhetoric are pivotal for enhancing the reasoning capabilities of large language models (LLMs). Existing studies mostly rely on external optimizations, lacking in-depth exploration of internal representation mechanisms, thus failing to achieve fine-grained steering at the neuron level. A handful of works on neurons are confined to emotions, neglecting rhetoric neurons and their intrinsic connections. Traditional neuron masking also exhibits counterintuitive phenomena, making reliable verification of neuron functionality infeasible. To address these issues, we systematically investigate the neurons representation mechanisms and inherent associations of 6 emotion categories and 4 core rhetorical devices. We propose a neuron identification framework that integrates multi-dimensional screening, and design an adaptive masking method incorporating dynamic filtering, attenuation masking, and feedback optimization, enabling reliable causal validation of neuron functionality.Through neuron regulation, we achieve directed induction of non-target sentences and enhancement of emotion tasks via rhetoric neurons. Experiments on 5 commonly used datasets validate the effectiveness of our method, providing a novel paradigm for the fine-grained steering of emotion and rhetoric expressions in LLMs.