Linguistic Signatures for Enhanced Emotion Detection

arXiv cs.CL / 3/24/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies whether interpretable linguistic regularities (emotion-specific “linguistic signatures”) can reliably indicate emotions across multiple text datasets and label sets.
  • Researchers extract linguistic feature signatures from 13 English emotion datasets and test whether adding these high-level features to transformer models improves emotion classification.
  • RoBERTa-based models augmented with the linguistic signatures show consistent improvements, reaching up to +2.4 macro F1 on the GoEmotions benchmark.
  • The results suggest that explicit lexical cues can complement transformer representations and enhance robustness for emotion category prediction.

Abstract

Emotion detection is a central problem in NLP, with recent progress driven by transformer-based models trained on established datasets. However, little is known about the linguistic regularities that characterize how emotions are expressed across different corpora and labels. This study examines whether linguistic features can serve as reliable interpretable signals for emotion recognition in text. We extract emotion-specific linguistic signatures from 13 English datasets and evaluate how incorporating these features into transformer models impacts performance. Our RoBERTa-based models enriched with high level linguistic features achieve consistent performance gains of up to +2.4 macro F1 on the GoEmotions benchmark, showing that explicit lexical cues can complement neural representations and improve robustness in predicting emotion categories.