Busemann energy-based attention for emotion analysis in Poincar\'e discs

arXiv cs.LG / 4/9/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces EmBolic, a fully hyperbolic deep learning architecture for fine-grained emotion analysis from text, motivated by hyperbolic geometry’s ability to represent hierarchies.
  • Instead of treating emotions as discrete categories, the model infers a continuous “curvature” structure over an emotion space, with semantic ambiguity reflected as hierarchical word–emotion relations.
  • Its core is a hyperbolic attention mechanism on a Poincaré disc: the model generates query points from messages, derives key points on the boundary automatically, and makes predictions using Busemann energy between queries and keys.
  • Experiments report strong generalization and “reasonably good” accuracy even with small representation dimensions, suggesting the approach is parameter- and size-efficient.

Abstract

We present EmBolic - a novel fully hyperbolic deep learning architecture for fine-grained emotion analysis from textual messages. The underlying idea is that hyperbolic geometry efficiently captures hierarchies between both words and emotions. In our context, these hierarchical relationships arise from semantic ambiguities. EmBolic aims to infer the curvature on the continuous space of emotions, rather than treating them as a categorical set without any metric structure. In the heart of our architecture is the attention mechanism in the hyperbolic disc. The model is trained to generate queries (points in the hyperbolic disc) from textual messages, while keys (points at the boundary) emerge automatically from the generated queries. Predictions are based on the Busemann energy between queries and keys, evaluating how well a certain textual message aligns with the class directions representing emotions. Our experiments demonstrate strong generalization properties and reasonably good prediction accuracy even for small dimensions of the representation space. Overall, this study supports our claim that affective computing is one of the application domains where hyperbolic representations are particularly advantageous.