A-MBER: Affective Memory Benchmark for Emotion Recognition
arXiv cs.AI / 4/10/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- A-MBER is introduced as an Affective Memory Benchmark to evaluate whether AI assistants can infer a user’s current emotional state using remembered multi-session interaction history rather than only instantaneous cues.
- The benchmark requires models to identify historically relevant evidence, ground their affective interpretation, and justify it based on an interaction trajectory and an anchor turn.
- It is built via a staged pipeline with intermediate representations (including long-horizon planning and structured question construction) and supports judgment, retrieval, and explanation tasks.
- Robustness is explicitly tested through settings like modality degradation and insufficient-evidence conditions to assess how well models handle missing or degraded signals.
- Experiments compare multiple memory integration conditions and find A-MBER is particularly discriminative on long-range implicit affect and trajectory-based, dependency-heavy, and adversarial scenarios.



