Follow the Clues, Frame the Truth: Hybrid-evidential Deductive Reasoning in Open-Vocabulary Multimodal Emotion Recognition
arXiv cs.AI / 3/18/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- HyDRA introduces Hybrid-evidential Deductive Reasoning Architecture for open-vocabulary multimodal emotion recognition to address ambiguity from diverse, unobserved situational cues.
- It models inference as a Propose-Verify-Decide protocol and leverages reinforcement learning with hierarchical reward shaping to align reasoning trajectories with final task performance.
- The approach yields improved performance over strong baselines, particularly in ambiguous or conflicting multimodal scenarios, with interpretable diagnostic evidence traces.
- The framework emphasizes reconstructing nuanced emotional states by synthesizing multiple evidence-grounded rationales from diverse latent perspectives, moving beyond surface-level associations.
- The paper provides systematic evaluations validating its design choices and interpretable reasoning traces, suggesting practical benefits for robust MER systems.




