BALM: A Model-Agnostic Framework for Balanced Multimodal Learning under Imbalanced Missing Rates
arXiv cs.CV / 3/23/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- BALM is a model-agnostic plug-in framework designed to enable balanced multimodal learning under imbalanced missing rates (IMR), addressing the dominance of information-rich modalities.
- It includes two modules: the Feature Calibration Module (FCM) that recalibrates unimodal features using global context to align representations across missing patterns, and the Gradient Rebalancing Module (GRM) that modulates gradient magnitudes and directions to balance learning across modalities from distributional and spatial perspectives.
- BALM is backbone-agnostic and can be integrated into diverse architectures, including multimodal emotion recognition (MER) models, without changing their structure.
- Experimental results on multiple MER benchmarks show BALM improves robustness and performance under various missing/imbalance settings, with code available at the provided GitHub repository.
Related Articles
How political censorship actually works inside Qwen, DeepSeek, GLM, and Yi: Ablation and behavioral results across 9 models
Reddit r/LocalLLaMA
Engenharia de Prompt: Por Que a Forma Como Você Pergunta Muda Tudo(Um guia introdutório)
Dev.to
The Obligor
Dev.to
The Markup
Dev.to
2026 年 AI 部落格變現完整攻略:從第一篇文章到月收入 $1000
Dev.to