IMA-MoE: An Interpretable Modality-Aware Mixture-of-Experts Framework for Characterizing the Neurobiological Signatures of Binge Eating Disorder

arXiv cs.CV / 4/21/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces IMA-MoE, an interpretable modality-aware mixture-of-experts framework aimed at characterizing binge eating disorder (BED) using biological mechanisms rather than symptom-only criteria.
  • IMA-MoE integrates heterogeneous multimodal data—neuroimaging, behavioral, hormonal, and demographic measures—by representing each measure as a token to model cross-modal dependencies while maintaining modality-specific information.
  • To improve transparency, the method adds a token-importance mechanism that quantifies how much each measure contributes to the model’s BED vs. healthy-control predictions.
  • On the large-scale ABCD dataset, IMA-MoE outperforms baseline approaches in distinguishing BED from healthy controls and uncovers sex-specific patterns, with hormonal measures playing a larger predictive role for females.
  • Overall, the study suggests that interpretable, data-driven multimodal modeling could support more biologically informed and potentially more personalized interventions for BED and related neuropsychiatric conditions.

Abstract

Binge eating disorder (BED) is the most prevalent eating disorder. However, current diagnostic frameworks remain largely grounded in symptom-based criteria rather than underlying biological mechanisms, thereby limiting early detection and the development of biologically-informed interventions. Emerging studies have begun to investigate the neurobiological signatures of BED, yet their findings are often difficult to generalize due to the reliance on hypothesis-driven parametric models, single-modality analyses, and limited data diversity. Therefore, there is a critical need for advanced data-driven frameworks capable of modeling multimodal data to uncover generalizable and biologically meaningful signatures of BED. In this study, we propose the Interpretable Modality-Aware Mixture-of-Experts (IMA-MoE), a novel architecture designed to integrate heterogeneous neuroimaging, behavioral, hormonal, and demographic measures within a unified predictive framework. By encoding each measure as a distinct token, IMA-MoE enables flexible modeling of cross-modal dependencies while preserving modality-specific characteristics. We further introduce a token-importance mechanism to enhance interpretability by quantifying the contribution of each measure to model predictions. Evaluated on the large-scale Adolescent Brain Cognitive Development (ABCD) dataset, IMA-MoE demonstrates superior performance in differentiating BED from healthy controls compared with baseline methods, while revealing sex-specific predictive patterns, with hormonal measures contributing more prominently to prediction in females. Collectively, these findings highlight the promise of interpretable, data-driven multimodal modeling in advancing biologically-informed characterization of BED and facilitating more precise and personalized interventions in neuropsychiatric disorders.