Towards Trustworthy Depression Estimation via Disentangled Evidential Learning

arXiv cs.LG / 4/21/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces EviDep, an evidential learning framework for automated depression estimation that outputs both severity and calibrated aleatoric/epistemic uncertainty using a Normal-Inverse-Gamma distribution.
  • It argues that deterministic point-estimation approaches are unsafe in real-world clinical settings because they can be overconfident under signal corruption and ambient noise.
  • The work targets a key failure mode in multimodal evidential fusion: uncontrolled accumulation of cross-modal redundancies that can artificially inflate diagnostic confidence through double-counting overlapping evidence.
  • EviDep improves robustness by using frequency-aware feature extraction (wavelet-based Mixture-of-Experts to filter task-irrelevant noise) and a disentangled evidential learning design that separates shared consensus from modality-specific details before Bayesian fusion.
  • Experiments on AVEC 2013/2014, DAIC-WOZ, and E-DAIC report state-of-the-art prediction accuracy along with better uncertainty calibration, aiming to provide a safer “fail-safe” clinical screening mechanism.

Abstract

Automated depression estimation is highly vulnerable to signal corruption and ambient noise in real-world deployment. Prevailing deterministic methods produce uncalibrated point estimates, exposing safety-critical clinical systems to the severe risk of overconfident misdiagnoses. To establish a highly resilient and trustworthy assessment paradigm, we propose EviDep, an evidential learning framework that jointly quantifies depression severity alongside aleatoric and epistemic uncertainties via a Normal-Inverse-Gamma distribution. A fundamental vulnerability in multimodal evidential fusion is the uncontrolled accumulation of cross-modal redundancies. This structural flaw artificially inflates diagnostic confidence by double-counting overlapping evidence. To guarantee robust evidence synthesis, EviDep enforces strict information integrity. First, a Frequency-aware Feature Extraction module leverages a wavelet-based Mixture-of-Experts to dynamically isolate task-irrelevant noise, preserving the fidelity of diagnostic signals. Subsequently, a Disentangled Evidential Learning strategy separates the shared consensus from modality-specific nuances. By explicitly decoupling these representations before Bayesian fusion, EviDep systematically mitigates evidence redundancy. Extensive experiments on AVEC 2013, 2014, DAIC-WOZ, and E-DAIC confirm that EviDep achieves state-of-the-art predictive accuracy and superior uncertainty calibration, delivering a robust fail-safe mechanism for trustworthy clinical screening.