Quantification of Credal Uncertainty: A Distance-Based Approach

arXiv cs.AI / 3/31/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses how to quantify aleatoric (data) and epistemic (model) uncertainty for credal sets—convex sets of probability measures—especially in multiclass classification.
  • It proposes a distance-based uncertainty quantification framework using Integral Probability Metrics (IPMs), yielding measures that have interpretable meanings and satisfy desirable theoretical properties.
  • The authors show computational tractability for common IPM choices and specifically instantiate the framework with total variation distance to derive efficient multiclass uncertainty measures.
  • In the binary setting, the proposed approach is consistent with existing established uncertainty measures, while providing a principled generalization to multiclass cases.
  • Experiments indicate the method is practically useful and achieves favorable performance with low computational overhead.

Abstract

Credal sets, i.e., closed convex sets of probability measures, provide a natural framework to represent aleatoric and epistemic uncertainty in machine learning. Yet how to quantify these two types of uncertainty for a given credal set, particularly in multiclass classification, remains underexplored. In this paper, we propose a distance-based approach to quantify total, aleatoric, and epistemic uncertainty for credal sets. Concretely, we introduce a family of such measures within the framework of Integral Probability Metrics (IPMs). The resulting quantities admit clear semantic interpretations, satisfy natural theoretical desiderata, and remain computationally tractable for common choices of IPMs. We instantiate the framework with the total variation distance and obtain simple, efficient uncertainty measures for multiclass classification. In the binary case, this choice recovers established uncertainty measures, for which a principled multiclass generalization has so far been missing. Empirical results confirm practical usefulness, with favorable performance at low computational cost.