RADMI: Latent Information Aggregation as a Proxy for Model Uncertainty

arXiv cs.CV / 5/5/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • RADMI (Resolution-Aggregated Decoder Mutual Information) is a single-pass technique for estimating epistemic uncertainty in segmentation networks without using costly ensembles or multiple stochastic forward passes.
  • It computes mutual information (MI) between consecutive decoder layers, finding that higher inter-layer MI aligns with higher prediction uncertainty—especially near ambiguous class boundaries.
  • On a seismic facies segmentation benchmark, RADMI shows the strongest correlation with deep ensemble uncertainty among single-pass methods, improving Pearson correlation by 5.5% and Spearman by 10.7% over the best baseline.
  • The method produces sharp, boundary-localized uncertainty maps while requiring no architectural modifications, using linear aggregation of normalized information flow as an efficient uncertainty proxy.
  • Overall, the work proposes a scalable uncertainty-estimation approach for dense prediction tasks in encoder–decoder models by leveraging information-flow signals across decoder layers.

Abstract

Epistemic uncertainty estimation is essential for identifying regions where deep learning system outputs may be unreliable. However, existing approaches require computationally expensive ensemble methods or multiple stochastic forward passes, limiting their scalability to dense prediction tasks like segmentation. We propose Resolution-Aggregated Decoder Mutual Information (RADMI), a single-pass method that estimates prediction uncertainty by measuring mutual information (MI) between consecutive decoder layers in segmentation networks. We observe that elevated inter-layer MI correlates with prediction uncertainty, as the network must integrate conflicting contextual information at ambiguous regions such as class boundaries. Evaluating on a seismic facies segmentation benchmark, RADMI achieves the highest correlation with deep ensemble uncertainty among all single-pass methods, outperforming the next-best baselines by 5.5% in Pearson and 10.7% in Spearman correlation coefficients. Compared to baselines that either lack spatial precision or demand significant computational overhead, RADMI yields sharp, boundary-localized uncertainty maps without architectural modifications. Our results suggest that linear aggregation of normalized information flow provides a principled and efficient proxy for prediction uncertainty in encoder-decoder architectures.