RADMI: Latent Information Aggregation as a Proxy for Model Uncertainty

arXiv cs.CV / 5/5/2026



Key Points

RADMI (Resolution-Aggregated Decoder Mutual Information) is a single-pass technique for estimating epistemic uncertainty in segmentation networks without using costly ensembles or multiple stochastic forward passes.
It computes mutual information (MI) between consecutive decoder layers. Higher inter-layer MI aligns with higher prediction uncertainty, especially near ambiguous class boundaries.
On a seismic facies segmentation benchmark, RADMI shows the strongest correlation with deep ensemble uncertainty among single-pass methods, outperforming the next-best baselines by 5.5% in Pearson correlation and 10.7% in Spearman correlation.
The method produces sharp, boundary-localized uncertainty maps while requiring no architectural modifications, using linear aggregation of normalized information flow as an efficient uncertainty proxy.
Overall, the work proposes a scalable uncertainty-estimation approach for dense prediction tasks in encoder–decoder models by leveraging information-flow signals across decoder layers.
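
The idea in the points above can be illustrated with a rough sketch. This is not the authors' implementation: the summary does not specify the MI estimator or the normalization, so the channel-mean reduction, nearest-neighbour upsampling, histogram (plug-in) MI estimator, sliding-window size, and min-max normalization below are all assumptions, and every function name is hypothetical.

```python
import numpy as np

def upsample_nearest(x, out_hw):
    """Nearest-neighbour resize of a 2-D map to out_hw = (H, W)."""
    h, w = x.shape
    rows = np.arange(out_hw[0]) * h // out_hw[0]
    cols = np.arange(out_hw[1]) * w // out_hw[1]
    return x[rows][:, cols]

def quantize(x, bins):
    """Map real values to integer bin indices in [0, bins - 1]."""
    lo, hi = x.min(), x.max()
    if hi == lo:
        return np.zeros(x.shape, dtype=np.int64)
    q = ((x - lo) / (hi - lo) * bins).astype(np.int64)
    return np.clip(q, 0, bins - 1)

def mutual_information(a, b, bins):
    """Plug-in MI estimate (in nats) from the joint histogram of two integer arrays."""
    joint = np.zeros((bins, bins))
    np.add.at(joint, (a.ravel(), b.ravel()), 1.0)
    joint /= joint.sum()
    pa = joint.sum(axis=1, keepdims=True)   # marginal of a, shape (bins, 1)
    pb = joint.sum(axis=0, keepdims=True)   # marginal of b, shape (1, bins)
    nz = joint > 0
    return float(np.sum(joint[nz] * np.log(joint[nz] / (pa @ pb)[nz])))

def local_mi_map(a, b, bins=8, window=5):
    """MI between two quantized maps inside a sliding window around each pixel."""
    h, w = a.shape
    r = window // 2
    ap = np.pad(a, r, mode="edge")
    bp = np.pad(b, r, mode="edge")
    out = np.zeros((h, w))
    for i in range(h):
        for j in range(w):
            out[i, j] = mutual_information(
                ap[i:i + window, j:j + window],
                bp[i:i + window, j:j + window],
                bins,
            )
    return out

def aggregate_mi(features, out_hw, bins=8, window=5):
    """features: list of (C, H, W) decoder activations, in decoder order.

    Assumption: each layer is reduced to one channel by averaging, and the
    per-pair MI maps are min-max normalized before being averaged.
    """
    maps = [quantize(upsample_nearest(f.mean(axis=0), out_hw), bins) for f in features]
    total = np.zeros(out_hw)
    for a, b in zip(maps[:-1], maps[1:]):
        mi = local_mi_map(a, b, bins, window)
        span = mi.max() - mi.min()
        total += (mi - mi.min()) / span if span > 0 else np.zeros_like(mi)
    return total / (len(maps) - 1)
```

In practice the `features` list would be collected in a single forward pass (for example, with forward hooks on the decoder blocks), which is what keeps this kind of proxy cheaper than an ensemble. The Python double loop is written for clarity, not speed.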

Abstract

Epistemic uncertainty estimation is essential for identifying regions where deep learning system outputs may be unreliable. However, existing approaches require computationally expensive ensemble methods or multiple stochastic forward passes, limiting their scalability to dense prediction tasks like segmentation. We propose Resolution-Aggregated Decoder Mutual Information (RADMI), a single-pass method that estimates prediction uncertainty by measuring mutual information (MI) between consecutive decoder layers in segmentation networks. We observe that elevated inter-layer MI correlates with prediction uncertainty, as the network must integrate conflicting contextual information at ambiguous regions such as class boundaries. Evaluating on a seismic facies segmentation benchmark, RADMI achieves the highest correlation with deep ensemble uncertainty among all single-pass methods, outperforming the next-best baselines by 5.5% in Pearson and 10.7% in Spearman correlation coefficients. Compared to baselines that either lack spatial precision or demand significant computational overhead, RADMI yields sharp, boundary-localized uncertainty maps without architectural modifications. Our results suggest that linear aggregation of normalized information flow provides a principled and efficient proxy for prediction uncertainty in encoder-decoder architectures.
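
The evaluation described in the abstract compares a single-pass uncertainty map against deep ensemble uncertainty using Pearson and Spearman correlation. A minimal sketch of that comparison is shown here, assuming both maps are already computed and have the same shape. How the paper computes the ensemble reference map is not stated in this summary, so it is treated as a given input. The function names are hypothetical, and the rank helper assumes no tied values.

```python
import numpy as np

def pearson(x, y):
    """Pearson correlation between two arrays, flattened to 1-D."""
    x = np.asarray(x, dtype=float).ravel()
    y = np.asarray(y, dtype=float).ravel()
    xc, yc = x - x.mean(), y - y.mean()
    return float(xc @ yc / np.sqrt((xc @ xc) * (yc @ yc)))

def ranks(x):
    """Ordinal ranks of the flattened input (assumes no ties)."""
    flat = np.asarray(x).ravel()
    order = np.argsort(flat, kind="stable")
    r = np.empty(order.size)
    r[order] = np.arange(order.size)
    return r

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

def compare_to_ensemble(proxy_map, ensemble_map):
    """Return (pearson, spearman) between a proxy map and a reference map."""
    return pearson(proxy_map, ensemble_map), spearman(proxy_map, ensemble_map)
```

Pearson rewards a linear relationship between the two maps, while Spearman only asks that they rank pixels in the same order, so a proxy that is a monotone but nonlinear function of the ensemble uncertainty would score perfectly on Spearman and lower on Pearson.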


