Bayesian Scattering: A Principled Baseline for Uncertainty on Image Data

arXiv cs.LG / 3/24/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that uncertainty quantification for image data is currently dominated by complex deep learning, but lacks an interpretable mathematical baseline comparable to Bayesian linear regression for tabular data.
  • It proposes “Bayesian scattering,” which combines a non-learned wavelet scattering transform (feature extractor) with a simple probabilistic head to produce uncertainty estimates.
  • Because the scattering features come from geometric principles rather than training data, the method aims to reduce overfitting to the training distribution.
  • The approach is reported to yield sensible uncertainty under substantial distribution shifts and is evaluated on medical imaging (institution shift), wealth mapping (country-to-country shift), and Bayesian optimization for molecular properties.
  • Overall, the authors position Bayesian scattering as a strong principled baseline to benchmark or complement more complex uncertainty quantification methods.

Abstract

Uncertainty quantification for image data is dominated by complex deep learning methods, yet the field lacks an interpretable, mathematically grounded baseline. We propose Bayesian scattering to fill this gap, serving as a first-step baseline akin to the role of Bayesian linear regression for tabular data. Our method couples the wavelet scattering transform-a deep, non-learned feature extractor-with a simple probabilistic head. Because scattering features are derived from geometric principles rather than learned, they avoid overfitting the training distribution. This helps provide sensible uncertainty estimates even under significant distribution shifts. We validate this on diverse tasks, including medical imaging under institution shift, wealth mapping under country-to-country shift, and Bayesian optimization of molecular properties. Our results suggest that Bayesian scattering is a solid baseline for complex uncertainty quantification methods.