Convolutional Maximum Mean Discrepancy for Inference in Noisy Data

arXiv stat.ML / April 15, 2026


Key Points

  • The paper proposes a new inference framework for data contaminated by measurement error, including potentially heteroscedastic noise drawn from a known distribution.
  • It introduces convolutional Maximum Mean Discrepancy (convMMD), which compares distributions after convolving with the noise while preserving metric validity under standard kernel assumptions.
  • The authors derive finite-sample deviation bounds that remain unaffected by measurement error and show an equivalence between hypothesis testing under noise and kernel smoothing.
  • They present a convMMD-based estimator with proofs of consistency and asymptotic normality, along with an efficient implementation using stochastic gradient descent.
  • Experiments and real-world applications (notably in astronomy and social sciences) demonstrate the method’s practical effectiveness under noisy observational settings.
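The convMMD idea is easiest to see in one dimension with a Gaussian kernel and Gaussian measurement noise, where the noise convolution has a closed form: averaging the kernel over noise with known variances inflates the bandwidth, which is the kernel-smoothing equivalence noted above. The sketch below is illustrative only, not the authors' implementation; the Gaussian kernel, the V-statistic estimator, and all variable names are assumptions.

```python
import numpy as np

def gaussian_conv_kernel(a, b, sa2, sb2, h2=1.0):
    # Gaussian kernel k(u, v) = exp(-(u - v)^2 / (2 h2)), analytically
    # averaged over Gaussian noise with known variances sa2 (on a) and
    # sb2 (on b): the bandwidth inflates to h2 + sa2 + sb2, with a
    # sqrt(h2 / (h2 + sa2 + sb2)) scale factor.
    s2 = h2 + sa2 + sb2
    return np.sqrt(h2 / s2) * np.exp(-(a - b) ** 2 / (2.0 * s2))

def conv_mmd2(x, y, sx2, sy2, h2=1.0):
    # Biased (V-statistic) estimate of squared convMMD between 1-d
    # samples x, y with per-observation noise variances sx2, sy2.
    kxx = gaussian_conv_kernel(x[:, None], x[None, :], sx2[:, None], sx2[None, :], h2)
    kyy = gaussian_conv_kernel(y[:, None], y[None, :], sy2[:, None], sy2[None, :], h2)
    kxy = gaussian_conv_kernel(x[:, None], y[None, :], sx2[:, None], sy2[None, :], h2)
    return kxx.mean() + kyy.mean() - 2.0 * kxy.mean()

rng = np.random.default_rng(0)
n = 500
sx2 = rng.uniform(0.1, 0.5, n)   # known, heteroscedastic noise variances
sy2 = rng.uniform(0.1, 0.5, n)
x = rng.normal(0.0, 1.0, n) + rng.normal(0.0, np.sqrt(sx2))       # noisy draws from N(0, 1)
y_same = rng.normal(0.0, 1.0, n) + rng.normal(0.0, np.sqrt(sy2))  # same latent distribution
y_diff = rng.normal(2.0, 1.0, n) + rng.normal(0.0, np.sqrt(sy2))  # shifted latent distribution

same = conv_mmd2(x, y_same, sx2, sy2)   # small: the distributions match
diff = conv_mmd2(x, y_diff, sx2, sy2)   # clearly larger: they do not
```

Although the inflated bandwidth varies per pair of points, the convolved kernel remains positive semi-definite: it is an inner product of noise-smoothed feature maps, which is what preserves metric validity under the standard kernel conditions the paper invokes.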

Abstract

Modern data analyses frequently encounter settings where samples of variables are contaminated by measurement error. Ignoring measurement noise can substantially degrade statistical inference, while existing correction techniques are often computationally costly and inefficient. Recent advances in kernel methods, particularly those based on Maximum Mean Discrepancy (MMD), have enabled flexible, distribution-free inference, yet typically assume precise data and overlook contamination by measurement error. In this work, we introduce a novel framework for inference with samples corrupted by potentially heteroscedastic noise from a known distribution. Central to our approach is the convolutional MMD (convMMD), which compares distributions after noise convolution and retains metric validity under standard kernel conditions. We establish finite-sample deviation bounds that are unaffected by measurement error and prove an equivalence between testing under noise and kernel smoothing. Leveraging these insights, we introduce a convMMD-based estimator for inference with noisy, heteroscedastic observations. We establish its consistency and asymptotic normality, and provide an efficient implementation using stochastic gradient descent. We demonstrate the practical effectiveness of our approach through simulations and applications in astronomy and social sciences.
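The SGD-based estimator mentioned in the abstract can be illustrated with a toy minimum-distance sketch: fit the location of a Gaussian model to noisy observations by descending a stochastic gradient of the squared convMMD, with model draws reparameterized so the gradient flows through the parameter. Everything here is an assumption for illustration, not the paper's algorithm: the pairing of model draws with resampled noise variances, the fixed step size, and the Gaussian kernel with analytically convolved model-side noise.

```python
import numpy as np

def k_conv(a, b, sa2, sb2, h2=1.0):
    # Gaussian kernel analytically convolved with Gaussian noise of
    # variances sa2 and sb2 on its two arguments.
    s2 = h2 + sa2 + sb2
    return np.sqrt(h2 / s2) * np.exp(-(a - b) ** 2 / (2.0 * s2))

rng = np.random.default_rng(1)
n = 400
sig2 = rng.uniform(0.1, 0.4, n)                 # known noise variances
x = rng.normal(1.5, 1.0, n) + rng.normal(0.0, np.sqrt(sig2))  # noisy data, true mean 1.5

theta, lr = 0.0, 0.5
for _ in range(300):
    xb = x[rng.integers(0, n, 100)]             # data mini-batch (noise already realized)
    z = theta + rng.normal(0.0, 1.0, 100)       # reparameterized draws from model N(theta, 1)
    sz2 = rng.choice(sig2, 100)                 # noise variances paired with model draws
    s2 = 1.0 + sz2[None, :]                     # cross-kernel bandwidth: model noise analytic
    kxz = k_conv(xb[:, None], z[None, :], 0.0, sz2[None, :])
    # Gradient of squared convMMD w.r.t. theta: only the cross term moves
    # (the model-model term depends on differences of reparameterized
    # draws, so its gradient in theta vanishes).
    grad = -2.0 * np.mean((xb[:, None] - z[None, :]) / s2 * kxz)
    theta -= lr * grad
# theta ends up near the true latent mean of 1.5
```

Only the cross term contributes to the gradient because, with noise handled analytically, the model-model kernel depends on differences z_i - z_j = eps_i - eps_j, which do not involve theta; this keeps each SGD step cheap, consistent with the efficiency claim in the abstract.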