Computationally lightweight classifiers with frequentist bounds on predictions

arXiv stat.ML / 2026/3/24

💬 オピニオンSignals & Early TrendsIdeas & Deep AnalysisModels & Research

要点

  • The paper proposes a computationally efficient classifier that uses a Nadaraya–Watson estimator to make predictions with frequentist uncertainty intervals.
  • It argues that many high-accuracy classical and neural classifiers lack reliable uncertainty bounds, which limits their use in safety-critical settings.
  • Compared with existing kernel-based bounded classifiers that can scale around O(n^3), the proposed approach is evaluated to run in about O(n) and O(log n) time/operations for their setup.
  • Experiments on synthetic data and MIT-BIH arrhythmia ECG heartbeat signals show competitive accuracy above 96% while also producing actionable uncertainty bounds.
  • The authors highlight practical use cases such as flagging low-confidence predictions for real-time diagnostics and potentially resource-constrained medical devices like implantables.

Abstract

While both classical and neural network classifiers can achieve high accuracy, they fall short on offering uncertainty bounds on their predictions, making them unfit for safety-critical applications. Existing kernel-based classifiers that provide such bounds scale with \mathcal O (n^{\sim3}) in time, making them computationally intractable for large datasets. To address this, we propose a novel, computationally efficient classification algorithm based on the Nadaraya-Watson estimator, for whose estimates we derive frequentist uncertainty intervals. We evaluate our classifier on synthetically generated data and on electrocardiographic heartbeat signals from the MIT-BIH Arrhythmia database. We show that the method achieves competitive accuracy >\SI{96}{\percent} at \mathcal O(n) and \mathcal O(\log n) operations, while providing actionable uncertainty bounds. These bounds can, e.g., aid in flagging low-confidence predictions, making them suitable for real-time settings with resource constraints, such as diagnostic monitoring or implantable devices.