AI Navigate

A Universal Nearest-Neighbor Estimator for Intrinsic Dimensionality

arXiv cs.LG / 3/12/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces a universal intrinsic dimensionality estimator based on nearest-neighbor distance ratios with simple calculations.
  • It claims state-of-the-art performance across benchmark manifolds and real-world datasets.
  • The estimator is universal, provably converging to the true intrinsic dimensionality regardless of the data-generating distribution.
  • The work highlights limitations of existing methods that rely on geometric or distributional assumptions and demonstrates empirical results to validate the approach.

Abstract

Estimating the intrinsic dimensionality (ID) of data is a fundamental problem in machine learning and computer vision, providing insight into the true degrees of freedom underlying high-dimensional observations. Existing methods often rely on geometric or distributional assumptions and can significantly fail when these assumptions are violated. In this paper, we introduce a novel ID estimator based on nearest-neighbor distance ratios that involves simple calculations and achieves state-of-the-art results. Most importantly, we provide a theoretical analysis proving that our estimator is \emph{universal}, namely, it converges to the true ID independently of the distribution generating the data. We present experimental results on benchmark manifolds and real-world datasets to demonstrate the performance of our estimator.