Local Hessian Spectral Filtering for Robust Intrinsic Dimension Estimation

arXiv cs.LG / 5/5/2026

📰 NewsModels & Research

Key Points

  • The paper targets a key weakness of existing Local Intrinsic Dimension (LID) estimators for diffusion models: in high-dimensional settings, noise from normal directions can dominate the tangent signal.
  • It introduces Local Hessian Spectral Dimension (LHSD), which estimates intrinsic dimensionality by spectral-filtering the log-density Hessian and truncating large eigenvalues linked to normal directions.
  • LHSD is implemented via Stochastic Lanczos Quadrature (SLQ), avoiding expensive full Hessian construction while reportedly achieving linear scalability with the ambient dimension D.
  • Experiments on both synthetic and real datasets show LHSD is more robust than prior approaches and can help detect memorization behavior in large-scale diffusion models.

Abstract

While diffusion models enable new approaches for estimating Local Intrinsic Dimension (LID), existing methods fail in high-dimensional spaces where noise from vast normal directions overwhelms the tangent signal. We propose Local Hessian Spectral Dimension (LHSD), which resolves this by applying spectral filtering to the log-density Hessian, explicitly cutting off large eigenvalues associated with normal directions to count zero-curvature tangent directions. Implemented using Stochastic Lanczos Quadrature (SLQ), LHSD avoids full Hessian construction, achieving linear scalability with dimension D. Experiments on synthetic and real data confirm LHSD's superior robustness and its utility in detecting memorization in large-scale diffusion models.

Local Hessian Spectral Filtering for Robust Intrinsic Dimension Estimation | AI Navigate