Towards Robust and Scalable Density-based Clustering via Graph Propagation

arXiv cs.LG / 5/4/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces CluProp, a framework that reformulates varied-density clustering in high-dimensional spaces as label propagation over neighborhood graphs.
  • It aims to connect density-based clustering with graph connectivity in a principled way, reducing the parameter sensitivity that often affects traditional density-based methods.
  • CluProp uses a deterministic density-based propagation strategy to make neighborhood identification more scalable.
  • The method is distance-metric agnostic and reportedly achieves strong accuracy improvements over existing baselines, including processing millions of points in minutes.

Abstract

We present \textit{CluProp}, a novel framework that reimagines varied-density clustering in high-dimensional spaces as a label propagation process over neighborhood graphs. Our approach formally bridges the gap between density-based clustering and graph connectivity, leveraging efficient propagation mechanisms from network science to mitigate the parameter sensitivity inherent in traditional density-based methods. Specifically, we introduce a deterministic density-based propagation strategy to ensure scalable neighborhood identification. The framework is agnostic to the choice of distance metric and exhibits superior performance on large-scale data, processing millions of points in minutes while consistently outperforming existing baselines in accuracy.