A Super Fast K-means for Indexing Vector Embeddings

arXiv cs.LG / 3/23/2026

📰 NewsTools & Practical UsageModels & Research

共有:

Key Points

SuperKMeans is a k-means variant designed for clustering high-dimensional vector embeddings, achieving up to 7x faster CPU performance than FAISS and Scikit-Learn and up to 4x faster than cuVS on GPUs, while preserving the quality of the resulting centroids for retrieval tasks.
The acceleration comes from pruning dimensions that are not needed to assign a vector to a centroid, reducing data-access and compute overhead.
They introduce Early Termination by Recall, a mechanism that early-terminates k-means when the quality of the centroids for retrieval tasks stops improving across iterations, further reducing runtimes without compromising retrieval quality.
They open-source their implementation at https://github.com/cwida/SuperKMeans.

Abstract

We present SuperKMeans: a k-means variant designed for clustering collections of high-dimensional vector embeddings. SuperKMeans' clustering is up to 7x faster than FAISS and Scikit-Learn on modern CPUs and up to 4x faster than cuVS on GPUs (Figure 1), while maintaining the quality of the resulting centroids for vector similarity search tasks. SuperKMeans acceleration comes from reducing data-access and compute overhead by reliably and efficiently pruning dimensions that are not needed to assign a vector to a centroid. Furthermore, we present Early Termination by Recall, a novel mechanism that early-terminates k-means when the quality of the centroids for retrieval tasks stops improving across iterations. In practice, this further reduces runtimes without compromising retrieval quality. We open-source our implementation at https://github.com/cwida/SuperKMeans

Day 10: An AI Agent's Revenue Report — $29, 25 Products, 160 Tweets

Dev.to

Does Synthetic Data Generation of LLMs Help Clinical Text Mining?

Dev.to

What CVE-2026-25253 Taught Me About Building Safe AI Assistants

Dev.to

Krish Naik: AI Learning Path For 2026- Data Science, Generative and Agentic AI Roadmap

Dev.to

Day 52: Building vs Shipping — Why We Had 711 Commits and 0 Users

Dev.to

A Super Fast K-means for Indexing Vector Embeddings

Key Points

Abstract

Related Articles

Day 10: An AI Agent's Revenue Report — $29, 25 Products, 160 Tweets

Does Synthetic Data Generation of LLMs Help Clinical Text Mining?

What CVE-2026-25253 Taught Me About Building Safe AI Assistants

Krish Naik: AI Learning Path For 2026- Data Science, Generative and Agentic AI Roadmap

Day 52: Building vs Shipping — Why We Had 711 Commits and 0 Users

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer