The Mass Agreement Score: A Point-centric Measure of Cluster Size Consistency

arXiv stat.ML / 3/26/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes the Mass Agreement Score (MAS), a point-centric clustering metric for assessing cluster size uniformity while avoiding issues caused by unstable, non-fixed cluster labels.
  • MAS is bounded in [0, 1] and is designed to be stable, so partitions with only slight point assignment changes receive similar uniformity scores.
  • The method targets a key difficulty in clustering evaluation: label-count perturbations can make label-dependent metrics unreliable even when the underlying data distribution changes minimally.
  • MAS is constructed to provide “fragment robustness,” giving similar scores to partitions with similar bulk cluster structure while still detecting genuine redistribution of cluster mass.
  • The work is presented as a new arXiv announcement (v1), introducing MAS as a novel evaluation approach for clustering partitions that can be used to filter undesirable, size-dominant clusterings.

Abstract

In clustering, strong dominance in the size of a particular cluster is often undesirable, motivating a measure of cluster size uniformity that can be used to filter such partitions. A basic requirement of such a measure is stability: partitions that differ only slightly in their point assignments should receive similar uniformity scores. A difficulty arises because cluster labels are not fixed objects; algorithms may produce different numbers of labels even when the underlying point distribution changes very little. Measures defined directly over labels can therefore become unstable under label-count perturbations. I introduce the Mass Agreement Score (MAS), a point-centric metric bounded in [0, 1] that evaluates the consistency of expected cluster size as measured from the perspective of points in each cluster. Its construction yields fragment robustness by design, assigning similar scores to partitions with similar bulk structure while remaining sensitive to genuine redistribution of cluster mass.