Associativity-Peakiness Metric for Contingency Tables

arXiv cs.LG / 4/27/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces the Associativity Peakiness (AP) metric to evaluate clustering performance when results are represented as contingency tables.
  • It argues that existing metrics for vector-based truth/prediction pairs do not capture the detailed structural features visible in contingency tables.
  • The AP metric is presented as an analogue to confusion-matrix quality measures used in supervised learning.
  • Simulation results using 500 generated contingency tables across multiple scenarios indicate AP provides higher dynamic range than publicly available metrics.
  • The authors also report that AP is more computationally efficient than comparable existing metrics for this evaluation setting.

Abstract

For the use case of comparing the performance of clustering algorithms whose output is a contingency table, a single performance metric for contingency tables is needed. Such a metric is vital for comparative performance analysis of clustering algorithms. A survey of publicly available literature did not show the presence of such a metric. Metrics do exist for vector pairs of truth values and predicted values, which are an alternative form of output of clustering algorithms. However, the metrics for vector pairs do not reveal the presence of detailed features that are apparent in contingency tables. This paper presents the Associativity Peakiness (AP) metric, which characterizes aspects of clustering algorithm performance that are critical for predicting a clustering algorithm's performance when deployed. The AP metric is analogous to measures of quality for confusion matrices that are outputs of supervised learning algorithms. This paper presents results from simulations in which 500 contingency tables were generated for multiple test scenarios. The results show that for the use case of evaluating clustering algorithms, the AP metric characterizes performance of contingency tables with higher dynamic range than publicly available metrics, and that it is computationally more efficient than comparable publicly available metrics.