The Rashomon Effect for Visualizing High-Dimensional Data

arXiv cs.LG / 4/2/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper formalizes the “Rashomon set” for dimension reduction (DR), arguing that multiple distinct embeddings can be equally good while differing in geometry and layout.
  • It proposes PCA-informed alignment to make DR axes more interpretable while protecting local neighborhood structure.
  • It introduces concept-alignment regularization to align embedding dimensions with external signals like class labels or user-defined concepts.
  • It presents a way to derive shared, trustworthy structure across the Rashomon set by identifying persistent nearest-neighbor relationships to build refined embeddings.
  • Overall, the work frames DR visualization as an intentionally multi-solution problem to improve interpretability, robustness, and goal alignment.

Abstract

Dimension reduction (DR) is inherently non-unique: multiple embeddings can preserve the structure of high-dimensional data equally well while differing in layout or geometry. In this paper, we formally define the Rashomon set for DR -- the collection of `good' embedding -- and show how embracing this multiplicity leads to more powerful and trustworthy representations. Specifically, we pursue three goals. First, we introduce PCA-informed alignment to steer embeddings toward principal components, making axes interpretable without distorting local neighborhoods. Second, we design concept-alignment regularization that aligns an embedding dimension with external knowledge, such as class labels or user-defined concepts. Third, we propose a method to extract common knowledge across the Rashomon set by identifying trustworthy and persistent nearest-neighbor relationships, which we use to construct refined embeddings with improved local structure while preserving global relationships. By moving beyond a single embedding and leveraging the Rashomon set, we provide a flexible framework for building interpretable, robust, and goal-aligned visualizations.

The Rashomon Effect for Visualizing High-Dimensional Data | AI Navigate