ID-Sim: An Identity-Focused Similarity Metric

arXiv cs.CV / 4/8/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces ID-Sim, an identity-focused, feed-forward similarity metric intended to mirror humans’ ability to discern highly similar identities across changes in context such as viewpoints and lighting.
  • It builds ID-Sim using a curated, high-quality training dataset spanning diverse real-world domains, supplemented with generative synthetic data for controlled, fine-grained identity and contextual variations.
  • The authors propose evaluating ID-Sim on a new unified benchmark that tests alignment with human annotations across identity-focused recognition, retrieval, and generative tasks.
  • The work targets a key gap in personalized image generation and other identity-centric applications: the lack of evaluation metrics specifically designed for identity consistency.

Abstract

Humans have remarkable selective sensitivity to identities -- easily distinguishing between highly similar identities, even across significantly different contexts such as diverse viewpoints or lighting. Vision models have struggled to match this capability, and progress toward identity-focused tasks such as personalized image generation is slowed by a lack of identity-focused evaluation metrics. To help facilitate progress, we propose ID-Sim, a feed-forward metric designed to faithfully reflect human selective sensitivity. To build ID-Sim, we curate a high-quality training set of images spanning diverse real-world domains, augmented with generative synthetic data that provides controlled, fine-grained identity and contextual variations. We evaluate our metric on a new unified evaluation benchmark for assessing consistency with human annotations across identity-focused recognition, retrieval, and generative tasks.