Not All Subjectivity Is the Same! Defining Desiderata for the Evaluation of Subjectivity in NLP

arXiv cs.CL / March 31, 2026


Key Points

  • The paper argues that not all forms of subjectivity in NLP are equivalent and proposes seven evaluation desiderata tailored to subjectivity-sensitive models.
  • It frames the desiderata around how subjectivity appears in datasets and how models represent or generate it, with a focus on user-centric outcomes such as visibility of minority perspectives.
  • The authors review the experimental setups of 60 related papers and find several persistent research gaps, including insufficient study of ambiguous versus polyphonic inputs.
  • The review also identifies evaluation shortcomings, including insufficient attention to whether subjectivity is actually communicated to users and a lack of consideration of how the different desiderata interact with one another.

Abstract

Subjective judgments are part of many NLP datasets, and recent work increasingly prioritizes models whose outputs reflect this diversity of perspectives. Such responses allow us to shed light on minority voices, which are frequently marginalized or obscured by dominant perspectives. It remains an open question whether our evaluation practices align with these models' objectives. This position paper proposes seven evaluation desiderata for subjectivity-sensitive models, rooted in how subjectivity is represented in NLP data and models. The desiderata are constructed in a top-down approach, keeping in mind the user-centric impact of such models. We survey the experimental setups of 60 papers and show that various aspects of subjectivity remain understudied: the distinction between ambiguous and polyphonic input, whether subjectivity is effectively expressed to the user, and the lack of interplay between different desiderata, amongst other gaps.