A Japanese Benchmark for Evaluating Social Bias in Reasoning Based on Attribution Theory

arXiv cs.CL / 4/3/2026


Key Points

  • The paper argues that existing Japanese LLM social-bias benchmarks are often inadequate because they mainly translate English data, which can miss Japan-specific cultural context.
  • It introduces JUBAKU-v2, a Japanese dataset built on attribution theory to evaluate bias in the reasoning process (to whom behaviors are attributed, and who is blamed) while keeping the final conclusion fixed.
  • JUBAKU-v2 contains 216 examples designed to reflect cultural biases between in-groups and out-groups in Japan.
  • Experiments show the benchmark can differentiate model performance more sensitively than prior benchmarks, particularly for detecting bias patterns embedded in reasoning rather than only in outputs.
  • The work focuses on more fine-grained fairness evaluation for LLMs by capturing “hidden” bias signals during intermediate reasoning steps.

Abstract

In enhancing the fairness of Large Language Models (LLMs), evaluating social biases rooted in the cultural contexts of specific linguistic regions is essential. However, most existing Japanese benchmarks heavily rely on translating English data, which does not necessarily provide an evaluation suitable for Japanese culture. Furthermore, they only evaluate bias in the conclusion, failing to capture biases lurking in the reasoning. In this study, based on attribution theory in social psychology, we constructed a new dataset, "JUBAKU-v2," which evaluates the bias in attributing behaviors to in-groups and out-groups within reasoning while fixing the conclusion. This dataset consists of 216 examples reflecting cultural biases specific to Japan. Experimental results verified that it can detect performance differences across models more sensitively than existing benchmarks.