Language Bias under Conflicting Information in Multilingual LLMs

arXiv cs.CL / 4/9/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies whether multilingual LLMs exhibit biases in how they integrate conflicting information when different conflicting facts are provided in different languages.
  • Using an extended “conflicting needles in a haystack” setup across five languages and multiple multilingual LLM sizes (including GPT-5.2), the authors find most models largely ignore the conflict and confidently produce only one answer.
  • The researchers identify consistent cross-model language-preference effects, including a general bias against Russian and (at the longest context lengths) a bias toward Chinese.
  • The observed language-bias patterns hold for models trained both inside and outside mainland China, but are somewhat stronger for models trained inside mainland China.
  • Overall, the results suggest that multilingual context and training data can drive systematic failure modes in conflict resolution beyond the content itself.

Abstract

Large Language Models (LLMs) have been shown to contain biases in the process of integrating conflicting information when answering questions. Here we ask whether such biases also exist with respect to which language is used for each conflicting piece of information. To answer this question, we extend the conflicting needles in a haystack paradigm to a multilingual setting and perform a comprehensive set of evaluations with naturalistic news domain data in five different languages, for a range of multilingual LLMs of different sizes. We find that all LLMs tested, including GPT-5.2, ignore the conflict and confidently assert only one of the possible answers in the large majority of cases. Furthermore, there is a consistent bias across models in which languages are preferred, with a general bias against Russian and, for the longest context lengths, in favor of Chinese. Both of these patterns are consistent between models trained inside and outside of mainland China, though somewhat stronger in the former category.