Is Cross-Lingual Transfer in Bilingual Models Human-Like? A Study with Overlapping Word Forms in Dutch and English

arXiv cs.CL / 4/9/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

共有:

Key Points

The study investigates whether bilingual Dutch-English causal Transformer language models show cross-lingual activation patterns similar to human bilingual readers, focusing on cognates (friends) and interlingual homographs (false friends).
By training models under four vocabulary-sharing regimes that control whether shared (or false) friends receive shared vs language-specific embeddings, the researchers test cross-lingual effects using surprisal and embedding-similarity measures.
The models mostly preserve language separation, and cross-lingual effects emerge primarily when embeddings for overlapping forms are shared across languages.
When embeddings are shared, the models exhibit facilitation for both friends and false friends (relative to controls), but regression analyses suggest these behaviors are driven more by word frequency than by consistent form-to-meaning mapping.
The human-like qualitative pattern is reproduced only in the condition where embeddings are shared for friends but not for false friends, implying that the way lexical overlap is encoded limits the models’ adequacy as explanations for bilingual reading.

Abstract

Bilingual speakers show cross-lingual activation during reading, especially for words with shared surface form. Cognates (friends) typically lead to facilitation, whereas interlingual homographs (false friends) cause interference or no effect. We examine whether cross-lingual activation in bilingual language models mirrors these patterns. We train Dutch-English causal Transformers under four vocabulary-sharing conditions that manipulate whether (false) friends receive shared or language-specific embeddings. Using psycholinguistic stimuli from bilingual reading studies, we evaluate the models through surprisal and embedding similarity analyses. The models largely maintain language separation, and cross-lingual effects arise primarily when embeddings are shared. In these cases, both friends and false friends show facilitation relative to controls. Regression analyses reveal that these effects are mainly driven by frequency rather than consistency in form-meaning mapping. Only when just friends share embeddings are the qualitative patterns of bilinguals reproduced. Overall, bilingual language models capture some cross-linguistic activation effects. However, their alignment with human processing seems to critically depend on how lexical overlap is encoded, possibly limiting their explanatory adequacy as models of bilingual reading.

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Dev.to

Could it be that this take is not too far fetched?

Reddit r/LocalLLaMA

npm audit Is Broken — Here's the Claude Code Skill I Built to Fix It

Dev.to

Meta Launches Muse Spark: A New AI Model for Everyday Use

Dev.to

TurboQuant on a MacBook: building a one-command local stack with Ollama, MLX, and an automatic routing proxy

Dev.to

Is Cross-Lingual Transfer in Bilingual Models Human-Like? A Study with Overlapping Word Forms in Dutch and English

Key Points

Abstract

Related Articles

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.

Could it be that this take is not too far fetched?

npm audit Is Broken — Here's the Claude Code Skill I Built to Fix It

Meta Launches Muse Spark: A New AI Model for Everyday Use

TurboQuant on a MacBook: building a one-command local stack with Ollama, MLX, and an automatic routing proxy

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer