Scripts Through Time: A Survey of the Evolving Role of Transliteration in NLP
arXiv cs.CL / 4/22/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that transliteration helps overcome the “script barrier” that limits cross-lingual transfer in NLP by increasing lexical overlap between languages.
- It surveys and categorizes motivations for using transliteration in language models, and reviews multiple ways to incorporate transliteration as model input.
- The authors trace how transliteration methods have evolved over time and assess their effectiveness, highlighting key trade-offs that affect performance.
- The survey identifies practical scenarios where transliteration is especially useful, such as code-mixed text handling and exploiting language-family relatedness, with potential inference-efficiency benefits.
- It concludes with concrete, research-oriented recommendations for choosing and implementing transliteration strategies based on target language, task requirements, and resource constraints.


