Robust Language Identification for Romansh Varieties
arXiv cs.CL / 3/18/2026
📰 NewsTools & Practical UsageModels & Research
Key Points
- The paper introduces a language identification system for Romansh varieties (idioms) and Rumantsch Grischun using an SVM-based approach.
- It targets the challenging classification among Romansh idioms and a supra-regional variety, Rumantsch Grischun, as part of the problem.
- The model is evaluated on a newly curated benchmark across two domains and achieves an average in-domain accuracy of 97%.
- The classifier is publicly available and can enable applications such as idiom-aware spell checking or machine translation.




