Transfer Learning for an Endangered Slavic Variety: Dependency Parsing in Pomak Across Contact-Shaped Dialects
arXiv cs.CL / 3/31/2026
Key Points
- The paper introduces new research resources and baseline dependency-parsing experiments for Pomak, an endangered Eastern South Slavic language with strong dialect variation and limited standardization.
- It tests cross-dialect transfer by training a parser on the Pomak Universal Dependencies treebank primarily derived from the Greece variety and evaluating zero-shot performance on the Turkey (Uzunköprü) variety.
- The study quantifies how phonological and morphosyntactic differences between dialects affect parsing accuracy under zero-shot transfer.
- A new manually annotated corpus of 650 sentences in the Turkey variety of Pomak is introduced, and the authors show that targeted fine-tuning yields substantial accuracy gains even with this small dataset.
- Cross-variety transfer learning that combines data from both dialects improves performance further, beyond what fine-tuning alone achieves.
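
For readers unfamiliar with how dependency-parsing accuracy is usually measured, here is a minimal sketch of the two standard metrics: unlabeled attachment score (UAS) and labeled attachment score (LAS). The summary above does not say which metrics the paper reports, so using UAS/LAS is an assumption. The function name `attachment_scores` and the toy sentence are hypothetical illustrations and are not taken from the Pomak treebank.

```python
from typing import List, Tuple

# One token's analysis: (index of its head, dependency relation label).
Arc = Tuple[int, str]


def attachment_scores(gold: List[List[Arc]], pred: List[List[Arc]]) -> Tuple[float, float]:
    """Return (UAS, LAS) as fractions of all tokens.

    UAS counts tokens whose predicted head matches the gold head.
    LAS additionally requires the relation label to match.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must contain the same number of sentences")
    total = head_ok = labeled_ok = 0
    for g_sent, p_sent in zip(gold, pred):
        if len(g_sent) != len(p_sent):
            raise ValueError("gold and predicted sentences must have the same token count")
        for (g_head, g_rel), (p_head, p_rel) in zip(g_sent, p_sent):
            total += 1
            if g_head == p_head:
                head_ok += 1
                if g_rel == p_rel:
                    labeled_ok += 1
    if total == 0:
        raise ValueError("no tokens to score")
    return head_ok / total, labeled_ok / total


# Made-up three-token sentence: the parser attaches every token to the
# right head but mislabels the last relation.
gold = [[(2, "nsubj"), (0, "root"), (2, "obj")]]
pred = [[(2, "nsubj"), (0, "root"), (2, "nmod")]]
uas, las = attachment_scores(gold, pred)
print(f"UAS={uas:.3f} LAS={las:.3f}")
```

In a zero-shot setting like the one described, the same scoring would be applied to a parser trained on one variety and run unchanged on sentences from the other; comparing those scores before and after fine-tuning is one way to express the reported gains.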