Giving Voice to the Constitution: Low-Resource Text-to-Speech for Quechua and Spanish Using a Bilingual Legal Corpus
arXiv cs.AI / 4/16/2026
Key Points
- The paper introduces a unified, bilingual text-to-speech pipeline that generates high-quality Quechua and Spanish speech for Peru’s Constitution using XTTS v2, F5-TTS, and DiFlow-TTS.
- It trains the models on separate Spanish and Quechua speech datasets that differ in size and recording conditions, then draws on the models' bilingual and multilingual capabilities to improve output quality in both languages.
- Cross-lingual transfer is used to reduce the impact of Quechua data scarcity while maintaining naturalness in Spanish.
- The authors release trained checkpoints, inference code, and synthesized audio for each constitutional article, positioning the work as a reusable resource for indigenous and multilingual TTS.
- Overall, the research targets more inclusive speech technology for political and legal content in low-resource linguistic settings.
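
The per-article synthesis described above can be pictured as a batch of jobs, one per article, language, and model. The following is only a minimal sketch of that bookkeeping, not the authors' released inference code: the model names come from the summary, while the language codes, the output directory layout, the `SynthesisJob` type, and the `synthesize` callback are all hypothetical and stand in for whatever the real pipeline uses.

```python
from dataclasses import dataclass
from itertools import product
from pathlib import Path

# Model names are taken from the summary; everything else here is an assumption.
MODELS = ("xtts_v2", "f5_tts", "diflow_tts")
LANGUAGES = ("es", "qu")  # ISO 639-1 codes for Spanish and Quechua


@dataclass(frozen=True)
class SynthesisJob:
    """One unit of work: a single article rendered in one language by one model."""
    article_id: int
    language: str
    model: str
    text: str
    output_path: Path


def build_jobs(articles, out_dir):
    """Create one job per (article, language, model) combination.

    `articles` maps an article number to a dict of language code -> text.
    Articles lacking text in a language are skipped for that language.
    """
    jobs = []
    for article_id in sorted(articles):
        for language, model in product(LANGUAGES, MODELS):
            text = articles[article_id].get(language)
            if not text:
                continue
            # Hypothetical layout: <out_dir>/<model>/<language>/article_NNN.wav
            path = Path(out_dir) / model / language / f"article_{article_id:03d}.wav"
            jobs.append(SynthesisJob(article_id, language, model, text, path))
    return jobs


def run_jobs(jobs, synthesize):
    """Run each job through a caller-supplied `synthesize(job)` backend.

    `synthesize` is a placeholder for a real TTS call; it is expected to
    write audio to `job.output_path`. Returns the list of output paths.
    """
    written = []
    for job in jobs:
        job.output_path.parent.mkdir(parents=True, exist_ok=True)
        synthesize(job)
        written.append(job.output_path)
    return written
```

Keeping job construction separate from the backend call means the same list of jobs can be replayed against each model, which makes side-by-side comparison of the outputs straightforward.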