Controllable Spoken Dialogue Generation: An LLM-Driven Grading System for K-12 Non-Native English Learners
arXiv cs.AI / 4/27/2026
💬 OpinionTools & Practical UsageModels & Research
Key Points
- The paper proposes a proficiency-aligned framework that adapts LLM-generated spoken dialogues to K-12 non-native English learners, addressing problems caused by mismatches between model output and learner ability.
- It introduces a four-tier grading approach to precisely control lexical complexity, tailored to China’s national curriculum (CSE) as an example but intended to be adaptable to other educational standards.
- The core contribution is the DDPO algorithm (Diversity Driven Policy Optimization), a multi-turn, GRPO-based method aimed at maintaining dialogue diversity while jointly improving dialogue quality.
- Experiments report lower out-of-vocabulary rates and higher diversity, alongside improvements in conversational naturalness and pedagogical usefulness.
- The authors plan to open-source the models, data, and code, providing resources such as graded vocabulary lists and a multi-turn dialogue corpus for personalized English speaking practice.




