Utterance-Level Methods for Identifying Reliable ASR-Output for Child Speech
arXiv cs.CL / 4/23/2026
💬 OpinionModels & Research
Key Points
- The paper addresses high ASR error rates in applications using child speech by proposing methods to pre-identify which utterance-level ASR outputs are reliable.
- It introduces two utterance-level selection approaches: one tailored for reliable read speech and another for reliable dialogue speech.
- Experiments on English and Dutch datasets (with both baseline and fine-tuned ASR models) show that the best strategy achieves high precision (P > 97.4) for both speech types and both languages.
- The optimal selection strategy enables automatic selection of 21.0% to 55.9% of dialogue/read speech datasets while keeping utterance error rates low (UER < 2.6).
Related Articles

Trajectory Forecasts in Unknown Environments Conditioned on Grid-Based Plans
Dev.to

OpenAI Just Named It Workspace Agents. We Open-Sourced Our Lark Version Six Months Ago
Dev.to

GPT Image 2 Subject-Lock Editing: A Practical Guide to input_fidelity
Dev.to

GPT Image 2 vs DALL-E 3: What Actually Changed in OpenAI's New Image Model
Dev.to

AI Tutor for Science Students — Physics Chemistry Biology Solved by AI
Dev.to