Syn-TurnTurk: A Synthetic Dataset for Turn-Taking Prediction in Turkish Dialogues
arXiv cs.CL / 4/16/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper addresses turn-taking timing in Turkish voice chatbots, noting that relying on silence detection often leads to bot interruptions due to irregular human pauses.
- It introduces Syn-TurnTurk, a synthetic Turkish dialogue dataset generated with multiple Qwen LLMs to better reflect real exchanges, including overlaps and deliberate silences.
- The study evaluates both traditional and deep learning approaches for turn-taking prediction, reporting strong performance with BI-LSTM and an Ensemble (LR+RF) setup (accuracy 0.839, AUC 0.910).
- The authors argue the dataset can improve models’ ability to detect linguistic cues, which may lead to more natural human-machine interaction in Turkish.
- The work highlights a data-quality gap for Turkish turn-taking prediction and positions synthetic data as a practical route to address it for future research and model development.
Related Articles

Black Hat Asia
AI Business

Introducing Claude Opus 4.7
Anthropic News

AI traffic to US retailers rose 393% in Q1, and it’s boosting their revenue too
TechCrunch

Who Audits the Auditors? Building an LLM-as-a-Judge for Agentic Reliability
Dev.to

"Enterprise AI Cost Optimization: How Companies Are Cutting AI Infrastructure Sp
Dev.to