AppTek Call-Center Dialogues: A Multi-Accent Long-Form Benchmark for English ASR

arXiv cs.CL / 5/1/2026

📰 NewsSignals & Early TrendsModels & Research

共有:

Key Points

The AppTek Call-Center Dialogues dataset addresses a key gap in English ASR evaluation by providing spontaneous, role-played long-form call-center conversations with explicit multi-accent coverage.
The corpus spans 14 English accents and 16 service-oriented scenarios, specifically commissioned for evaluation with no prior public release of the audio or text to limit overlap with existing pretraining data.
The study benchmarks multiple open-source ASR systems and varies the segmentation approach to test how preprocessing choices affect recognition quality.
Findings show significant performance differences across accents and segmentation methods, demonstrating that strong results on general American English benchmarks may not transfer to other dialects.
Overall, the work provides a more realistic and robust benchmark for conversational AI use cases that require handling diverse speakers and longer dialogue contexts.

Abstract

Evaluating English ASR systems for conversational AI applications remains difficult, as many publicly available corpora are either pre-segmented into short segments, consist of read or prepared speech, or lack explicit dialect annotations to evaluate robustness for a diverse user base. This work presents the AppTek Call-Center Dialogues corpus, a collection of spontaneous, role-played agent-customer conversations spanning fourteen English accents covering sixteen service-oriented scenarios. The dataset was commissioned specifically for evaluation and none of the audio or text was publicly available prior to release, reducing the risk of overlap with existing large-scale pretraining corpora. We benchmark a set of open-source ASR systems under different segmentation approaches. Results show substantial variation across accents and segmentation methods, indicating that good performance on general American English benchmarks does not necessarily generalize to other accents.

Why Autonomous Coding Agents Keep Failing — And What Actually Works

Dev.to

Text-to-image is easy. Chaining LLMs to generate, critique, and iterate on images autonomously is a routing nightmare. AgentSwarms now supports Image generation playground and creative media workflows!

Reddit r/artificial

Announcing the NVIDIA Nemotron 3 Super Build Contest

Dev.to

75% of Sites Blocking AI Bots Still Get Cited. Here Is Why Blocking Does Not Work.

Dev.to

How to Fix OpenClaw Tool Calling Issues

Dev.to

AppTek Call-Center Dialogues: A Multi-Accent Long-Form Benchmark for English ASR

Key Points

Abstract

Related Articles

Why Autonomous Coding Agents Keep Failing — And What Actually Works

Text-to-image is easy. Chaining LLMs to generate, critique, and iterate on images autonomously is a routing nightmare. AgentSwarms now supports Image generation playground and creative media workflows!

Announcing the NVIDIA Nemotron 3 Super Build Contest

75% of Sites Blocking AI Bots Still Get Cited. Here Is Why Blocking Does Not Work.

How to Fix OpenClaw Tool Calling Issues

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer