Contextual Earnings-22: A Speech Recognition Benchmark with Custom Vocabulary in the Wild
arXiv cs.CL / 4/10/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that speech-to-text progress appears to be limited on academic benchmarks because they overrepresent common vocabulary, while real-world usability depends heavily on recognizing rare, context-specific custom terms.
- It introduces Contextual Earnings-22, a new open dataset derived from Earnings-22 that adds realistic custom vocabulary contexts to better measure real-world transcription performance.
- The authors provide six strong baseline models for two leading strategies—keyword prompting and keyword boosting—to support comparable research evaluation.
- Experiments indicate that both approaches achieve comparable accuracy and show significant gains when moving from small proof-of-concept setups to large-scale systems.
Related Articles

Black Hat Asia
AI Business

GLM 5.1 tops the code arena rankings for open models
Reddit r/LocalLLaMA

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

My Bestie Built a Free MCP Server for Job Search — Here's How It Works
Dev.to
can we talk about how AI has gotten really good at lying to you?
Reddit r/artificial