BlasBench: An Open Benchmark for Irish Speech Recognition

arXiv cs.CL / 4/14/2026



Key Points

BlasBench is an open Irish-specific ASR evaluation harness that includes Irish-aware text normalisation to preserve linguistic features like fadas, lenition, and eclipsis.
The benchmark evaluates 12 end-user ASR systems across four architecture families using Common Voice ga-IE and FLEURS ga-IE under a shared evaluation protocol.
All Whisper variants exceed 100% WER, meaning their transcripts contain more word errors than the references contain words, which shows how poorly current general-purpose models handle Irish speech.
The best open model, omniASR LLM 7B, achieves 30.65% WER on Common Voice and 39.09% WER on FLEURS, setting a new baseline for open Irish ASR.
A key finding is a cross-dataset generalisation gap: models fine-tuned on Common Voice lose 33–43 WER points on FLEURS, which can be missed when testing on a single dataset.
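
A WER above 100% can look like a reporting error, so a short sketch of the metric may help. The code below is a minimal word-level edit-distance implementation, not BlasBench's own scorer; the function name `word_error_rate` and the example strings are assumptions made for illustration. WER counts substitutions, deletions, and insertions against the number of reference words, so a hypothesis with many spurious words can push the ratio past 1.0.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Illustrative strings only: a two-word reference against a five-word
# hypothesis with no matching words gives 5 edits / 2 words.
print(word_error_rate("dia duit", "the a do it now"))  # 2.5, i.e. 250% WER
```

On the same scale, "WER points" are absolute differences between percentages: for omniASR LLM 7B, the reported 39.09% on FLEURS and 30.65% on Common Voice differ by 8.44 points.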

Abstract

No open Irish-specific benchmark compares end-user ASR systems under a shared Irish-aware evaluation protocol. To fill this gap, we release BlasBench, an open evaluation harness with Irish-aware text normalisation that preserves fadas, lenition, and eclipsis. We benchmark 12 systems across four architecture families on Common Voice ga-IE and FLEURS ga-IE. All Whisper variants exceed 100% WER. The best open model (omniASR LLM 7B) achieves 30.65% WER on Common Voice and 39.09% on FLEURS. Models fine-tuned on Common Voice lose 33–43 WER points on FLEURS, revealing a generalisation gap that is invisible to single-dataset evaluation.
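
To make "Irish-aware text normalisation" concrete, here is a rough sketch of what such a step might look like. It is not the BlasBench normaliser, whose rules are not given here; the function name `normalise_irish` and every rule in it are assumptions. The idea is to remove case and punctuation differences without ASCII-folding, because fadas and initial mutations carry meaning: "féar" (grass) differs from "fear" (man), and "a bhád", "a bád", and "a mbád" mean his, her, and their boat respectively.

```python
import re
import unicodedata

# Assumed rule: a prefixed n or t before a capitalised vowel ("nAthair")
# is written with a hyphen once lowercased ("n-athair").
_PREFIX_BEFORE_CAPITAL_VOWEL = re.compile(r"\b([nt])([AEIOUÁÉÍÓÚ])")
# Anything that is not a word character, whitespace, apostrophe, or hyphen.
_PUNCTUATION = re.compile(r"[^\w\s'-]")


def normalise_irish(text: str) -> str:
    """Hypothetical normaliser that keeps fadas, lenition, and eclipsis."""
    # Compose accents so "e" + combining acute becomes a single "é".
    text = unicodedata.normalize("NFC", text)
    # Unify typographic apostrophes, as in "d’ól" -> "d'ól".
    text = text.replace("\u2019", "'")
    # Keep the mutation prefix visible before the capital is lost.
    text = _PREFIX_BEFORE_CAPITAL_VOWEL.sub(r"\1-\2", text)
    text = text.lower()
    text = _PUNCTUATION.sub(" ", text)
    # Drop tokens with no letters or digits (for example a stray dash).
    tokens = [t for t in text.split() if any(c.isalnum() for c in t)]
    return " ".join(tokens)


print(normalise_irish("Ár nAthair, atá ar neamh!"))
print(normalise_irish("A bhád - a mbád."))
```

A generic normaliser that strips diacritics or lowercases naively would merge some of these forms, and a WER computed on its output could count a wrong Irish word as correct.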
