Abjad-Kids: An Arabic Speech Classification Dataset for Primary Education
arXiv cs.CL / 3/24/2026
📰 NewsSignals & Early TrendsModels & Research
Key Points
- The paper introduces Abjad-Kids, a new publicly planned Arabic speech dataset for kindergarten and primary education that targets learning alphabets, numbers, and colors for children aged 3–12.
- The dataset includes 46,397 controlled-recording audio samples spanning 141 classes, with standardized duration, sampling rate, and format to improve research consistency.
- To handle high similarity among Arabic phonemes and limited per-class data, the authors propose a hierarchical CNN-LSTM audio classification approach that uses two-stage grouping plus specialized classifiers.
- Experiments show that static linguistic-based grouping performs better than dynamic clustering-based grouping, and that CNN-LSTM models with data augmentation outperform traditional baselines and other deep learning approaches.
- The study reports persistent overfitting challenges despite augmentation and regularization, indicating that future expansion of the dataset is needed.
Related Articles

Black Hat Asia
AI Business
Top 5 LLM Gateway Alternatives After the LiteLLM Supply Chain Attack
Dev.to

Reliable Function Calling in Deeply Recursive Union Types: Fixing Qwen Models' Double-Stringify Bug
Dev.to

5 Real Issues With LiteLLM That Are Pushing Teams Away in 2026
Dev.to

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to