Tadabur: A Large-Scale Quran Audio Dataset

arXiv cs.AI / 4/22/2026

Key Points

The paper introduces Tadabur, a large-scale Quran audio dataset designed to overcome limitations of existing Quran datasets in both size and diversity.
Tadabur includes 1,400+ hours of recitation audio recorded from 600+ distinct reciters, capturing wide variation in recitation styles, vocal traits, and recording conditions.
The dataset is intended to provide a more comprehensive and representative resource for research and analysis of Quranic speech.
By expanding both dataset duration and variability, Tadabur aims to enable future studies and support the creation of standardized Quranic speech benchmarks.

Abstract

Despite growing interest in Quranic data research, existing Quran datasets remain limited in both scale and diversity. To address this gap, we present Tadabur, a large-scale Quran audio dataset. Tadabur comprises more than 1,400 hours of recitation audio from over 600 distinct reciters, providing substantial variation in recitation styles, vocal characteristics, and recording conditions. This diversity makes Tadabur a comprehensive and representative resource for Quranic speech research and analysis. By significantly expanding both the total duration and variability of available Quran data, Tadabur aims to support future research and facilitate the development of standardized Quranic speech benchmarks.
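
The headline figures in the abstract (total hours and number of distinct reciters) are the kind of statistics one would typically recompute from a dataset manifest. The following is a minimal sketch under stated assumptions: the manifest rows, the reciter IDs, and the `summarize` helper are all hypothetical, since the summary above does not describe Tadabur's actual file layout or metadata fields.

```python
from collections import defaultdict

# Hypothetical manifest rows: (reciter_id, clip_duration_seconds).
# These values are illustrative only; they are not taken from Tadabur.
manifest = [
    ("reciter_001", 5400.0),
    ("reciter_001", 1800.0),
    ("reciter_002", 3600.0),
    ("reciter_003", 900.0),
]


def summarize(rows):
    """Return (total_hours, reciter_count, hours_by_reciter) for a manifest."""
    seconds_by_reciter = defaultdict(float)
    for reciter_id, seconds in rows:
        # Accumulate clip durations per reciter.
        seconds_by_reciter[reciter_id] += seconds
    # Convert per-reciter totals from seconds to hours.
    hours_by_reciter = {r: s / 3600.0 for r, s in seconds_by_reciter.items()}
    total_hours = sum(hours_by_reciter.values())
    return total_hours, len(hours_by_reciter), hours_by_reciter


total_hours, reciter_count, hours_by_reciter = summarize(manifest)
print(f"{total_hours:.2f} hours from {reciter_count} reciters")
```

Grouping by reciter before summing also exposes how evenly the audio is spread across speakers, which matters when diversity is one of the dataset's stated goals.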
