AnimeScore: A Preference-Based Dataset and Framework for Evaluating Anime-Like Speech Style
arXiv cs.CL / 3/13/2026
💬 OpinionTools & Practical UsageModels & Research
Key Points
- AnimeScore proposes a preference-based framework using pairwise ranking to evaluate anime-like speech, addressing the lack of an absolute MOS-style metric.
- The study collects 15,000 pairwise judgments from 187 evaluators with free-form descriptions to capture nuanced perceptions beyond simple pitch cues.
- Acoustic analysis reveals that anime-likeness arises from controlled resonance shaping, prosodic continuity, and deliberate articulation rather than high pitch alone.
- Handcrafted features reach 69.3% AUC, while SSL-based ranking models achieve up to 90.8% AUC, indicating strong predictive performance and potential as a reward signal for generative speech optimization.
- The dataset and framework provide a practical metric for evaluating anime-like speech and can guide improvements in anime-style voice generation.




