AnimeScore: A Preference-Based Dataset and Framework for Evaluating Anime-Like Speech Style
arXiv cs.CL / 3/13/2026
💬 OpinionTools & Practical UsageModels & Research
Key Points
- AnimeScore proposes a preference-based framework using pairwise ranking to evaluate anime-like speech, addressing the lack of an absolute MOS-style metric.
- The study collects 15,000 pairwise judgments from 187 evaluators with free-form descriptions to capture nuanced perceptions beyond simple pitch cues.
- Acoustic analysis reveals that anime-likeness arises from controlled resonance shaping, prosodic continuity, and deliberate articulation rather than high pitch alone.
- Handcrafted features reach 69.3% AUC, while SSL-based ranking models achieve up to 90.8% AUC, indicating strong predictive performance and potential as a reward signal for generative speech optimization.
- The dataset and framework provide a practical metric for evaluating anime-like speech and can guide improvements in anime-style voice generation.
Related Articles

Astral to Join OpenAI
Dev.to

I Built a MITM Proxy to See What Claude Code Actually Sends to Anthropic
Dev.to

Your AI coding agent is installing vulnerable packages. I built the fix.
Dev.to

ChatGPT Prompt Engineering for Freelancers: Unlocking Efficient Client Communication
Dev.to

PearlOS. We gave swarm intelligence a local desktop environment and code control to self-evolve. Has been pretty incredible to see so far. Open source and free if you want your own.
Reddit r/LocalLLaMA