New TTS Model: VoxCPM2

Reddit r/LocalLLaMA / 4/9/2026

📰 News | Signals & Early Trends | Models & Research

Key Points

VoxCPM2 is a new text-to-speech (TTS) model that supports three speech-generation modes: Voice Design, Controllable Cloning, and Ultimate Cloning via audio continuation.
The project provides a live demo on Hugging Face (VoxCPM-Demo) and an official model page for VoxCPM2.
VoxCPM2 reports state-of-the-art or competitive performance across major zero-shot and controllable TTS benchmarks.
Benchmark results are referenced via the associated GitHub repository, including Seed-TTS-eval, CV3-eval, InstructTTSEval, and MiniMax Multilingual Test.

VoxCPM2 — Three Modes of Speech Generation:

🎨 Voice Design — Create a brand-new voice

🎛️ Controllable Cloning — Clone a voice with optional style guidance

🎙️ Ultimate Cloning — Reproduce every vocal nuance through audio continuation

Demo

https://huggingface.co/spaces/openbmb/VoxCPM-Demo

Performance

VoxCPM2 achieves state-of-the-art or competitive results on major zero-shot and controllable TTS benchmarks.

See the GitHub repo for full benchmark tables (Seed-TTS-eval, CV3-eval, InstructTTSEval, MiniMax Multilingual Test).

Model page: https://huggingface.co/openbmb/VoxCPM2

submitted by /u/foldl-li
