AI Navigate

インサイトインサイト最新記事最新記事一覧 AI大全AI大全カオスマップAIカオスマップ

mtmd: qwen3 audio support (qwen3-omni and qwen3-asr)

Reddit r/LocalLLaMA / 4/13/2026

📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsTools & Practical Usage

Read original →

共有:

Key Points

The post reports working audio support for Qwen3 variants in llama.cpp, specifically qwen3-omni (with vision + audio input) and qwen3-asr.
Functionality is demonstrated via an implementation referenced as a llama.cpp pull request, indicating the feature is being integrated upstream.
The update targets local/bring-your-own-model workflows (“LocalLLaMA”), enabling developers to experiment with multimodal audio capabilities on-device.
It suggests improving readiness for real-time or interactive audio-to-understanding pipelines using Qwen3-based models in the llama.cpp ecosystem.

mtmd: qwen3 audio support (qwen3-omni and qwen3-asr)

qwen3-omni-moe working (vision + audio input)
qwen3-asr working

submitted by /u/jacek2023
[link] [comments]

Related Articles

Black Hat USA

Black Hat USA

AI Business

Black Hat Asia

Black Hat Asia

AI Business

v0.20.6

v0.20.6

Ollama Releases

Are Data Centers Sitting On A Goldmine Of Wasted Energy?

Are Data Centers Sitting On A Goldmine Of Wasted Energy?

Reddit r/artificial

Anthropic Launches Project Glasswing for AI Security

Dev.to

関連おすすめサービス

※当サイトはアフィリエイト広告を利用しています

Notta搭載AI議事録イヤホン ZENCHORD1

AI時代の仕事術。Notta搭載で会議の議事録を自動生成するスマートイヤホン。

AI搭載ボイスレコーダー Plaud

世界100万人が愛用。AIで文字起こし・要約を自動化するボイスレコーダー。

画像高画質化AIツール Aiarty Image Enhancer

AIで画像を高画質化。写真・イラストを簡単にアップスケール。