AI Navigate

GATED_DELTA_NET for vulkan merged in llama.cpp

Reddit r/LocalLLaMA / 3/13/2026

📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

Read original →

共有:

Key Points

GATED_DELTA_NET for Vulkan has been merged into llama.cpp and is available in the latest release via PR 20334.
On AMD RX7800XT systems running Fedora Linux, the change yields a notable performance boost, increasing Qwen 3.5 27B token generation speed from about 28t/s to about 36t/s.
The improvement comes from the gated delta-net for Vulkan implementation integrated into llama.cpp.
This update can enhance Vulkan-based inference workloads for users deploying Llama-derived models, potentially improving throughput and efficiency.

https://github.com/ggml-org/llama.cpp/pull/20334
It would be already in the latest release.

There is a performance boost in my AMD RX7800XT setup (Fedora Linux).
For Qwen 3.5 27B, token generation was ~28t/s.
It is now ~36t/s.

submitted by /u/FancyImagination880
[link] [comments]

Related Articles

NVIDIA、GTC 2026で次世代AI基盤を発表「Vera Rubin」を軸にエージェント・ゲーム・宇宙領域へ展開のサムネイル画像

NVIDIA、GTC 2026で次世代AI基盤を発表「Vera Rubin」を軸にエージェント・ゲーム・宇宙領域へ展開のサムネイル画像

Ledge.ai

1Password、AIエージェントのアクセス制御を統合管理する「Unified Access」発表人間・マシン・AIの資格情報を一元統制のサムネイル画像

1Password、AIエージェントのアクセス制御を統合管理する「Unified Access」発表人間・マシン・AIの資格情報を一元統制のサムネイル画像

Ledge.ai

『モンドーモンドー』｜夏目龍頭流闇文学｜AI画像生成｜自由詩｜散文詩｜ホラー｜ダークファンタジー｜深淵図書館

『モンドーモンドー』｜夏目龍頭流闇文学｜AI画像生成｜自由詩｜散文詩｜ホラー｜ダークファンタジー｜深淵図書館

note

報告：LLMにおける「自己言及的再帰」と「ステートフル・エミュレーション」の観測

報告：LLMにおける「自己言及的再帰」と「ステートフル・エミュレーション」の観測

note

「お金、見直したいけどどこから？」AIが改善ヒントを教えてくれる、公式プロンプトを公開

「お金、見直したいけどどこから？」AIが改善ヒントを教えてくれる、公式プロンプトを公開

note

関連おすすめサービス

※当サイトはアフィリエイト広告を利用しています

Notta搭載AI議事録イヤホン ZENCHORD1

AI時代の仕事術。Notta搭載で会議の議事録を自動生成するスマートイヤホン。

AI搭載ボイスレコーダー Plaud

世界100万人が愛用。AIで文字起こし・要約を自動化するボイスレコーダー。

画像高画質化AIツール Aiarty Image Enhancer

AIで画像を高画質化。写真・イラストを簡単にアップスケール。