AI Navigate

v0.17.1

vLLM Releases / 3/11/2026

📰 NewsDeveloper Stack & Infrastructure

Read original →

共有:

Key Points

This release, v0.17.1, is a patch update to the previous v0.17.0 version, focusing on fixing multiple issues identified in the codebase.
It addresses bugs related to the passing of activation_type in trtllm fused MoE implementations for NVFP4 and FP8 precisions.
The patch restores support for nongated fused MoE Triton setups and re-enables EP support for the trtllm MoE FP8 backend.
Additional fixes include GPU cache management improvements for Mamba and Qwen3.5 models, and optimizations in indexer handling for DSV3.2 and MTP components.
These targeted fixes enhance stability and performance for users leveraging trtllm fused MoE and related machine learning infrastructure components.

This is a patch release on top of v0.17.0 to address a few issues:

Fix passing of activation_type to trtllm fused MoE NVFP4 and FP8 (#36017)
Fix/resupport nongated fused moe triton (#36412)
Re-enable EP for trtllm MoE FP8 backend (#36494)
[Mamba][Qwen3.5] Zero freed SSM cache blocks on GPU (#35219)
Fix TRTLLM Block FP8 MoE Monolithic (#36296)
[DSV3.2][MTP] Optimize Indexer MTP handling (#36723)

Related Articles

Why Regex is Not Enough: Building a Deterministic "Sudo" Layer for AI Agents

Why Regex is Not Enough: Building a Deterministic "Sudo" Layer for AI Agents

Dev.to

I Built a Full-Stack App in 5 Minutes with 8080.ai — Here's How

I Built a Full-Stack App in 5 Minutes with 8080.ai — Here's How

Dev.to

I Shipped 6 Developer Tools in One Day Using an AI Agent Fleet

I Shipped 6 Developer Tools in One Day Using an AI Agent Fleet

Dev.to

Workflow Builders vs AI Agents: 5 Automation Tools Compared (2026)

Workflow Builders vs AI Agents: 5 Automation Tools Compared (2026)

Dev.to

Let AI Control Your Real Browser — Not a Throwaway One

Let AI Control Your Real Browser — Not a Throwaway One

Dev.to

関連おすすめサービス

※当サイトはアフィリエイト広告を利用しています

Notta搭載AI議事録イヤホン ZENCHORD1

AI時代の仕事術。Notta搭載で会議の議事録を自動生成するスマートイヤホン。

AI搭載ボイスレコーダー Plaud

世界100万人が愛用。AIで文字起こし・要約を自動化するボイスレコーダー。

画像高画質化AIツール Aiarty Image Enhancer

AIで画像を高画質化。写真・イラストを簡単にアップスケール。