NVIDIA admits to only 2x performance boost at max throughput with new generation of Rubin GPUs

Reddit r/LocalLLaMA / 3/17/2026

📰 NewsIndustry & Market Moves

共有:

Key Points

NVIDIA reportedly acknowledges that Rubin GPUs deliver roughly a 2x throughput gain at max throughput, despite higher memory bandwidth and FP4 performance claims.
The post notes the need for apples-to-apples benchmarking, rather than comparing chips with very different VRAM configurations.
With 1000W TDP for the B200 versus 2300W for the R200, Rubin would use about 2.3x the power to achieve roughly 2x the performance, raising efficiency questions.
Since most production deployments run at max throughput, the real-world impact may be limited and could influence purchasing decisions.

NVIDIA admits to only 2x performance boost at max throughput with new generation of Rubin GPUs

NVIDIA admits to only 2x performance boost from Rubin at max throughput, which is what 99% of companies are running in production anyway. No more sandbagging comparing chips with 80GB vram to 288GB vram. They're forced to compare apples for apples. Despite Rubin having almost 3x the memory bandwidth and apparently 5x the FP4 perf, that results in only 2x the output throughput.

At 1000W TDP for B200 vs 2300W R200.

So you're using 2.3x the power per GPU to get 2x performance.

Not really efficient, is it?

submitted by /u/bigboyparpa
[link] [comments]

NVIDIA、GTC 2026で次世代AI基盤を発表「Vera Rubin」を軸にエージェント・ゲーム・宇宙領域へ展開のサムネイル画像

Ledge.ai

1Password、AIエージェントのアクセス制御を統合管理する「Unified Access」発表人間・マシン・AIの資格情報を一元統制のサムネイル画像

Ledge.ai

フィジカルAIニュース(2026/3/19号)

note

さらば僕らの元住吉さん！AI漫画『僕は建築ができない』最終話！！

note

国内AIエージェント動向(2026/3/19号)

note

NVIDIA admits to only 2x performance boost at max throughput with new generation of Rubin GPUs

Key Points

Related Articles

NVIDIA、GTC 2026で次世代AI基盤を発表「Vera Rubin」を軸にエージェント・ゲーム・宇宙領域へ展開のサムネイル画像

1Password、AIエージェントのアクセス制御を統合管理する「Unified Access」発表人間・マシン・AIの資格情報を一元統制のサムネイル画像

フィジカルAIニュース(2026/3/19号)

さらば僕らの元住吉さん！AI漫画『僕は建築ができない』最終話！！

国内AIエージェント動向(2026/3/19号)

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer

Key Points

Related Articles

NVIDIA、GTC 2026で次世代AI基盤を発表 「Vera Rubin」を軸にエージェント・ゲーム・宇宙領域へ展開のサムネイル画像

1Password、AIエージェントのアクセス制御を統合管理する「Unified Access」発表 人間・マシン・AIの資格情報を一元統制のサムネイル画像

フィジカルAIニュース(2026/3/19号)

さらば僕らの元住吉さん！AI漫画『僕は建築ができない』最終話！！

国内AIエージェント動向(2026/3/19号)

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer

NVIDIA、GTC 2026で次世代AI基盤を発表「Vera Rubin」を軸にエージェント・ゲーム・宇宙領域へ展開のサムネイル画像

1Password、AIエージェントのアクセス制御を統合管理する「Unified Access」発表人間・マシン・AIの資格情報を一元統制のサムネイル画像