AI Navigate

M5 Max 128GB with three 120B models

Reddit r/LocalLLaMA / 3/19/2026

💬 Opinion · Tools & Practical Usage · Models & Research

Key Points

  • The post compares three 120B-scale language models (Nemotron-3 Super, GPT-OSS 120B, and Qwen3.5 122B) on quality and speed.
  • Nemotron-3 Super is slightly higher in quality than GPT-OSS 120B, but GPT-OSS 120B is about twice as fast.
  • GPT-OSS 120B achieves roughly 77 t/s, while Nemotron-3 Super and Qwen3.5 122B run around 35 t/s.
  • Overall quality ranking is Nemotron-3 Super > GPT-OSS 120B > Qwen3.5 122B, implying trade-offs between speed and fidelity for practical use.
  • Quantizations used: Nemotron-3 Super (Q4_K_M), GPT-OSS 120B (MXFP4), Qwen3.5 122B (Q4_K_M).
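
To put the throughput gap in concrete terms, a quick back-of-the-envelope calculation turns the poster's tokens-per-second figures into wall-clock time for a response. The response length is a hypothetical assumption; the t/s numbers are the poster's own measurements on the M5 Max 128GB, not independently verified.

```python
# Convert reported generation speeds (tokens/sec) into time for a fixed response.
# Speeds are the poster's figures; the 1000-token length is an assumption.
speeds = {
    "Nemotron-3 Super (Q4_K_M)": 35.0,
    "GPT-OSS 120B (MXFP4)": 77.0,
    "Qwen3.5 122B (Q4_K_M)": 35.0,
}

n_tokens = 1000  # hypothetical response length
for model, tps in speeds.items():
    # seconds = tokens / (tokens per second)
    print(f"{model}: {n_tokens / tps:.1f} s for {n_tokens} tokens")
```

At these rates, a 1000-token answer takes about 13 s on GPT-OSS 120B versus roughly 28.6 s on the other two, which is the practical shape of the speed-vs-quality trade-off the post describes.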

Overall:

  • Nemotron-3 Super > GPT-OSS 120B > Qwen3.5 122B
  • Quality-wise: Nemotron-3 Super is slightly better than GPT-OSS 120B, but GPT-OSS 120B is twice as fast.
  • Speed-wise: GPT-OSS 120B is roughly twice as fast as the other two, at ~77 t/s vs ~35 t/s.
submitted by /u/albertgao