Don't sleep on the new Nemotron Cascade

Reddit r/LocalLLaMA / 3/22/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

共有:

Key Points

The Nemotron Cascade 2 30B-A3B is a new hybrid model from Nemotron, not based on the Qwen architecture.
In early evaluations, Cascade 2 achieved 97.6% on HumanEval, outperforming medium Qwen3.5 models.
It also scored 88% on ClassEval, indicating strong performance on classification-style benchmarks.
The author used HumanEval + ClassEval along with IQ4_XS quant for assessment and believes Cascade 2 deserves more attention and further testing.

While there has been a lot of discussion regarding the Nemotron Super family of models, I feel like the newest addition, the Nemotron Cascade 2 30B-A3B (which is *not* based on the Qwen architecture despite a similar size, it's a properly hybrid model based on Nemotron's own arch) has largely flown under the radar.

I've been running some evals on local models lately since I'm kind of tired of the "vibe feels" method of judging them. A combo that I quite like is HumanEval + ClassEval, simply because they're quick to run and complicated enough for most small models to still have noticeable differences. So, I gave mradermacher's IQ4_XS quant for a spin.

On HumanEval, Cascade 2 achieved a whopping 97.6%, leaving both medium Qwen3.5 models in the rear window. Similarly, it obtained a respectable 88% on ClassEval.

I'm going to run some more tests on this model, but I feel it deserves a bit more attention.

submitted by /u/ilintar
[link] [comments]

💡 Insights using this article

This article is featured in our daily AI news digest — key takeaways and action items at a glance.

📅 3/22DailyView insight →

I Was Wrong About AI Coding Assistants. Here's What Changed My Mind (and What I Built About It).

Dev.to

Interesting loop

Reddit r/LocalLLaMA

Qwen3.5-122B-A10B Uncensored (Aggressive) — GGUF Release + new K_P Quants

Reddit r/LocalLLaMA

A supervisor or "manager" Al agent is the wrong way to control Al

Reddit r/artificial

FeatherOps: Fast fp8 matmul on RDNA3 without native fp8

Reddit r/LocalLLaMA

Don't sleep on the new Nemotron Cascade

Key Points

💡 Insights using this article

Related Articles

I Was Wrong About AI Coding Assistants. Here's What Changed My Mind (and What I Built About It).

Interesting loop

Qwen3.5-122B-A10B Uncensored (Aggressive) — GGUF Release + new K_P Quants

A supervisor or "manager" Al agent is the wrong way to control Al

FeatherOps: Fast fp8 matmul on RDNA3 without native fp8

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer