Nemotron 3 Super 120b Claude Distilled

Reddit r/LocalLLaMA / 3/19/2026

📰 NewsTools & Practical UsageModels & Research

共有:

Key Points

Nemotron 3 Super-120B Claude distilled has been released in beta by user /u/ghgi_ on Reddit, distilled from the Claude 4.6 Opus Reasoning dataset.
The beta includes about 2.3K examples from the 3000x dataset, with a planned V2 that will include more data once funding allows.
It is available in BF16, FP8, and GGUF formats (Q4_K_M + Q8_0) via separate HuggingFace model cards for each precision.
The poster invites feedback on performance and whether the model has been lobotomized, and links to the model pages and related discussions.

Hello everyone, Just wanted to post my V1 iteration of Nemotron 3 super 120B distilled from the 4.6 3000x dataset.

This is a beta for the most part only, ~2.3K examples so far from the 3000x dataset. Planning a V2 with more data just can't afford it right now. Would love to hear results and suggestions, in some quick tests it seemed like it worked but let me know if I lobotomized it or not.

Available in BF16, FP8, and GGUF (Q4_K_M + Q8_0)
https://huggingface.co/blobbybob/Nemotron-3-Super-120B-A12B-BF16-Claude-4.6-Opus-Reasoning-Distilled
https://huggingface.co/blobbybob/Nemotron-3-Super-120B-A12B-FP8-Claude-4.6-Opus-Reasoning-Distilled
https://huggingface.co/blobbybob/Nemotron-3-Super-120B-A12B-GGUF-Claude-4.6-Opus-Reasoning-Distilled

submitted by /u/ghgi_
[link] [comments]