Hello everyone, Just wanted to post my V1 iteration of Nemotron 3 super 120B distilled from the 4.6 3000x dataset.
This is a beta for the most part only, ~2.3K examples so far from the 3000x dataset. Planning a V2 with more data just can't afford it right now. Would love to hear results and suggestions, in some quick tests it seemed like it worked but let me know if I lobotomized it or not.
Available in BF16, FP8, and GGUF (Q4_K_M + Q8_0)
https://huggingface.co/blobbybob/Nemotron-3-Super-120B-A12B-BF16-Claude-4.6-Opus-Reasoning-Distilled
https://huggingface.co/blobbybob/Nemotron-3-Super-120B-A12B-FP8-Claude-4.6-Opus-Reasoning-Distilled
https://huggingface.co/blobbybob/Nemotron-3-Super-120B-A12B-GGUF-Claude-4.6-Opus-Reasoning-Distilled
[link] [comments]


