AI Navigate

Qwen3.5 122B INT4 Heretic/Uncensored (and some fun notes)

Reddit r/LocalLLaMA / 3/16/2026

💬 OpinionDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

Key Points

  • The post introduces the Qwen3.5-122B INT4 'Heretic' model and points to a Hugging Face release, noting the weights were tampered and referencing a model card for details.
  • The author describes a small cluster of two ASUS Ascents yielding about 256GB of RAM to support multiple environments.
  • They name and describe several environments—Bare Metal, The Scout, The Genji Glove, The Cardinal, and The Heretic—each with different roles and scale across the cluster.
  • They warn that the interconnect cable is hard to source and share that they ordered one from a supplier named naddod and received guidance.
  • Beyond tinkering, the post frames hardware experimentation as a fun path to accelerate AI learning in 2026.
Qwen3.5 122B INT4 Heretic/Uncensored (and some fun notes)

Hi y'all,

Here is the model: happypatrick/Qwen3.5-122B-A10B-heretic-int4-AutoRound

Been working for decades in software engineering. Never have had this much fun though, love the new dimension to things. Glad I finally found a hobby, and that's making 2026 look better!

Let's go. I got a cluster of ASUS Ascents:

https://preview.redd.it/4yzt9mc7qapg1.png?width=640&format=png&auto=webp&s=33cdbc5b7f20e3b6af01bd45a1b577752947e5cb

DGX Spark guts

Why? Because I am terrible with personal finance. Also, if you want to immerse yourself in AI, make an outrageous purchase on hardware to increase the pressure of learning things.

The 2 of them combined give me ~256GB of RAM to play with. Came up with some operating environments I like:

  • Bare Metal: I use this when I'm trying to tune models or mess around in Jupyter Notebooks. I turn all unnecessary models off. This is my experimentation/learning/science environment.
  • The Scout: I use the Qwen3.5 27B dense and intense. It does fantastic coding work for me in a custom harness. I spread it out on the cluster.
  • The Genji Glove: I dual wield the Qwen3.5 27B and the Qwen3.5 35B. It's when I like to party, 35B is fast and 27B is serious, we get stuff done. They do NOT run across the cluster; they get separate nodes.
  • The Cardinal: The Qwen3.5 122B INT4. Very smart, great for all-around agent usage. With the right harness, it slaps. Yeah, it fucking slaps, deal with that statement. This goes across the cluster.
  • The Heretic: The new guy! My first quantization! That's the link at the top. It goes across the cluster and it's faster than The Cardinal! Qwen3.5 122B, but the weights were tampered with,see the model card for details.

*If you are feeling like getting a cluster, understand that the crazy cable that connects them together is trippy. It's really hard to find. Not an ad, but I ordered one from naddod, and they even wrote me and told me, "close, but we think you don't know what you are doing, here is the cable you are looking for." And they were right. Good folks.

**Lastly, unnecessary opinion block: When trying to use a model for coding locally, it's kind of like basketball shoes. I mean, Opus 4.6 is like Air Jordans and shit, but I bet you I will mess up you and your whole crew with my little Qwens. Skill level matters, remember to learn what you are doing! I say this jokingly, just want to make sure the kids know to still study and learn this stuff. It's not magic, it's science, and it's fun.

Ask me any questions if you'd like, I've had these machines for a few months now and have been having a great time. I will even respond as a human, because I also think that's cool, instead of giving you AI slop. Unless you ask a lot of questions, and then I'll try to "write" things through AI and tell it "sound like me" and you will all obviously know I used AI. In fact, I still used AI on this, because serious, the formatting, spelling, and grammar fixes... thank me later.

submitted by /u/Ok-Treat-3016
[link] [comments]