Is there a DFlash draft model compatible with Qwen3.6 27B yet?

Reddit r/LocalLLaMA / 4/25/2026

💬 OpinionDeveloper Stack & InfrastructureSignals & Early TrendsTools & Practical Usage

Key Points

  • The post asks whether a DFlash draft model designed for Qwen3.5 27B can be used compatibly with Qwen3.6 27B.
  • The author reports that the attempted combination in oMLX worked but resulted in worse PP speed than expected.
  • The discussion is framed around practical compatibility and performance trade-offs when mixing draft and base models.
  • The author’s experience suggests that even small version changes in Qwen models may affect draft-model effectiveness or throughput.
  • Overall, the post seeks guidance from the community on supported draft-model pairings for newer Qwen releases.

Title.

I have the draft for Qwen3.5 (not 3.6) 27B, would it be compatible? I tried this combination in oMLX and PP speed is actually much worse .

submitted by /u/butterfly_labs
[link] [comments]