In case it helps anyone I'm sharing the config I am using for Qwen3-5-122B-A10B-NVFP4 deployed on a single 6000 Pro.
https://github.com/ian-hailey/vllm-docker-Qwen3-5-122B-A10B-NVFP4
[link] [comments]
Reddit r/LocalLLaMA / 3/22/2026
In case it helps anyone I'm sharing the config I am using for Qwen3-5-122B-A10B-NVFP4 deployed on a single 6000 Pro.
https://github.com/ian-hailey/vllm-docker-Qwen3-5-122B-A10B-NVFP4
Dev.to
LiteLLM Releases
Dev.to
Dev.to
Dev.to