Is anyone else having issues with Qwen 122B falling apart completely at ~ 100K context?
I am using VLLM with the olka-fi MXFP4 quant.
When the model hits this threshold it abruptly just stops working. Agents work great up until this point, and then it just stops following instructions for more than maybe 1 step.
I saw someone mention this about 27B yesterday, but now I can't find the post. It's definitely happening with 122b as well
[link] [comments]




