What would you choose if you were in my shoes? How viable is 125k for agentic coding really? is "compact" really good enough, or would you go with Q6_K 125k?
I am getting around 165-170 tok/sec with either config with my 5090.
[link] [comments]
Reddit r/LocalLLaMA / 4/18/2026
What would you choose if you were in my shoes? How viable is 125k for agentic coding really? is "compact" really good enough, or would you go with Q6_K 125k?
I am getting around 165-170 tok/sec with either config with my 5090.