https://github.com/ggml-org/llama.cpp/pull/22196
And somehow we already got some GGUFs for it!
https://huggingface.co/CISCai/gemma-4-31B-it-NVFP4-turbo-GGUF
https://huggingface.co/stevelikesrhino/gemma-4-31B-it-nvfp4-GGUF
(the one below is from the PR author himself)
https://huggingface.co/michaelw9999/Nemotron-Cascade-2-30B-A3B-NVFP4-GGUF

