Not seeing any reports in the llama.cpp Metal performance tracking GitHub issue.
If anyone has access to this machine, could you post the PP (prompt processing) and TG (text generation) results of:
./llama-bench \
  -m llama-7b-v2/ggml-model-q4_0.gguf \
  -p 512 -n 128 -ngl 99
