GTX 1060
GTX 1060 Running Qwen 35B: Optimizing llama.cpp from 3 tok/s to 17 tok/s