🍥

Recording and sharing everyday life

Tags

2 pages

Multi-GPU

A Practical llama.cpp Multi-GPU Benchmarking Approach: Is 2x V100 16GB Faster Than One 32GB Card?

Ollama Multi-GPU Notes: VRAM Pooling, GPU Selection, and Common Misunderstandings