Tags
13 pages
GPU
How to Pick a GPU in April 2026: Which Models to Avoid and Which Ones Are More Worth Considering
Ubuntu 26.04 LTS GPU and Hardware Updates: CUDA, ROCm, DPC++, and More Platform Changes
How to Fix Ollama Using CPU Instead of GPU
What Is NVIDIA nvbandwidth: How to Use This GPU Bandwidth Testing Tool
How to Check Whether a Tesla V100 Has ECC Errors
Is Tesla V100 Still Worth Buying: ECC Checks, Cooling Mods, and DIY Pitfalls
llama.cpp GPU Benchmark: CUDA vs ROCm vs Vulkan Scoreboard and pp512/tg128 Explained
What the Common GPU Inference Benchmark Metrics Actually Mean: FA, pp512, tg128, and Q4_0
A Practical Guide to Common Tensor Formats in LLMs: FP32, FP16, BF16, TF32, and FP8
A 16GB GPU Can Still Run 35B Models: VRAM Compression Strategies for MoE Models in LM Studio
12V-2x6 vs. 12VHPWR: Notes on GPU 16-Pin Power Connector Differences
Ollama Multi-GPU Notes: VRAM Pooling, GPU Selection, and Common Misunderstandings
How to Check Whether an Ollama Model Is Loaded on GPU