Quantization
7 pages
Running Qwen3.6 Locally: VRAM Requirements for 27B and 35B-A3B Quantized Models
Running DeepSeek V4 Locally: VRAM Estimates for Pro, Flash, and Base Versions
Running Gemma 4 Locally: VRAM Requirements for E2B, E4B, 26B, and 31B Quantized Models
A 16GB GPU Can Still Run 35B Models: VRAM Compression Strategies for MoE Models in LM Studio
How to Use llama-quantize for GGUF Models
Choosing Llama GGUF Quantization on Hugging Face: Practical Advice from Q8 to Q2
LLM Quantization Explained: How to Choose FP16, Q8, Q5, Q4, or Q2