🍥

Recording and sharing daily life

Tags

7 pages

Qwen

Has Qwen3.8 Actually Launched? Only the Preview Is Available, While Open Weights Are Still Pending

Qwythos-9B Claude Mythos 1M Context Guide: vLLM, SGLang, Transformers

GTX 1060 Running Qwen 35B: Optimizing llama.cpp from 3 tok/s to 17 tok/s

NVIDIA Qwen3.6-35B-A3B-NVFP4 Guide: FP4 Quantization and vLLM Deployment

RTX 3060 12GB + Qwen3.6 35B: llama.cpp --n-cpu-moe Setup

Qwen3.6-35B-A3B Jailbreak Local Deployment: Uncensored GGUF, llama.cpp, and Safety Boundaries

Running Qwen3.6-35B Locally on an RTX 3070 8GB: llama.cpp Deployment Notes and Tuning Parameters