🍥

记录并分享日常

Tags

10 pages

LLM

Laptop RTX 4060 8GB Local AI Guide: LLMs, Stable Diffusion, FLUX, and VRAM Limits

Best Local LLMs for an RTX 3060 12GB: Quantization, Context, and Benchmarks

TradingAgents-CN: A Multi-Agent Financial Trading Research Framework for Chinese Users

Prompt Optimizer: An Open-Source Tool for Prompt Optimization, Testing, and MCP

Google LangExtract: Extract Structured Data from Long Text with LLMs

Why LLM APIs Charge by Tokens: A Clear Guide to Input, Output, and Context Costs

DeepSeek-V4 Preview Guide: 1M Context, V4-Pro, V4-Flash, and API Migration

What the Common GPU Inference Benchmark Metrics Actually Mean: FA, pp512, tg128, and Q4_0

A Practical Guide to Common Tensor Formats in LLMs: FP32, FP16, BF16, TF32, and FP8

LLM Quantization Explained: How to Choose FP16, Q8, Q5, Q4, or Q2