Tags
10 pages
LLM
Which Local AI Models Can a Laptop RTX 4060 8GB Run?
Local LLM Models Recommended for an RTX 3060 GPU
TradingAgents-CN: A Multi-Agent Financial Trading Research Framework for Chinese Users
Prompt Optimizer: An Open-Source Tool for Prompt Optimization, Testing, and MCP
Google LangExtract: Extract Structured Data from Long Text with LLMs
Why LLM APIs Charge by Tokens: A Clear Guide to Input, Output, and Context Costs
DeepSeek-V4 Preview Released: 1M Context, Two Models, and API Migration Notes
What the Common GPU Inference Benchmark Metrics Actually Mean: FA, pp512, tg128, and Q4_0
A Practical Guide to Common Tensor Formats in LLMs: FP32, FP16, BF16, TF32, and FP8
LLM Quantization Explained: How to Choose FP16, Q8, Q5, Q4, or Q2