KnightLi Blog

Recording and sharing daily life

Archives

2026 87
2025 23
2024 5
2023 9
2022 33
2021 5
2020 8

Categories

Technical Docs, AI Tools, Hardware, Operations, Development Tools, Hardware Related, AI Industry, Blockchain, Technical Documentation

Tags

Ollama, Ubuntu, Gemma 4, Local LLM, AI Agent, Python, Windows, Hugging Face, Linux, Local LLMs, Pinout, Agent Skills, Codex, Hugo, Llama.cpp, Nginx, OpenClaw, API, Browser Automation, Docker, GGUF, MCP, Playwright, Playwright CLI, Vs-Code, Anthropic, Certbot, ChatGPT, Claude, Gemini
Hardware

12V-2x6 vs. 12VHPWR: Notes on GPU 16-Pin Power Connector Differences

A concise note on the main differences between the 12V-2x6 and 12VHPWR GPU 16-pin power connectors: cable compatibility, pin length, SENSE logic, H++ marking, and 600W output capability.

2026-04-19
4 minute read
AI Tools

Karpathy's 65-Line CLAUDE.md: Helping AI Coding Avoid Three Common Mistakes

A summary of Karpathy's observations on AI coding, and how Forrest Cheung turned those problems into CLAUDE.md rules: think first, keep it simple, make precise changes, and work toward verifiable goals.

2026-04-19
6 minute read
Hardware

Core Ultra 9 285T ES Notes: Q4A7, a B860 Engineering Board, and the 35W Power Wall

Notes on the Core Ultra 9 285T ES sample Q4A7: platform, motherboard, power delivery, memory, performance, gaming results, and buying advice. The specs look tempting, but the 35W power wall, high DDR5 latency, scarce ES boards, and limited BIOS make it better suited to low-power tinkering than a gaming PC.

2026-04-19
9 minute read
AI Tools

Using Claude Code Quota More Efficiently: Models, Context, Caching, and /compact

A practical note on why Claude Code and Claude Pro/Max usage can run out quickly: model choice, 5-hour usage windows, long conversations, files and images, cache misses, CLAUDE.md, MCP, and skills, with habits around /compact, /clear, /context, and /status.

2026-04-19
8 minute read
AI Tools

rembg Project Notes: A Local Background Removal Tool

A practical look at danielgatis/rembg: what it is, how to install it, CLI usage, Python integration, HTTP server mode, Docker usage, model choices, and where it fits in local background removal workflows.

2026-04-19
7 minute read
AI Tools

Ollama Multi-GPU Notes: VRAM Pooling, GPU Selection, and Common Misunderstandings

A practical summary of Ollama multi-GPU behavior: when models are split across GPUs, how to limit devices with CUDA_VISIBLE_DEVICES / ROCR_VISIBLE_DEVICES, whether VRAM can be pooled, whether mixed GPUs work, and common Docker, PCIe, and performance pitfalls.

2026-04-19
9 minute read
Hardware

Lenovo HR630x / HR650x Notes: LGA3647, 8259CL, Optane, and Common Pitfalls

Based on HR630x build logs and HR650x troubleshooting notes, this post summarizes buying and setup considerations for Lenovo HR630x / HR650x LGA3647 server barebones: CPU and Optane pairing, VRM unlocks, fan control, risers, backplanes, and BMC/UEFI notes.

2026-04-18
11 minute read
Hardware

MCP2221A-I/ST Selection Notes: A Handy USB-to-I2C/UART Bridge Chip

A quick look at the key parameters and practical notes for Microchip MCP2221A-I/ST: USB 2.0 to I2C/UART, GPIO multiplexing, supply range, package, speed limits, and why it belongs in a hardware debugging toolkit.

2026-04-18
6 minute read
Hardware

Getting High-TDC OEM CPUs to Boot on LGA3647: Modifying the VRM's ICC_MAX

A compilation of ServeTheHome forum notes on modifying the VRM ICC_MAX for high-TDC OEM Xeon processors on the LGA3647 platform: why the machine fails to power on, what to prepare, motherboard-specific wiring, flash commands, BIOS modifications, and risk precautions.

2026-04-18
18 minute read
AI Tools

Google App for Desktop: Bringing AI Search to Windows

A practical introduction to Google app for desktop: supported devices, Alt + Space shortcut, AI Mode, Google Lens, screen sharing, file uploads, local file search, and Google Drive search.

2026-04-18
7 minute read
Operations

Understanding the nftables Framework: Tables, Chains, Rules, and Sets

A concept-level overview of the nftables framework: what table, family, chain, rule, set, map, and verdict map are for, and how they work together to form maintainable firewall rules.

2026-04-18
5 minute read
Operations

nftables Quick Start: Tables, Chains, Rules, and Common Operations

A practical nftables quick start: understand table, chain, and rule, then use common commands for IP, MAC, port matching, traffic counters, rate limiting, and rule cleanup.

2026-04-18
4 minute read
AI Tools

Gemma 4 E4B Uncensored vs Official: What Actually Changes

A practical comparison between the unofficial Gemma-4-E4B-Uncensored-HauhauCS-Aggressive release and Google's official Gemma 4 E4B-it model, including behavior, safety, licensing, and deployment trade-offs.

2026-04-18
4 minute read
AI Tools

Deploy Hermes Agent Locally on Windows with WSL + Ollama and Connect Telegram

A practical local setup flow for Hermes Agent on Windows: install WSL and Ubuntu, add Ollama and Gemma 4, then complete a basic Telegram connection.

2026-04-18
4 minute read
AI Tools

Where Does llama-cli -hf Save Hugging Face Models by Default

A quick note on where llama-cli -hf stores GGUF models downloaded from Hugging Face, and how to change the cache directory with LLAMA_CACHE or Hugging Face cache variables.

2026-04-17
2 minute read
AI Tools

How to Fix SSL Certificate Verification Failed When llama-cli Downloads from Hugging Face on Windows

Common causes and fixes when llama-cli fails SSL certificate verification while downloading Hugging Face models with -hf on Windows.

2026-04-17
2 minute read
Hardware

CRPS Common Redundant Server Power Supply Standard, Pin Functions, and Common Models

A practical overview of CRPS / M-CRPS common redundant server power supplies, including the 2x25 edge connector pinout, PSON/12VSB/PMBus signal functions, and common CRPS PSU models.

2026-04-17
11 minute read
Hardware

CSPS Common Slot Server Power Supply Interface and Pinout

A practical overview of CSPS / Common Slot server power supplies, including the 64-pin edge connector pinout, 12V output enable method, PMBus/SMBus signals, and breakout board design notes.

2026-04-16
18 minute read
AI Tools

codex-quota Practical Guide: Local, Web, and Docker Usage with Original CLI Commands

`codex-quota` is a lightweight tool to check ChatGPT Codex quota usage, covering local CLI, web service, and Docker/Compose usage.

2026-04-16
3 minute read
Development Tools

Build Docker Images in VS Code on Windows: From Setup to Build

A practical guide to building Docker images in VS Code on Windows, including prerequisites, Dockerfile setup, build methods, and quick troubleshooting.

2026-04-16
2 minute read
AI Tools

Claude Identity Verification: Why It Exists, What You Need, and How Data Is Handled

A summary of Anthropic's Claude identity verification guide, covering when verification may appear, what documents are accepted, Persona's role, data protection, and what to do if verification fails.

2026-04-16
5 minute read
AI Tools

How Codex Usage Limits Work: 5-Hour Limits, Weekly Limits, and Credits

Explains Codex 5-hour limits, weekly limits, credit consumption, local tasks versus cloud tasks, and why weekly usage can drop even when the 5-hour quota is not exhausted.

2026-04-15
5 minute read
Hardware

A Practical Guide to Common U.2 Enterprise SSD Series

A practical overview of common U.2 enterprise SSD series from Solidigm, Samsung, Western Digital, Micron, and Kioxia, with a focus on product positioning and typical use cases.

2026-04-15
7 minute read
AI Tools

RAGFlow Project Notes: Features and Usage of an Open-Source RAG Engine

A practical overview of infiniflow/ragflow, covering its core positioning, major features, deployment approach, and basic usage flow for enterprise knowledge bases and AI Q&A systems.

2026-04-15
6 minute read
AI Tools

Firecrawl Project Notes: Web Search, Scraping, and Interaction APIs for AI Agents

A concise look at Firecrawl's positioning, core features, use cases, self-hosting options, and licensing boundaries, with a focus on whether it fits as a web data layer for AI agents.

2026-04-15
5 minute read
AI Tools

Playwright CLI Video Recording: Recording, Chapter Markers, Overlays, and Debugging Tradeoffs

Based on the official video-recording reference, this article organizes video capture, chapter markers, Overlay APIs, and the practical differences between video and tracing in Playwright CLI.

2026-04-15
6 minute read
AI Tools

Playwright CLI Session Management: Multiple Browser Sessions, Isolation, Persistence, and Cleanup

Based on the official session-management reference, this article organizes the common ways to use named browser sessions, session isolation, persistent profiles, concurrent usage, and cleanup commands in Playwright CLI.

2026-04-15
6 minute read
Hardware

M.2 Key E, Key B, and Key M Pinout Notes

A concise summary of M.2 pinout documentation, preserving the Pinout Description tables for Key E, Key B, and Key M sockets, with English notes added.

2026-04-15
6 minute read
AI Tools

Playwright CLI Storage State: Save Login Sessions, Read Cookies, and Local Storage

Based on the official storage-state reference, this post summarizes the common Playwright CLI commands for storage state, cookies, localStorage, sessionStorage, and IndexedDB with concise explanations.

2026-04-14
4 minute read
AI Tools

What Is OpenHarness: What This Open Source Agent Harness Can Do

Based on the official HKUDS/OpenHarness repository and README, this article summarizes OpenHarness's positioning, core capabilities, ohmo's personal-assistant features, and the scenarios it fits best.

2026-04-12
6 minute read
AI Tools

Getting Started with Playwright CLI: Installation, Skills, Sessions, and Essential Commands

Based on the latest microsoft/playwright-cli README, this guide walks through Playwright CLI's positioning, installation, skills workflow, session management, monitoring dashboard, and essential commands.

2026-04-12
7 minute read
AI Tools

What Is Hermes Agent: Overview, Strengths, Getting Started, and How It Compares to OpenClaw

A practical introduction to Nous Research's Hermes Agent: what it is, where it stands out, how to get started, and how it differs from OpenClaw in positioning and user experience.

2026-04-12
7 minute read
AI Tools

OpenClaw Dreaming: Machines Start Dreaming While Humans Lose Sleep

OpenClaw introduced Dreaming, a memory consolidation system modeled on light sleep, deep sleep, and REM to help agents retain signal and discard noise.

2026-04-12
4 minute read
AI Tools

How to Use llama-quantize for GGUF Models

A short introduction to what llama-quantize does, its basic commands, common options, and the tradeoffs between model size, speed, and quality.

2026-04-12
2 minute read
AI Tools

How to Get GGUF Models from Hugging Face with llama.cpp

A short guide to downloading GGUF models with llama.cpp from Hugging Face, switching compatible endpoints, and converting non-GGUF formats.

2026-04-12
1 minute read
AI Tools

Codex Usage and Quota Check

Use a small Python script to read credentials from `auth.json`, call ChatGPT's `/backend-api/wham/usage` endpoint, and inspect remaining Codex quota plus reset times.

2026-04-12
10 minute read
AI Tools

What Does `it` Mean in Gemma-4-31B-it

A brief explanation of what `it` and `31B` mean in Gemma-4-31B-it, and why `it` is usually the right choice for chat use.

2026-04-11
1 minute read
AI Tools

Choosing Llama GGUF Quantization on Hugging Face: Practical Advice from Q8 to Q2

A practical way to understand GGUF quantization levels and choose between Q8, Q6, Q5, Q4, Q3, and Q2 based on hardware limits.

2026-04-11
2 minute read
AI Tools

How to Access a Local Ollama API Over LAN on Windows

Expose the Ollama API to your local network on Windows by setting the host, allowing the port through the firewall, and verifying with curl.

2026-04-11
1 minute read
Hardware

Common USB PD Decoy Chips: CH224K vs HUSB238 vs HUSB237 vs IP2721 vs XSP

A quick comparison of CH224K, HUSB238, HUSB237, IP2721, and XSP series decoy chips for USB PD power design.

2026-04-11
1 minute read
AI Tools

What Models Power fnOS AI Photos: Face, Object, and Semantic Search Stack

A practical breakdown of the fnOS AI photo stack, covering face recognition, object detection, semantic search, and hardware acceleration.

2026-04-11
1 minute read
Operations

go2rtc with Xiaomi Camera RTSP: Feed NVR, HomeKit, and Frigate

A practical setup note for pulling a Xiaomi camera's RTSP stream via go2rtc and using it across NVR, HomeKit, and Frigate.

2026-04-11
1 minute read
AI Tools

Gemma 4 Local Runtime Guide: From One-Command Start to Dev Integration

A concise guide to the main local runtime paths for Gemma 4, including Ollama, LM Studio, llama.cpp, and developer-oriented integration.

2026-04-10
2 minute read
AI Tools

Drop MCP? Why CLI Is Becoming the Default Tool Layer for Agents

Why more agent workflows are returning to a CLI-first approach, viewed across cost, reliability, training distribution, and the security model.

2026-04-10
3 minute read
AI Tools

PersonaPlex Quick Guide: Full-Duplex Conversational Speech with Persona and Voice Control

A concise guide to PersonaPlex capabilities, setup, and prompting, including server launch, offline evaluation, and role/voice control.

2026-04-10
2 minute read
AI Tools

Anthropic's Harness Direction: Agent Infrastructure Is Becoming an Agent OS

A concise breakdown of Anthropic's latest practice across session, harness, and sandbox, and why agent architecture is moving toward stable abstractions with recoverable execution.

2026-04-10
2 minute read
AI Tools

OpenClaw and Agent Harness: Why It Looks Like AGI

A harness-based view of OpenClaw: the model remains the core, while autonomy comes from the engineering combination of memory, tools, triggers, and execution loops.

2026-04-10
2 minute read
AI Tools

Sharing an Agent Skill for E-commerce Product Image Cutout and Standardization

An overview of the product-cutout-normalize Agent Skill, including its purpose, usage, parameters, and the full source code for SKILL.md and scripts/run_pipeline.py.

2026-04-09
10 minute read
AI Tools

How to Use Google Nano Banana for Image Cutouts

Based on a practical Python example, this article explains how to use Google Nano Banana for product-image background removal, with the full source code included.

2026-04-09
8 minute read
AI Tools

What Are Ollama Cloud Models and How Do You Use Them

A brief explanation of what Ollama cloud models are, how they differ from local models, and how to use them from the command line or via API.

2026-04-09
2 minute read
© 2022 - 2026 KnightLi Blog
Recording and sharing