Page 10 - KnightLi Blog

AI Tools

Prompt-Vault: a prompt specification library for testing AI coding ability

A summary of w512/Prompt-Vault: Bubble Sort visualization, todo list, sorting visualization, Kanban board, and Tauri Markdown editor prompts organized by difficulty for testing AI coding agents.

AI Tools

What is Token Efficiency? DeepSeek V4, big-model planning, and small-model execution

A practical view of Token Efficiency in AI coding: DeepSeek V4 Pro / Flash positioning, big models for planning and consultation, small models for execution, plus context budgets, DAG orchestration, task replicas, evaluation, and atomic business workflows.

AI Tools

Superpowers: a skills framework that pulls coding agents back into engineering process

A summary of obra/superpowers: positioning, installation targets, base workflow, skills library, and boundaries. It combines brainstorming, planning, TDD, code review, worktrees, and subagents into a coding-agent methodology.

Hardware

Honeywell PTM7950 confusion: do not judge only by thickness, origin, or black spots

A practical look at PTM7950 and PTM7950SP market confusion: 0.2 mm vs 0.25 mm, origin, black spots, color, COA, and authorization cannot alone prove authenticity or quality. Buyers should focus on batch traceability, testing, quality control, and after-sales support.

AI Tools

mattpocock/skills Explained: How Matt Pocock Uses Skills to Constrain AI Coding

An analysis of Matt Pocock's mattpocock/skills repository: how grill-me, to-spec, to-tickets, TDD, and architecture review constrain AI coding, plus how the workflow compares with Superpowers.

Technical Docs

GPT-5.5 Prompt Migration Guide: Why old prompts should be trimmed before rewritten

A practical summary of OpenAI's GPT-5.5 prompting guide: shorter outcome-first prompts, reasoning effort, preambles and phase, retrieval budgets, validation rules, and what to remove first when migrating old prompts.

AI Tools

What is cc-haha? A project that turns Claude Code into a desktop workbench

A look at NanmiCoder/cc-haha: its positioning, desktop workbench, Computer Use, multi-model setup, H5 remote access, installation flow, and risk boundaries.

AI Tools

Codex /goal vs Claude Code /goal: running long tasks until they are done

A comparison of the /goal command in Codex CLI and Claude Code: both target long-running tasks and completion conditions, but they differ in availability, setup, evaluation, and best-fit workflows.

AI Industry

What Jensen Huang Was Really Saying in His CMU Speech

A concise reading of Jensen Huang's CMU speech: young people may need to relearn hardship, traditional career paths are changing, and the hard problems of the AI era will be harder than they first appear.

AI Tools

Connecting Claude to Fusion 360: An Example of Editing STEP Models With AI

A practical walkthrough of connecting Claude to Fusion 360: enabling the API/MCP service, connecting the port, letting AI analyze a gear structure, and converting a screw-mounted planetary gear into a bearing-based design.

AI Tools

How Can Codex Use Chinese LLMs? Managing OpenAI-Compatible APIs with CCX

CCX is an AI API proxy and protocol-conversion gateway for Claude Messages, OpenAI Chat, OpenAI Images, Codex Responses, and Gemini. This article explains its positioning, deployment, endpoints, channel orchestration, environment variables, and operational cautions.

AI Tools

How Can Codex Use Chinese LLMs? OpenAI-Compatible APIs and the CodexBridge Approach

CodexBridge wraps Codex CLI/SDK as an OpenAI-compatible chat API, allowing OpenWebUI, Cherry Studio, curl, and other clients to call local Codex through /v1/chat/completions. This article explains its use cases, deployment, sessions, multimodal input, structured output, and common configuration.

Technical Docs

Computer Terms in Plain Language: What TTS, STT, API, RAG, and Agent Really Mean

Many computer terms sound impressive, but they often describe very simple things. This article explains common terms such as TTS, STT, API, SDK, CRUD, Cache, Queue, Embedding, RAG, and Agent in plain language.

AI Tools

Sulphur 2 on 8GB VRAM: LTX 2.3 Local Video Generation Guide

Can Sulphur 2 run on 8GB VRAM? This LTX 2.3 guide covers realistic local text-to-video and image-to-video workflows, tool choices, and memory limits.

AI Tools

Running DeepSeek 4 Locally: Antirez's ds4 Experiment on Apple Silicon Mac

ds4 is a local DeepSeek V4 Flash inference engine written by Antirez for Apple Silicon, with CLI, HTTP server, and basic agent capabilities.

AI Tools

Why DeepSeek Became the Cost-Saving Key in This Round of AI Coding Tools

A look at the cost logic behind AI coding tools: why Claude Code, OpenClaw, Superpowers, and similar agent tools consume so many tokens, and why DeepSeek V4's long context and low cache pricing make it a key cost saver.

AI Industry

ProgramBench Raw Leaderboard Data: Model Scores, Costs, and 200 Task Records

A structured copy of ProgramBench's public leaderboard, extended results, and 200 task records, preserving model scores, costs, call counts, test counts, and best scores.

AI Industry

ProgramBench 0% Explained: The Scary Part Is Not Failure, but a Clear Roadmap

A concise explanation of ProgramBench, its 0% result, and what it really means for AI Coding: today's models cannot yet rebuild complete software from scratch, but full software engineering has now become a benchmarkable target.

AI Tools

GPT-5.5 vs GPT-5.4 vs GPT-5.3-Codex: Which Model to Use?

Which OpenAI model should you use? Compare GPT-5.5, GPT-5.4, and GPT-5.3-Codex for coding, reasoning, Codex tasks, API access, speed, and cost.

AI Tools

How to Choose AI Coding Plans: Convenience for Light Users, Flexibility for Heavy Users

A practical guide to choosing AI coding tools and model plans: light users should prioritize convenience, mid-level users should focus on value, and heavy users should decouple models from tools to avoid being locked into a single ecosystem.

AI Tools

Chrome Silently Downloads 4GB Gemini Nano: How to Check, Disable, and Delete It

A concise look at the controversy around Chrome silently downloading the roughly 4GB Gemini Nano local AI model, including file locations, affected platforms, Google's response, and how users can check and disable it.

AI Tools

llama.cpp Multi-GPU: 2 GPUs vs 1, tensor-split, and VRAM

Should you use two GPUs with llama.cpp? Compare 2x16GB vs 1x32GB, layer and tensor split modes, VRAM offload, PCIe limits, and NVLink performance.

AI Tools

Claude Code Limits Doubled: Anthropic Uses SpaceX Compute Expansion to Ease Usage Constraints

A summary of Anthropic's May 2026 increase to Claude Code and Claude API limits, and what its SpaceX compute partnership means for Claude Pro, Max, Team, and enterprise users.

AI Tools

OpenAI's New Realtime Voice Models: GPT-Realtime-2, Live Translation, and Streaming Transcription

A concise look at OpenAI's May 2026 Realtime API voice models, including GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper: capabilities, use cases, pricing, and developer impact.

AI Tools

Claude Account Suspended or Limited: Causes, Checks, and Appeal Steps

Diagnose a suspended or limited Claude account, distinguish quota and billing errors, collect evidence, and follow the official appeal process safely.

AI Tools

From PPT to Prototypes: Use Cases for Guizang PPT Skill and Huashu Design

A look at the positioning, capability differences, use cases, and practical recommendations for two open-source Agent design Skills: guizang-ppt-skill and huashu-design.

Technical Docs

Dirty Frag CVE-2026-43284: Linux Local Privilege Escalation Risk and Mitigation Guide

A practical guide to Dirty Frag CVE-2026-43284, including affected Linux attack paths, CVE-2026-43500 / rxrpc risk, interim mitigations, patch priority, and post-compromise checks.

Technical Docs

Btrfs Scrub Guide: Data Verification, Auto-Repair, and Regular Maintenance

A practical guide to what Btrfs scrub does, how to run it, when auto-repair works, NOCOW file risks, read-only scrub caveats, maintenance intervals, and bandwidth limiting.

Hardware

Intel DG1, Arc A310, and Arc A380 Buying Guide: Low-Power GPUs and AV1 Display Cards Compared

A practical comparison of Intel Iris Xe DG1, Arc A310, and Arc A380 across architecture, VRAM, power, AV1 encode/decode, compatibility, and use cases such as NAS, HTPC, display output, light gaming, and hardware tinkering.

AI Industry

Anthropic Partners With SpaceX: Frontier AI Enters the Heavy-Industry Compute Era

A look at the industry logic behind Anthropic's SpaceX compute deal: Claude usage limits, Colossus 1, GPU utilization, energy constraints, semiconductor supply chains, and AI infrastructure competition.

AI Industry

Musk vs. OpenAI Trial: Nonprofit Mission, Control, and the AI Race

A structured overview of the lawsuit between Elon Musk, OpenAI, and Sam Altman: nonprofit mission, for-profit structure, control disputes, and what the trial may signal for AI governance.

AI Tools

How to Detect Claude 4-Generated Text: AI Text Detection Tools and Methods

A practical guide to tools, algorithmic signals, and review workflows for detecting text generated by Claude 4 and other modern LLMs, with the reminder that AI detection is only probabilistic evidence.

Technical Docs

Does F2FS Freeze an HC620 SMR Drive? Linux SMR Disk Troubleshooting Guide

Why HC620 Host-managed SMR drives may show high I/O wait and system freezes under F2FS, plus practical mount options, scheduler tuning, GC limits, and filesystem alternatives.

AI Industry

miHoYo LPM 1.0 Explained: How an AI Video Model Could Reshape Game NPCs

A concise look at LPM 1.0: not a generic text-to-video tool, but a real-time character performance model for conversational agents, virtual streamers, and game NPCs.

AI Industry

Canonical Ubuntu AI Roadmap: Local Inference First, No Forced Integration

A summary of Canonical's Ubuntu AI roadmap: opt-in previews after Ubuntu 26.10, AI CLI, Settings Agent, local-first inference, and pluggable backends without forced defaults.

AI Tools

Codex vs Claude Code: How to Choose Between Two Subagent Designs

A comparison of Codex and Claude Code subagent design: Codex emphasizes explicit delegation and main-session control, while Claude Code looks more like a configurable, memorable, isolated, background-capable agent workstation system.

AI Tools

9Router Setup Guide: Route Claude Code, Codex, and Cursor with Fallback

Set up 9Router as a local OpenAI-compatible endpoint for Claude Code, Codex, Cursor, and Cline, with provider fallback and token saving.

AI Tools

DeepSeek-TUI: Run a DeepSeek Coding Agent in Your Terminal

A practical overview of DeepSeek-TUI: a terminal coding agent for DeepSeek models with file editing, shell execution, Plan/Agent/YOLO modes, auto model selection, MCP, session resume, and workspace rollback.

AI Tools

Goose AI Agent Setup: Desktop, CLI, MCP, API, and Local Models

Set up the open-source Goose AI agent with its desktop app, CLI, MCP extensions, API, and local or hosted models for practical automation workflows.

AI Tools

Laptop RTX 4060 8GB Local AI Guide: LLMs, Stable Diffusion, FLUX, and VRAM Limits

A practical laptop RTX 4060 8GB local AI guide covering 3B-8B LLMs, GGUF quantization, Stable Diffusion, FLUX low-VRAM workflows, Whisper, image indexing, thermals, and VRAM limits.

Development Tools

How to Change the VS Code Display Language: Chinese, English, and More

A concise guide to changing the VS Code display language by installing language packs, using the Command Palette, or setting Chinese, English, Japanese, Korean, and other languages through argv.json.

Hardware

AMD ROCm 7.2 ComfyUI Guide: Run Local AI on Radeon and Ryzen AI

Set up ComfyUI with AMD ROCm 7.2 on Windows or Linux, compare Radeon and Ryzen AI support, and understand CUDA alternative limits.

Hardware

RTX 5090 / 5080 AI Inference Benchmarks: Choosing for Local LLMs, 4K Video, and Real-Time 3D

A practical look at RTX 5090 and RTX 5080 specs and AI benchmarks, focusing on VRAM, bandwidth, FP4, software support, local LLMs, 4K video generation, image generation, and real-time 3D workflows.

Technical Docs

DeepSeek V4 Local Private Deployment: Choosing Domestic Chips or Consumer GPU Clusters

A practical guide to DeepSeek V4 local private deployment: how enterprises can choose between data security, domestic chip support, consumer GPU clusters, inference frameworks, and cost.

Technical Docs

Best Local LLMs for RTX 3060 12GB: Models, GGUF, and VRAM Limits

Choose local LLMs for an RTX 3060 12GB: practical 7B to 12B GGUF models, Q4/Q5 quantization, VRAM limits, and setup options for Ollama or llama.cpp.

Technical Docs

How to Draw Dashed Lines, Arrows, Curves, and Change Canvas Size in AI

A beginner-friendly guide to common AI software tasks: how to draw dashed lines, arrows, curves, and how to change canvas or artboard size in a vector design tool.

AI Tools

Claude Code Tips Guide: Plan Mode, Rewind, CLAUDE.md, Skills, Agents, and Plugins

A beginner-friendly Claude Code tips guide covering project startup, plan mode, permissions, rewind, terminal commands, context management, CLAUDE.md, Skills, Agents, and plugins.

Development Tools

opencode Guide: Open Source AI Coding Agent Setup, Models, and Claude Code Comparison

A practical opencode guide covering setup, model providers, terminal workflow, and how this open source AI coding agent compares with Claude Code and Codex.

AI Tools

Claude Model Lineup 2026: Opus vs Sonnet vs Haiku, Which One Should You Use?

Compare Claude Opus, Sonnet, and Haiku models by coding ability, speed, cost, context window, and best use cases. Includes a simple model selection table for developers.

Development Tools