This page groups GitHub AI projects by application direction, covering AI coding and Coding Agents, agent skills and workflows, RAG and knowledge bases, multimodal creation, local models and inference, vertical applications and automation, and AI application development infrastructure. New projects can be added later using the same structure.
Category Summary
| Category | Projects | Best for |
|---|---|---|
| AI Coding and Coding Agents | 31 | Developers using Claude Code, Codex, Cursor, terminal Agents, or repo automation |
| Agent Skills and Workflows | 13 | People standardizing AI coding, research, job search, trend analysis, or creative workflows |
| RAG, Knowledge Bases, and Memory | 13 | People building document retrieval, knowledge bases, memory, OCR, parsing, and local vector indexes |
| Multimodal and Content Creation | 11 | People working with images, video, voice, digital humans, Live2D characters, and distribution |
| Vertical Applications and Automation | 12 | People exploring finance, trading, marketplace monitoring, desktop, browser, and mobile automation |
| AI Application Development Infrastructure | 10 | Builders of AI apps, Agent toolchains, browser automation, AI frontend interaction, and API gateways |
| Local Models and Inference | 3 | Users focused on local models, quantization, inference engines, and hardware adaptation |
| General Development, Operations, and Learning Resources | 11 | People filling gaps in infrastructure, frameworks, operations, security, and learning |
AI Coding and Coding Agents
This group focuses on code understanding, code modification, engineering workflows, and terminal agents. It is the largest group: the table below lists 22 projects, and later additions appear under Latest Additions From the Site.
| Project | Article | GitHub | Core Use | Best For |
|---|---|---|---|---|
| Ralph | Ralph: turning Claude Code and Amp into an autonomous development loop | snarktank/ralph | Drive Claude Code / Amp through PRD, planning, execution, and review loops | Users who want a more structured agent coding process |
| Claude-Mem | Claude-Mem: long-term cross-session memory for Claude Code | thedotmack/claude-mem | Add cross-session memory to Claude Code | Heavy Claude Code users |
| Claude Code Hooks Mastery | Claude Code Hooks Mastery: getting started with the 13 hook lifecycle stages | disler/claude-code-hooks-mastery | Learn the Claude Code hooks lifecycle and automation controls | Users who want to customize Claude Code workflows |
| Compound Engineering Plugin | Compound Engineering Plugin: turning AI coding into planning, execution, and review loops | EveryInc/compound-engineering-plugin | Split AI coding into planning, execution, and review cycles | Users who care about engineering discipline in AI coding |
| free-claude-code | free-claude-code: connecting Claude Code to OpenRouter, DeepSeek, and local models | Alishahryar1/free-claude-code | Use a proxy to connect Claude Code to different model backends | Users who want to reduce Claude Code cost |
| Hermes Agent | What is Hermes Agent: overview, strengths, quick start, and OpenClaw comparison | NousResearch/hermes-agent | Local agent framework with tool calling and task execution | Users who want to run local agents |
| OpenHarness | What OpenHarness can do as an open source agent harness | HKUDS/OpenHarness | Agent harness and multi-agent execution framework | Users researching agent orchestration |
| CodexBridge | Using Codex with Chinese LLMs: OpenAI-compatible APIs and CodexBridge | begonia599/CodexBridge | Connect Codex to OpenAI-compatible model APIs | Users who want Codex with Chinese models |
| ccx | Using CCX to manage OpenAI-compatible APIs for Codex and Chinese models | BenedictKing/ccx | Manage API proxies for Claude, Codex, Gemini, and more | Users who switch between multiple models |
| cc-haha | cc-haha: a desktop workspace for Claude Code | NanmiCoder/cc-haha | Desktop workspace and Computer Use entry for Claude Code | Claude Code users who prefer a GUI |
| DeepSeek-TUI | DeepSeek-TUI: turning DeepSeek V4 into a terminal coding agent | Hmbown/DeepSeek-TUI | Run a DeepSeek coding agent in the terminal | DeepSeek and command-line users |
| Open Design | Open Design: turning Claude Code and Codex into AI design tools | nexu-io/open-design | Bring Claude Code / Codex into design generation | Users who want agents for design prototypes |
| agentmemory | agentmemory: persistent memory for Claude Code, Codex, and Cursor | rohitg00/agentmemory | Add persistent memory to coding agents | Developers maintaining long-running projects |
| Graphify | Graphify: turning a codebase into an AI-queryable knowledge graph | safishamsi/graphify | Convert a codebase into a knowledge graph to reduce repeated file reads | Large-codebase users |
| oh-my-pi | What is oh-my-pi? An AI coding assistant that connects terminal, IDE, and debugger | can1357/oh-my-pi | Connect terminal, IDE, LSP, and debugger as a local AI coding console | Developers who want to unify CLI and IDE workflows |
| Claude Plugins Official | Claude Code now has a plugin directory: what to install, how to install it, and what to watch | anthropics/claude-plugins-official | Official Claude Code plugin directory and installation entry point | Users who want to extend Claude Code |
| CodeGraph | What is CodeGraph? A local code map for Claude Code, Codex, and Cursor | colbymchenry/codegraph | Generate local indexes and relationship graphs to help Coding Agents understand projects | Developers maintaining medium-to-large codebases |
| CC Switch | CC Switch: managing Claude Code, Codex, Gemini CLI, and OpenClaw in one desktop tool | farion1231/cc-switch | Manage multiple AI CLI tools and account/config switching | Users of multiple CLI tools |
| Warp | Warp open source: from terminal to Agentic Development Environment | warpdotdev/warp | Agentic terminal and development environment | Heavy terminal users |
| opencode | opencode vs Claude Code vs Codex: open source AI coding tools guide | anomalyco/opencode | Open source AI coding agent | Users looking for Claude Code / Codex alternatives |
| 9Router | 9Router: connecting Claude Code, Codex, and Cursor to one AI router | decolua/9router | AI coding model routing and token cost control | Multi-tool, multi-model users |
| goose | goose: an open source AI Agent across desktop, CLI, and API | aaif-goose/goose | Open source agent across desktop, CLI, and API | Users who want a general agent workspace |
Agent Skills and Workflows
This group focuses on turning AI capabilities into repeatable skills, processes, and specifications. The table below lists 7 projects; later additions appear under Latest Additions From the Site.
| Project | Article | GitHub | Core Use | Best For |
|---|---|---|---|---|
| mattpocock/skills | Rejecting Vibe Coding: Matt Pocock’s skills repo adds engineering constraints to AI coding | mattpocock/skills | Use skills to constrain AI coding workflows | Users who want engineering discipline for agents |
| Superpowers | Superpowers: bringing coding agents back into engineering workflows | obra/superpowers | Agentic skills framework and software development methodology | Users who want systematic coding-agent workflows |
| Prompt-Vault | Prompt-Vault: a prompt specification library for testing AI coding ability | w512/Prompt-Vault | Collect prompt specs for testing AI coding ability | Model and tool evaluators |
| web-video-presentation | web-video-presentation: an agent skill for turning articles into recordable web videos | ConardLi/garden-skills | Turn articles into recordable web videos | Content creators and automation users |
| nuwa-skill | nuwa-skill: turning “distilling a person” into an executable workflow | alchaincyf/nuwa-skill | Recreate a person’s style of expression and thinking with a skill | Users building style-based agents |
| Scientific Agent Skills | Scientific Agent Skills: giving research workflows to AI agents | K-Dense-AI/scientific-agent-skills | Skill collection for scientific workflows | Researchers, data analysts, and technical writers |
| easy-vibe | easy-vibe: a learning map for Vibe Coding beginners | datawhalechina/easy-vibe | Learning map for Vibe Coding | AI coding beginners |
RAG, Knowledge Bases, and Memory
This group addresses document retrieval, knowledge base construction, long-term memory, and structured extraction. The table below lists 7 projects; later additions appear under Latest Additions From the Site.
| Project | Article | GitHub | Core Use | Best For |
|---|---|---|---|---|
| LangExtract | Google LangExtract: extracting structured data from long text with LLMs | google/langextract | Extract structured information from long text | Information extraction and data processing users |
| qmd | qmd: local Markdown document search for AI agents | tobi/qmd | Local Markdown document search | Users managing knowledge in Markdown |
| Firecrawl | Firecrawl: web search, crawling, and interaction API for AI agents | firecrawl/firecrawl | Web crawling, search, and structured data entry point | RAG and agent data-ingestion users |
| RAGFlow | RAGFlow: features and usage of an open source RAG engine | infiniflow/ragflow | Open source RAG engine | Enterprise knowledge base and document Q&A users |
| OpenHuman | OpenHuman: the desktop route for open source personal AI agents | tinyhumansai/openhuman | Local-first personal AI agent and memory layer | Users who want to integrate personal data |
| OpenKB | OpenKB: compiling documents into continuously updated LLM knowledge bases | VectifyAI/OpenKB | Compile documents into updatable knowledge bases | Documentation knowledge-base maintainers |
| PageIndex | PageIndex: reasoning-style RAG document indexing without vector databases | VectifyAI/PageIndex | Reasoning-style document indexing without vector databases | Users watching new RAG approaches |
Multimodal and Content Creation
This group covers image, video, transcription, and content distribution scenarios. The table below lists 5 projects; later additions appear under Latest Additions From the Site.
| Project | Article | GitHub | Core Use | Best For |
|---|---|---|---|---|
| rembg | rembg: local image background removal tool | danielgatis/rembg | Local image background removal | E-commerce, design, and image-processing users |
| awesome-gpt-image-2-prompts | GPT-Image 2 prompt library: e-commerce, posters, portraits, and UI | EvoLinkAI/awesome-gpt-image-2-prompts | GPT-Image 2 prompts and case library | AI art and prompt users |
| faster-whisper | faster-whisper: a faster Whisper transcription engine | SYSTRAN/faster-whisper | High-performance speech-to-text | Subtitle, transcription, and speech-processing users |
| Pixelle-Video | Pixelle-Video: an open source AI engine for generating short videos from one topic | AIDC-AI/Pixelle-Video | One-topic short-video generation workflow | Short-video and AIGC creators |
| AiToEarn | Too many content platforms? AiToEarn uses AI agents to help creators save effort | yikart/AiToEarn | Multi-platform content distribution and creator automation | Content operators and creators |
Local Models and Inference
This group focuses on local model runtimes and inference experiments. It is currently the smallest group: the table below lists 1 project, and later additions appear under Latest Additions From the Site.
| Project | Article | GitHub | Core Use | Best For |
|---|---|---|---|---|
| ds4 | Running DeepSeek 4 locally: Antirez ds4 on Apple Silicon Mac | antirez/ds4 | Experiment with running DeepSeek 4 on Apple Silicon | Local model and inference experiment users |
Vertical Applications and Automation
This group applies agents or AI capabilities to finance, trading, browsers, desktops, e-commerce monitoring, and other concrete scenarios. The table below lists 7 projects; later additions appear under Latest Additions From the Site.
| Project | Article | GitHub | Core Use | Best For |
|---|---|---|---|---|
| TradingAgents-CN | TradingAgents-CN: a multi-agent financial trading research framework for Chinese users | hsliuping/TradingAgents-CN | Multi-agent financial trading research framework | Quant, finance, and agent researchers |
| FinceptTerminal | FinceptTerminal: open source financial terminal, quant research, and AI Agent workspace | Fincept-Corporation/FinceptTerminal | Financial terminal, quant research, and AI agent workspace | Financial analysis and quant users |
| Anthropic financial-services | Anthropic financial-services: reusable templates for financial agent scenarios | anthropics/financial-services | Financial services agent templates | Users building financial AI solutions |
| ai-goofish-monitor | ai-goofish-monitor: open source AI monitoring system for Xianyu products | Usagi-org/ai-goofish-monitor | AI product monitoring and Xianyu automation | Second-hand marketplace monitoring users |
| CloakBrowser | CloakBrowser: a more human-like browser for Playwright and Puppeteer | CloakHQ/CloakBrowser | More human-like browser automation environment | Browser automation and agent operation scenarios |
| UI-TARS-desktop | Let AI operate the computer? UI-TARS-desktop connects desktop, browser, and tools | bytedance/UI-TARS-desktop | Desktop, browser, and tool operation agent | Users who want AI to operate computers |
| AI-Trader | What is AI-Trader: a platform for AI agents to publish trading signals and run simulations | HKUDS/AI-Trader | AI agent trading signals and simulated trading platform | Financial agent and trading researchers |
AI Application Development Infrastructure
This group provides foundational components for building AI applications and agent toolchains. The table below lists 5 projects; later additions appear under Latest Additions From the Site.
| Project | Article | GitHub | Core Use | Best For |
|---|---|---|---|---|
| Prompt Optimizer | Prompt Optimizer: open source prompt optimization, testing, and MCP tools | linshenkx/prompt-optimizer | Prompt optimization, testing, and MCP tools | Prompt engineering and app-tuning users |
| Playwright CLI | Playwright CLI basics: installation, skills, sessions, and common commands | microsoft/playwright-cli | Browser automation CLI for coding agents | Agent users who need browser operation |
| Vercel AI SDK | What is Vercel AI SDK? A unified toolkit for TypeScript AI apps | vercel/ai | TypeScript AI application SDK | Front-end and full-stack developers |
| CLIProxyAPI | CLIProxyAPI: wrapping Codex, Claude Code, and Gemini CLI into unified APIs | router-for-me/CLIProxyAPI | Wrap multiple AI CLIs and OAuth login states as compatible APIs | Users who want unified access to Codex, Claude Code, and Gemini CLI |
| CLIProxyAPI Management Center | CLIProxyAPI Management Center: a visual admin console for CLIProxyAPI | router-for-me/Cli-Proxy-API-Management-Center | Web admin UI for CLIProxyAPI configuration, accounts, logs, and OAuth | Users running CLIProxyAPI as a team gateway or account pool |
Latest Additions From the Site
These projects were added after the earlier translated index was published. Listing them here brings the translated page in sync with the updated Chinese source.
| Project | Category | Article | GitHub |
|---|---|---|---|
| OpenAI Symphony | AI Coding and Coding Agents | OpenAI Symphony | openai/symphony |
| GitHub Spec Kit | AI Coding and Coding Agents | GitHub Spec Kit | github/spec-kit |
| oh-my-codex | AI Coding and Coding Agents | oh-my-codex | Yeachan-Heo/oh-my-codex |
| RTK | AI Coding and Coding Agents | RTK | rtk-ai/rtk |
| Headroom | AI Coding and Coding Agents | Headroom | chopratejas/headroom |
| DeepSeek-Reasonix | AI Coding and Coding Agents | DeepSeek-Reasonix | esengine/DeepSeek-Reasonix |
| Understand-Anything | AI Coding and Coding Agents | Understand-Anything | Lum1104/Understand-Anything |
| Taste Skill | Agent Skills and Workflows | Taste Skill | Leonxlnx/taste-skill |
| academic-research-skills | Agent Skills and Workflows | academic-research-skills | Imbad0202/academic-research-skills |
| last30days-skill | Agent Skills and Workflows | last30days-skill | mvanhorn/last30days-skill |
| Agent-Reach | Agent Skills and Workflows | Agent-Reach | Panniantong/Agent-Reach |
| Personal AI Infrastructure | Agent Skills and Workflows | Personal AI Infrastructure | danielmiessler/Personal_AI_Infrastructure |
| career-ops | Agent Skills and Workflows | career-ops | santifer/career-ops |
| EverOS | RAG, Knowledge Bases, and Memory | EverOS | EverMind-AI/EverOS |
| MemPalace | RAG, Knowledge Bases, and Memory | MemPalace | MemPalace/mempalace |
| open-notebook | RAG, Knowledge Bases, and Memory | open-notebook | lfnovo/open-notebook |
| PaddleOCR | RAG, Knowledge Bases, and Memory | PaddleOCR | PaddlePaddle/PaddleOCR |
| MinerU | RAG, Knowledge Bases, and Memory | MinerU | opendatalab/MinerU |
| turbovec | RAG, Knowledge Bases, and Memory | turbovec | RyanCodrai/turbovec |
| HyperFrames | Multimodal and Content Creation | HyperFrames | heygen-com/hyperframes |
| OpenAI Whisper | Multimodal and Content Creation | OpenAI Whisper | openai/whisper |
| VibeVoice | Multimodal and Content Creation | VibeVoice | microsoft/VibeVoice |
| LongCat-Video | Multimodal and Content Creation | LongCat-Video | meituan-longcat/LongCat-Video |
| WavFlow | Multimodal and Content Creation | WavFlow | facebookresearch/WavFlow |
| Open-LLM-VTuber | Multimodal and Content Creation | Open-LLM-VTuber | Open-LLM-VTuber/Open-LLM-VTuber |
| xianyu-auto-reply | Vertical Applications and Automation | xianyu-auto-reply | zhinianboke/xianyu-auto-reply |
| MobiAgent | Vertical Applications and Automation | MobiAgent | IPADS-SAI/MobiAgent |
| Mobile-Agent | Vertical Applications and Automation | Mobile-Agent | X-PLUG/MobileAgent |
| Mobilerun | Vertical Applications and Automation | Mobilerun | droidrun/mobilerun |
| mobile-use | Vertical Applications and Automation | mobile-use | minitap-ai/mobile-use |
| browser-harness | AI Application Development Infrastructure | browser-harness | browser-use/browser-harness |
| CopilotKit | AI Application Development Infrastructure | CopilotKit | CopilotKit/CopilotKit |
| Vite | AI Application Development Infrastructure | Vite | vitejs/vite |
| Svelte | AI Application Development Infrastructure | Svelte | sveltejs/svelte |
| Microsoft MXC | AI Application Development Infrastructure | Microsoft MXC | microsoft/mxc |
| Remotion | AI Application Development Infrastructure | Remotion | remotion-dev/remotion |
| llama.cpp | Local Models and Inference | llama.cpp | ggml-org/llama.cpp |
| NVIDIA Model Optimizer | Local Models and Inference | NVIDIA Model Optimizer | NVIDIA/Model-Optimizer |
| English-level-up-tips | General Development, Operations, and Learning Resources | English-level-up-tips | byoungd/English-level-up-tips |
| ai-engineering-from-scratch | General Development, Operations, and Learning Resources | ai-engineering-from-scratch | rohitg00/ai-engineering-from-scratch |
| NGINX | General Development, Operations, and Learning Resources | NGINX | nginx/nginx |
| Trivy | General Development, Operations, and Learning Resources | Trivy | aquasecurity/trivy |
| Go | General Development, Operations, and Learning Resources | Go | golang/go |
| Jimmy | General Development, Operations, and Learning Resources | Jimmy | marph91/jimmy |
| Joplin | General Development, Operations, and Learning Resources | Joplin | laurent22/joplin |
How to Use This Table
If you are a developer, start with AI Coding and Coding Agents, Agent Skills and Workflows, and AI Application Development Infrastructure.
If you are building knowledge bases, private document Q&A, or enterprise RAG, start with RAG, Knowledge Bases, and Memory, especially OpenKB, PageIndex, MinerU, PaddleOCR, and turbovec.
If you work on content or interaction experiences, start with Multimodal and Content Creation. The newly added Open-LLM-VTuber is especially useful for studying local voice characters, Live2D, and LLM integration.
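If you keep a local copy of these tables, filtering by category can be scripted. The following is a minimal sketch, not part of the original index: it assumes the table is available as Markdown pipe-table text, and the helper names `parse_rows`, `by_category`, and `repo_url` are hypothetical. The sample rows are copied from the Latest Additions table, and the only outside fact used is that an `owner/repo` slug maps to `https://github.com/owner/repo`.

```python
# Sample rows copied from the "Latest Additions From the Site" table.
TABLE = """\
| Project | Category | Article | GitHub |
|---|---|---|---|
| MinerU | RAG, Knowledge Bases, and Memory | MinerU | opendatalab/MinerU |
| turbovec | RAG, Knowledge Bases, and Memory | turbovec | RyanCodrai/turbovec |
| Open-LLM-VTuber | Multimodal and Content Creation | Open-LLM-VTuber | Open-LLM-VTuber/Open-LLM-VTuber |
| llama.cpp | Local Models and Inference | llama.cpp | ggml-org/llama.cpp |
"""


def parse_rows(table: str) -> list[dict[str, str]]:
    """Parse a Markdown pipe table into a list of dicts keyed by header name."""
    lines = [ln.strip() for ln in table.splitlines() if ln.strip()]
    header = [cell.strip() for cell in lines[0].strip("|").split("|")]
    rows = []
    for ln in lines[2:]:  # skip the header row and the |---| separator row
        cells = [cell.strip() for cell in ln.strip("|").split("|")]
        rows.append(dict(zip(header, cells)))
    return rows


def by_category(rows: list[dict[str, str]], category: str) -> list[dict[str, str]]:
    """Keep only the rows whose Category column matches exactly."""
    return [row for row in rows if row["Category"] == category]


def repo_url(slug: str) -> str:
    """Turn an owner/repo slug from the GitHub column into a repository URL."""
    return f"https://github.com/{slug}"


if __name__ == "__main__":
    rag = by_category(parse_rows(TABLE), "RAG, Knowledge Bases, and Memory")
    for row in rag:
        print(row["Project"], repo_url(row["GitHub"]))
```

The parser splits on `|` only, so it relies on cells containing no pipe characters, which holds for the tables on this page. The older per-category tables have no Category column, so filtering them this way would first require adding one.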