GitHub AI Open Source Project Categories: From Coding Agents to RAG, Local Voice Characters, and Automation Tools

An updated site index of GitHub AI projects, covering AI coding, Agent workflows, RAG knowledge bases, multimodal content, local models, automation, and AI application infrastructure.

This page groups GitHub AI projects by application direction, covering AI coding and Coding Agents, agent skills and workflows, RAG and knowledge bases, multimodal creation, local models and inference, vertical applications and automation, and AI application development infrastructure. New projects can be added later using the same structure.

Category Summary

| Category | Projects | Best For |
| --- | --- | --- |
| AI Coding and Coding Agents | 31 | Developers using Claude Code, Codex, Cursor, terminal Agents, or repo automation |
| Agent Skills and Workflows | 13 | People standardizing AI coding, research, job search, trend analysis, or creative workflows |
| RAG, Knowledge Bases, and Memory | 13 | People building document retrieval, knowledge bases, memory, OCR, parsing, and local vector indexes |
| Multimodal and Content Creation | 11 | People working with images, video, voice, digital humans, Live2D characters, and distribution |
| Vertical Applications and Automation | 12 | People exploring finance, trading, marketplace monitoring, desktop, browser, and mobile automation |
| AI Application Development Infrastructure | 10 | Builders of AI apps, Agent toolchains, browser automation, AI frontend interaction, and API gateways |
| Local Models and Inference | 3 | Users focused on local models, quantization, inference engines, and hardware adaptation |
| General Development, Operations, and Learning Resources | 11 | People filling gaps in infrastructure, frameworks, operations, security, and learning |

AI Coding and Coding Agents

This group focuses on code understanding, code modification, engineering workflows, and terminal agents. It is the largest group; the table below lists 22 projects, and more recent entries appear under Latest Additions From the Site.

| Project | Article | GitHub | Core Use | Best For |
| --- | --- | --- | --- | --- |
| Ralph | Ralph: turning Claude Code and Amp into an autonomous development loop | snarktank/ralph | Drive Claude Code / Amp through PRD, planning, execution, and review loops | Users who want a more straightforward agent coding process |
| Claude-Mem | Claude-Mem: long-term cross-session memory for Claude Code | thedotmack/claude-mem | Add cross-session memory to Claude Code | Heavy Claude Code users |
| Claude Code Hooks Mastery | Claude Code Hooks Mastery: getting started with 13 hook lifecycle stages | disler/claude-code-hooks-mastery | Learn the Claude Code hooks lifecycle and automation control | Users who want to customize Claude Code workflows |
| Compound Engineering Plugin | Compound Engineering Plugin: turning AI coding into planning, execution, and review loops | EveryInc/compound-engineering-plugin | Split AI coding into planning, execution, and review cycles | Users who care about engineering discipline in AI coding |
| free-claude-code | free-claude-code: connecting Claude Code to OpenRouter, DeepSeek, and local models | Alishahryar1/free-claude-code | Use a proxy to connect Claude Code to different model backends | Users who want to reduce Claude Code cost |
| Hermes Agent | What is Hermes Agent: overview, strengths, quick start, and OpenClaw comparison | NousResearch/hermes-agent | Local agent framework with tool calling and task execution | Users who want to run local agents |
| OpenHarness | What OpenHarness can do as an open source agent harness | HKUDS/OpenHarness | Agent harness and multi-agent execution framework | Users researching agent orchestration |
| CodexBridge | Using Codex with domestic LLMs: OpenAI-compatible APIs and CodexBridge | begonia599/CodexBridge | Connect Codex to OpenAI-compatible model APIs | Users who want Codex with domestic models |
| ccx | Using CCX to manage OpenAI-compatible APIs for Codex and domestic models | BenedictKing/ccx | Manage API proxies for Claude, Codex, Gemini, and more | Multi-model switching users |
| cc-haha | cc-haha: a desktop workspace for Claude Code | NanmiCoder/cc-haha | Desktop workspace and Computer Use entry for Claude Code | Claude Code users who prefer a GUI |
| DeepSeek-TUI | DeepSeek-TUI: turning DeepSeek V4 into a terminal coding agent | Hmbown/DeepSeek-TUI | Run a DeepSeek coding agent in the terminal | DeepSeek and command-line users |
| Open Design | Open Design: turning Claude Code and Codex into AI design tools | nexu-io/open-design | Bring Claude Code / Codex into design generation | Users who want agents for design prototypes |
| agentmemory | agentmemory: persistent memory for Claude Code, Codex, and Cursor | rohitg00/agentmemory | Add persistent memory to coding agents | Developers maintaining long-running projects |
| Graphify | Graphify: turning a codebase into an AI-queryable knowledge graph | safishamsi/graphify | Convert a codebase into a knowledge graph to reduce repeated file reads | Large-codebase users |
| oh-my-pi | What is oh-my-pi? An AI coding assistant that connects terminal, IDE, and debugger | can1357/oh-my-pi | Connect terminal, IDE, LSP, and debugger as a local AI coding console | Developers who want to unify CLI and IDE workflows |
| Claude Plugins Official | Claude Code now has a plugin directory: what to install, how to install it, and what to watch | anthropics/claude-plugins-official | Official Claude Code plugin directory and installation entry point | Users who want to extend Claude Code |
| CodeGraph | What is CodeGraph? A local code map for Claude Code, Codex, and Cursor | colbymchenry/codegraph | Generate local indexes and relationship graphs to help Coding Agents understand projects | Developers maintaining medium-to-large codebases |
| CC Switch | CC Switch: managing Claude Code, Codex, Gemini CLI, and OpenClaw in one desktop tool | farion1231/cc-switch | Manage multiple AI CLI tools and account/config switching | Users of multiple CLI tools |
| Warp | Warp open source: from terminal to Agentic Development Environment | warpdotdev/warp | Agentic terminal and development environment | Heavy terminal users |
| opencode | opencode vs Claude Code vs Codex: open source AI coding tools guide | anomalyco/opencode | Open source AI coding agent | Users looking for Claude Code / Codex alternatives |
| 9Router | 9Router: connecting Claude Code, Codex, and Cursor to one AI router | decolua/9router | AI coding model routing and token cost control | Multi-tool, multi-model users |
| goose | goose: an open source AI Agent across desktop, CLI, and API | aaif-goose/goose | Open source agent across desktop, CLI, and API | Users who want a general agent workspace |

Agent Skills and Workflows

This group focuses on turning AI capabilities into repeatable skills, processes, and specifications. It includes 7 projects.

| Project | Article | GitHub | Core Use | Best For |
| --- | --- | --- | --- | --- |
| mattpocock/skills | Rejecting Vibe Coding: Matt Pocock’s skills repo adds engineering constraints to AI coding | mattpocock/skills | Use skills to constrain AI coding workflows | Users who want engineering discipline for agents |
| Superpowers | Superpowers: bringing coding agents back into engineering workflows | obra/superpowers | Agentic skills framework and software development methodology | Users who want systematic coding-agent workflows |
| Prompt-Vault | Prompt-Vault: a prompt specification library for testing AI coding ability | w512/Prompt-Vault | Collect prompt specs for testing AI coding ability | Model and tool evaluators |
| web-video-presentation | web-video-presentation: an agent skill for turning articles into recordable web videos | ConardLi/garden-skills | Turn articles into recordable web videos | Content creators and automation users |
| nuwa-skill | nuwa-skill: turning “distilling a person” into an executable workflow | alchaincyf/nuwa-skill | Recreate a person’s expression and thinking flow with a skill | Users building style-based agents |
| Scientific Agent Skills | Scientific Agent Skills: giving research workflows to AI agents | K-Dense-AI/scientific-agent-skills | Skill collection for scientific workflows | Researchers, data analysts, and technical writers |
| easy-vibe | easy-vibe: a learning map for Vibe Coding beginners | datawhalechina/easy-vibe | Learning map for Vibe Coding | AI coding beginners |

RAG, Knowledge Bases, and Memory

This group addresses document retrieval, knowledge base construction, long-term memory, and structured extraction. It includes 7 projects.

| Project | Article | GitHub | Core Use | Best For |
| --- | --- | --- | --- | --- |
| LangExtract | Google LangExtract: extracting structured data from long text with LLMs | google/langextract | Extract structured information from long text | Information extraction and data processing users |
| qmd | qmd: local Markdown document search for AI agents | tobi/qmd | Local Markdown document search | Users managing knowledge in Markdown |
| Firecrawl | Firecrawl: web search, crawling, and interaction API for AI agents | firecrawl/firecrawl | Web crawling, search, and structured data entry point | RAG and agent data-ingestion users |
| RAGFlow | RAGFlow: features and usage of an open source RAG engine | infiniflow/ragflow | Open source RAG engine | Enterprise knowledge base and document Q&A users |
| OpenHuman | OpenHuman: the desktop route for open source personal AI agents | tinyhumansai/openhuman | Local-first personal AI agent and memory layer | Users who want to integrate personal data |
| OpenKB | OpenKB: compiling documents into continuously updated LLM knowledge bases | VectifyAI/OpenKB | Compile documents into updatable knowledge bases | Documentation knowledge-base maintainers |
| PageIndex | PageIndex: reasoning-style RAG document indexing without vector databases | VectifyAI/PageIndex | Reasoning-style document indexing without vector databases | Users watching new RAG approaches |

Multimodal and Content Creation

This group covers image, video, transcription, and content distribution scenarios. It includes 5 projects.

| Project | Article | GitHub | Core Use | Best For |
| --- | --- | --- | --- | --- |
| rembg | rembg: local image background removal tool | danielgatis/rembg | Local image background removal | E-commerce, design, and image-processing users |
| awesome-gpt-image-2-prompts | GPT-Image 2 prompt library: e-commerce, posters, portraits, and UI | EvoLinkAI/awesome-gpt-image-2-prompts | GPT-Image 2 prompts and case library | AI art and prompt users |
| faster-whisper | faster-whisper: a faster Whisper transcription engine | SYSTRAN/faster-whisper | High-performance speech-to-text | Subtitle, transcription, and speech-processing users |
| Pixelle-Video | Pixelle-Video: an open source AI engine for generating short videos from one topic | AIDC-AI/Pixelle-Video | One-topic short-video generation workflow | Short-video and AIGC creators |
| AiToEarn | Too many content platforms? AiToEarn uses AI agents to help creators save effort | yikart/AiToEarn | Multi-platform content distribution and creator automation | Content operators and creators |

Local Models and Inference

This group focuses on local model runtimes and inference experiments. It is currently the smallest group; the table below lists 1 project.

| Project | Article | GitHub | Core Use | Best For |
| --- | --- | --- | --- | --- |
| ds4 | Running DeepSeek 4 locally: Antirez ds4 on Apple Silicon Mac | antirez/ds4 | Experiment with running DeepSeek 4 on Apple Silicon | Local model and inference experiment users |

Vertical Applications and Automation

This group applies agents or AI capabilities to finance, trading, browsers, desktops, e-commerce monitoring, and other concrete scenarios. It includes 7 projects.

| Project | Article | GitHub | Core Use | Best For |
| --- | --- | --- | --- | --- |
| TradingAgents-CN | TradingAgents-CN: a multi-agent financial trading research framework for Chinese users | hsliuping/TradingAgents-CN | Multi-agent financial trading research framework | Quant, finance, and agent researchers |
| FinceptTerminal | FinceptTerminal: open source financial terminal, quant research, and AI Agent workspace | Fincept-Corporation/FinceptTerminal | Financial terminal, quant research, and AI agent workspace | Financial analysis and quant users |
| Anthropic financial-services | Anthropic financial-services: reusable templates for financial agent scenarios | anthropics/financial-services | Financial services agent templates | Users building financial AI solutions |
| ai-goofish-monitor | ai-goofish-monitor: open source AI monitoring system for Xianyu products | Usagi-org/ai-goofish-monitor | AI product monitoring and Xianyu automation | Second-hand marketplace monitoring users |
| CloakBrowser | CloakBrowser: a more human-like browser for Playwright and Puppeteer | CloakHQ/CloakBrowser | More human-like browser automation environment | Browser automation and agent operation scenarios |
| UI-TARS-desktop | Let AI operate the computer? UI-TARS-desktop connects desktop, browser, and tools | bytedance/UI-TARS-desktop | Desktop, browser, and tool operation agent | Users who want AI to operate computers |
| AI-Trader | What is AI-Trader: a platform for AI agents to publish trading signals and run simulations | HKUDS/AI-Trader | AI agent trading signals and simulated trading platform | Financial agent and trading researchers |

AI Application Development Infrastructure

This group provides foundational components for building AI applications and agent toolchains. It includes 5 projects.

| Project | Article | GitHub | Core Use | Best For |
| --- | --- | --- | --- | --- |
| Prompt Optimizer | Prompt Optimizer: open source prompt optimization, testing, and MCP tools | linshenkx/prompt-optimizer | Prompt optimization, testing, and MCP tools | Prompt engineering and app-tuning users |
| Playwright CLI | Playwright CLI basics: installation, skills, sessions, and common commands | microsoft/playwright-cli | Browser automation CLI for coding agents | Agent users who need browser operation |
| Vercel AI SDK | What is Vercel AI SDK? A unified toolkit for TypeScript AI apps | vercel/ai | TypeScript AI application SDK | Front-end and full-stack developers |
| CLIProxyAPI | CLIProxyAPI: wrapping Codex, Claude Code, and Gemini CLI into unified APIs | router-for-me/CLIProxyAPI | Wrap multiple AI CLIs and OAuth login states as compatible APIs | Users who want unified access to Codex, Claude Code, and Gemini CLI |
| CLIProxyAPI Management Center | CLIProxyAPI Management Center: a visual admin console for CLIProxyAPI | router-for-me/Cli-Proxy-API-Management-Center | Web admin UI for CLIProxyAPI configuration, accounts, logs, and OAuth | Users running CLIProxyAPI as a team gateway or account pool |

Latest Additions From the Site

These projects were added after the earlier translated index was published. Listing them here brings this translated page in sync with the updated Chinese source.

| Project | Category | Article | GitHub |
| --- | --- | --- | --- |
| OpenAI Symphony | AI Coding and Coding Agents | OpenAI Symphony | openai/symphony |
| GitHub Spec Kit | AI Coding and Coding Agents | GitHub Spec Kit | github/spec-kit |
| oh-my-codex | AI Coding and Coding Agents | oh-my-codex | Yeachan-Heo/oh-my-codex |
| RTK | AI Coding and Coding Agents | RTK | rtk-ai/rtk |
| Headroom | AI Coding and Coding Agents | Headroom | chopratejas/headroom |
| DeepSeek-Reasonix | AI Coding and Coding Agents | DeepSeek-Reasonix | esengine/DeepSeek-Reasonix |
| Understand-Anything | AI Coding and Coding Agents | Understand-Anything | Lum1104/Understand-Anything |
| Taste Skill | Agent Skills and Workflows | Taste Skill | Leonxlnx/taste-skill |
| academic-research-skills | Agent Skills and Workflows | academic-research-skills | Imbad0202/academic-research-skills |
| last30days-skill | Agent Skills and Workflows | last30days-skill | mvanhorn/last30days-skill |
| Agent-Reach | Agent Skills and Workflows | Agent-Reach | Panniantong/Agent-Reach |
| Personal AI Infrastructure | Agent Skills and Workflows | Personal AI Infrastructure | danielmiessler/Personal_AI_Infrastructure |
| career-ops | Agent Skills and Workflows | career-ops | santifer/career-ops |
| EverOS | RAG, Knowledge Bases, and Memory | EverOS | EverMind-AI/EverOS |
| MemPalace | RAG, Knowledge Bases, and Memory | MemPalace | MemPalace/mempalace |
| open-notebook | RAG, Knowledge Bases, and Memory | open-notebook | lfnovo/open-notebook |
| PaddleOCR | RAG, Knowledge Bases, and Memory | PaddleOCR | PaddlePaddle/PaddleOCR |
| MinerU | RAG, Knowledge Bases, and Memory | MinerU | opendatalab/MinerU |
| turbovec | RAG, Knowledge Bases, and Memory | turbovec | RyanCodrai/turbovec |
| HyperFrames | Multimodal and Content Creation | HyperFrames | heygen-com/hyperframes |
| OpenAI Whisper | Multimodal and Content Creation | OpenAI Whisper | openai/whisper |
| VibeVoice | Multimodal and Content Creation | VibeVoice | microsoft/VibeVoice |
| LongCat-Video | Multimodal and Content Creation | LongCat-Video | meituan-longcat/LongCat-Video |
| WavFlow | Multimodal and Content Creation | WavFlow | facebookresearch/WavFlow |
| Open-LLM-VTuber | Multimodal and Content Creation | Open-LLM-VTuber | Open-LLM-VTuber/Open-LLM-VTuber |
| xianyu-auto-reply | Vertical Applications and Automation | xianyu-auto-reply | zhinianboke/xianyu-auto-reply |
| MobiAgent | Vertical Applications and Automation | MobiAgent | IPADS-SAI/MobiAgent |
| Mobile-Agent | Vertical Applications and Automation | Mobile-Agent | X-PLUG/MobileAgent |
| Mobilerun | Vertical Applications and Automation | Mobilerun | droidrun/mobilerun |
| mobile-use | Vertical Applications and Automation | mobile-use | minitap-ai/mobile-use |
| browser-harness | AI Application Development Infrastructure | browser-harness | browser-use/browser-harness |
| CopilotKit | AI Application Development Infrastructure | CopilotKit | CopilotKit/CopilotKit |
| Vite | AI Application Development Infrastructure | Vite | vitejs/vite |
| Svelte | AI Application Development Infrastructure | Svelte | sveltejs/svelte |
| Microsoft MXC | AI Application Development Infrastructure | Microsoft MXC | microsoft/mxc |
| Remotion | AI Application Development Infrastructure | Remotion | remotion-dev/remotion |
| llama.cpp | Local Models and Inference | llama.cpp | ggml-org/llama.cpp |
| NVIDIA Model Optimizer | Local Models and Inference | NVIDIA Model Optimizer | NVIDIA/Model-Optimizer |
| English-level-up-tips | General Development, Operations, and Learning Resources | English-level-up-tips | byoungd/English-level-up-tips |
| ai-engineering-from-scratch | General Development, Operations, and Learning Resources | ai-engineering-from-scratch | rohitg00/ai-engineering-from-scratch |
| NGINX | General Development, Operations, and Learning Resources | NGINX | nginx/nginx |
| Trivy | General Development, Operations, and Learning Resources | Trivy | aquasecurity/trivy |
| Go | General Development, Operations, and Learning Resources | Go | golang/go |
| Jimmy | General Development, Operations, and Learning Resources | Jimmy | marph91/jimmy |
| Joplin | General Development, Operations, and Learning Resources | Joplin | laurent22/joplin |
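
The GitHub column on this page gives each repository as an owner/repo slug rather than a full link. As a minimal sketch, assuming the usual https://github.com/owner/repo address form, a slug can be expanded into a repository URL. The helper name repo_url is hypothetical and not part of the site.

```python
# Hypothetical helper for illustration; the name repo_url is not from the site.
def repo_url(slug: str) -> str:
    """Expand an 'owner/repo' slug from the GitHub column into a repository URL."""
    owner, sep, repo = slug.strip().partition("/")
    # Reject anything that is not exactly one owner and one repository name.
    if not sep or not owner or not repo or "/" in repo:
        raise ValueError(f"expected 'owner/repo', got {slug!r}")
    return f"https://github.com/{owner}/{repo}"


# A few slugs copied from the tables on this page.
slugs = ["ggml-org/llama.cpp", "openai/whisper", "PaddlePaddle/PaddleOCR"]
urls = [repo_url(s) for s in slugs]
for url in urls:
    print(url)
```

The validation step is there because a row with a missing or malformed slug would otherwise produce a link that looks valid but points nowhere.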

How to Use This Table

If you are a developer, start with AI Coding and Coding Agents, Agent Skills and Workflows, and AI Application Development Infrastructure.

If you are building knowledge bases, private document QA, or enterprise RAG, start with RAG, Knowledge Bases, and Memory, especially OpenKB, PageIndex, MinerU, PaddleOCR, and turbovec.

If you work on content or interaction experiences, start with Multimodal and Content Creation. The newly added Open-LLM-VTuber is especially useful for studying local voice characters, Live2D, and LLM integration.
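
The reading guide above can also be written down as a small lookup, which may help if you script your own reading list from this index. This is only a sketch: the goal keys ("developer", "knowledge_base", "content") and the names StartingPoint, GUIDE, and where_to_start are hypothetical labels, while the category and project names are the ones used on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StartingPoint:
    """Categories to read first, plus any projects the guide singles out."""
    categories: tuple[str, ...]
    highlights: tuple[str, ...] = ()


# Goal keys are hypothetical labels; values restate the guidance on this page.
GUIDE = {
    "developer": StartingPoint(
        categories=(
            "AI Coding and Coding Agents",
            "Agent Skills and Workflows",
            "AI Application Development Infrastructure",
        ),
    ),
    "knowledge_base": StartingPoint(
        categories=("RAG, Knowledge Bases, and Memory",),
        highlights=("OpenKB", "PageIndex", "MinerU", "PaddleOCR", "turbovec"),
    ),
    "content": StartingPoint(
        categories=("Multimodal and Content Creation",),
        highlights=("Open-LLM-VTuber",),
    ),
}


def where_to_start(goal: str) -> StartingPoint:
    """Return the suggested starting point for a goal, or raise on unknown goals."""
    try:
        return GUIDE[goal]
    except KeyError:
        raise ValueError(f"unknown goal {goal!r}; choose from {sorted(GUIDE)}") from None


start = where_to_start("knowledge_base")
print(start.categories[0], "->", ", ".join(start.highlights))
```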
