GitHub AI Open Source Project Categories: From Coding Agents to RAG, Local Voice Characters, and Automation Tools

This page groups GitHub AI projects by application direction, covering AI coding and Coding Agents, agent skills and workflows, RAG and knowledge bases, multimodal creation, local models and inference, vertical applications and automation, and AI application development infrastructure. New projects can be added later using the same structure.

Category Summary

Category	Projects	Best for
AI Coding and Coding Agents	31	Developers using Claude Code, Codex, Cursor, terminal Agents, or repo automation
Agent Skills and Workflows	13	People standardizing AI coding, research, job search, trend analysis, or creative workflows
RAG, Knowledge Bases, and Memory	13	People building document retrieval, knowledge bases, memory, OCR, parsing, and local vector indexes
Multimodal and Content Creation	11	People working with images, video, voice, digital humans, Live2D characters, and distribution
Vertical Applications and Automation	12	People exploring finance, trading, marketplace monitoring, desktop, browser, and mobile automation
AI Application Development Infrastructure	10	Builders of AI apps, Agent toolchains, browser automation, AI frontend interaction, and API gateways
Local Models and Inference	3	Users focused on local models, quantization, inference engines, and hardware adaptation
General Development, Operations, and Learning Resources	11	People filling gaps in infrastructure, frameworks, operations, security, and learning

AI Coding and Coding Agents

This group focuses on code understanding, code modification, engineering workflows, and terminal agents. It is the largest group; the table below lists 22 projects.

Project	Article	GitHub	Core Use	Best For
Ralph	Ralph: turning Claude Code and Amp into an autonomous development loop	snarktank/ralph	Drive Claude Code / Amp through PRD, planning, execution, and review loops	Users who want a more streamlined agent coding process
Claude-Mem	Claude-Mem: long-term cross-session memory for Claude Code	thedotmack/claude-mem	Add cross-session memory to Claude Code	Heavy Claude Code users
Claude Code Hooks Mastery	Claude Code Hooks Mastery: getting started with 13 hooks lifecycle stages	disler/claude-code-hooks-mastery	Learn Claude Code hooks lifecycle and automation control	Users who want to customize Claude Code workflows
Compound Engineering Plugin	Compound Engineering Plugin: turning AI coding into planning, execution, and review loops	EveryInc/compound-engineering-plugin	Split AI coding into planning, execution, and review cycles	Users who care about engineering discipline in AI coding
free-claude-code	free-claude-code: connecting Claude Code to OpenRouter, DeepSeek, and local models	Alishahryar1/free-claude-code	Use a proxy to connect Claude Code to different model backends	Users who want to reduce Claude Code cost
Hermes Agent	What is Hermes Agent: overview, strengths, quick start, and OpenClaw comparison	NousResearch/hermes-agent	Local agent framework with tool calling and task execution	Users who want to run local agents
OpenHarness	What OpenHarness can do as an open source agent harness	HKUDS/OpenHarness	Agent harness and multi-agent execution framework	Users researching agent orchestration
CodexBridge	Using Codex with domestic LLMs: OpenAI-compatible APIs and CodexBridge	begonia599/CodexBridge	Connect Codex to OpenAI-compatible model APIs	Users who want Codex with domestic models
ccx	Using CCX to manage OpenAI-compatible APIs for Codex and domestic models	BenedictKing/ccx	Manage API proxies for Claude, Codex, Gemini, and more	Multi-model switching users
cc-haha	cc-haha: a desktop workspace for Claude Code	NanmiCoder/cc-haha	Desktop workspace and Computer Use entry for Claude Code	Claude Code users who prefer a GUI
DeepSeek-TUI	DeepSeek-TUI: turning DeepSeek V4 into a terminal coding agent	Hmbown/DeepSeek-TUI	Run a DeepSeek coding agent in the terminal	DeepSeek and command-line users
Open Design	Open Design: turning Claude Code and Codex into AI design tools	nexu-io/open-design	Bring Claude Code / Codex into design generation	Users who want agents for design prototypes
agentmemory	agentmemory: persistent memory for Claude Code, Codex, and Cursor	rohitg00/agentmemory	Add persistent memory to coding agents	Developers maintaining long-running projects
Graphify	Graphify: turning a codebase into an AI-queryable knowledge graph	safishamsi/graphify	Convert a codebase into a knowledge graph to reduce repeated file reads	Large-codebase users
oh-my-pi	What is oh-my-pi? Setup guide for the AI coding assistant that connects terminal, IDE, and debugger	can1357/oh-my-pi	Connect terminal, IDE, LSP, and debugger as a local AI coding console	Developers who want to unify CLI and IDE workflows
Claude Plugins Official	Claude Code now has a plugin directory: what to install, how to install it, and what to watch	anthropics/claude-plugins-official	Official Claude Code plugin directory and installation entry point	Users who want to extend Claude Code
CodeGraph	What is CodeGraph? Local code knowledge graph setup, usage, and Claude/Codex tests	colbymchenry/codegraph	Generate local indexes and relationship graphs to help Coding Agents understand projects	Developers maintaining medium-to-large codebases
CC Switch	CC Switch: managing Claude Code, Codex, Gemini CLI, and OpenClaw in one desktop tool	farion1231/cc-switch	Manage multiple AI CLI tools and account/config switching	Users of multiple CLI tools
Warp	Warp open source: from terminal to Agentic Development Environment	warpdotdev/warp	Agentic terminal and development environment	Heavy terminal users
opencode	opencode vs Claude Code vs Codex: open source AI coding tools guide	anomalyco/opencode	Open source AI coding agent	Users looking for Claude Code / Codex alternatives
9Router	9Router: connecting Claude Code, Codex, and Cursor to one AI router	decolua/9router	AI coding model routing and token cost control	Multi-tool, multi-model users
goose	goose: an open source AI Agent across desktop, CLI, and API	aaif-goose/goose	Open source agent across desktop, CLI, and API	Users who want a general agent workspace

Agent Skills and Workflows

This group focuses on turning AI capabilities into repeatable skills, processes, and specifications. The table below lists 7 projects.

Project	Article	GitHub	Core Use	Best For
mattpocock/skills	Rejecting Vibe Coding: Matt Pocock’s skills repo adds engineering constraints to AI coding	mattpocock/skills	Use skills to constrain AI coding workflows	Users who want engineering discipline for agents
Superpowers	Superpowers: bringing coding agents back into engineering workflows	obra/superpowers	Agentic skills framework and software development methodology	Users who want systematic coding-agent workflows
Prompt-Vault	Prompt-Vault: a prompt specification library for testing AI coding ability	w512/Prompt-Vault	Collect prompt specs for testing AI coding ability	Model and tool evaluators
web-video-presentation	web-video-presentation: an agent skill for turning articles into recordable web videos	ConardLi/garden-skills	Turn articles into recordable web videos	Content creators and automation users
nuwa-skill	nuwa-skill: turning “distilling a person” into an executable workflow	alchaincyf/nuwa-skill	Recreate a person’s expression and thinking flow with a skill	Users building style-based agents
Scientific Agent Skills	Scientific Agent Skills: giving research workflows to AI agents	K-Dense-AI/scientific-agent-skills	Skill collection for scientific workflows	Researchers, data analysts, and technical writers
easy-vibe	easy-vibe: a learning map for Vibe Coding beginners	datawhalechina/easy-vibe	Learning map for Vibe Coding	AI coding beginners

RAG, Knowledge Bases, and Memory

This group addresses document retrieval, knowledge base construction, long-term memory, and structured extraction. The table below lists 7 projects.

Project	Article	GitHub	Core Use	Best For
LangExtract	Google LangExtract: extracting structured data from long text with LLMs	google/langextract	Extract structured information from long text	Information extraction and data processing users
qmd	qmd: local Markdown document search for AI agents	tobi/qmd	Local Markdown document search	Users managing knowledge in Markdown
Firecrawl	Firecrawl: web search, crawling, and interaction API for AI agents	firecrawl/firecrawl	Web crawling, search, and structured data entry point	RAG and agent data-ingestion users
RAGFlow	RAGFlow: features and usage of an open source RAG engine	infiniflow/ragflow	Open source RAG engine	Enterprise knowledge base and document Q&A users
OpenHuman	OpenHuman: the desktop route for open source personal AI agents	tinyhumansai/openhuman	Local-first personal AI agent and memory layer	Users who want to integrate personal data
OpenKB	OpenKB: compiling documents into continuously updated LLM knowledge bases	VectifyAI/OpenKB	Compile documents into updatable knowledge bases	Documentation knowledge-base maintainers
PageIndex	PageIndex: reasoning-style RAG document indexing without vector databases	VectifyAI/PageIndex	Reasoning-style document indexing without vector databases	Users watching new RAG approaches

Multimodal and Content Creation

This group covers image, video, transcription, and content distribution scenarios. The table below lists 5 projects.

Project	Article	GitHub	Core Use	Best For
rembg	rembg: local image background removal tool	danielgatis/rembg	Local image background removal	E-commerce, design, and image-processing users
awesome-gpt-image-2-prompts	GPT-Image 2 prompt library: e-commerce, posters, portraits, and UI	EvoLinkAI/awesome-gpt-image-2-prompts	GPT-Image 2 prompts and case library	AI art and prompt users
faster-whisper	faster-whisper: a faster Whisper transcription engine	SYSTRAN/faster-whisper	High-performance speech-to-text	Subtitle, transcription, and speech-processing users
Pixelle-Video	Pixelle-Video: an open source AI engine for generating short videos from one topic	AIDC-AI/Pixelle-Video	One-topic short-video generation workflow	Short-video and AIGC creators
AiToEarn	Too many content platforms? AiToEarn uses AI agents to help creators save effort	yikart/AiToEarn	Multi-platform content distribution and creator automation	Content operators and creators

Local Models and Inference

This group focuses on local model runtimes and inference experiments. It is currently the smallest group; the table below lists 1 project.

Project	Article	GitHub	Core Use	Best For
ds4	Running DeepSeek 4 locally: Antirez ds4 on Apple Silicon Mac	antirez/ds4	Experiment with running DeepSeek 4 on Apple Silicon	Local model and inference experiment users

Vertical Applications and Automation

This group applies agents or AI capabilities to finance, trading, browsers, desktops, e-commerce monitoring, and other concrete scenarios. The table below lists 7 projects.

Project	Article	GitHub	Core Use	Best For
TradingAgents-CN	TradingAgents-CN: a multi-agent financial trading research framework for Chinese users	hsliuping/TradingAgents-CN	Multi-agent financial trading research framework	Quant, finance, and agent researchers
FinceptTerminal	FinceptTerminal: open source financial terminal, quant research, and AI Agent workspace	Fincept-Corporation/FinceptTerminal	Financial terminal, quant research, and AI agent workspace	Financial analysis and quant users
Anthropic financial-services	Anthropic financial-services: reusable templates for financial agent scenarios	anthropics/financial-services	Financial services agent templates	Users building financial AI solutions
ai-goofish-monitor	ai-goofish-monitor: open source AI monitoring system for Xianyu products	Usagi-org/ai-goofish-monitor	AI product monitoring and Xianyu automation	Second-hand marketplace monitoring users
CloakBrowser	CloakBrowser: a more human-like browser for Playwright and Puppeteer	CloakHQ/CloakBrowser	More human-like browser automation environment	Browser automation and agent operation scenarios
UI-TARS-desktop	Let AI operate the computer? UI-TARS-desktop connects desktop, browser, and tools	bytedance/UI-TARS-desktop	Desktop, browser, and tool operation agent	Users who want AI to operate computers
AI-Trader	What is AI-Trader: a platform for AI agents to publish trading signals and run simulations	HKUDS/AI-Trader	AI agent trading signals and simulated trading platform	Financial agent and trading researchers

AI Application Development Infrastructure

This group provides foundational components for building AI applications and agent toolchains. The table below lists 5 projects.

Project	Article	GitHub	Core Use	Best For
Prompt Optimizer	Prompt Optimizer: open source prompt optimization, testing, and MCP tools	linshenkx/prompt-optimizer	Prompt optimization, testing, and MCP tools	Prompt engineering and app-tuning users
Playwright CLI	Playwright CLI basics: installation, skills, sessions, and common commands	microsoft/playwright-cli	Browser automation CLI for coding agents	Agent users who need browser operation
Vercel AI SDK	What is Vercel AI SDK? A unified toolkit for TypeScript AI apps	vercel/ai	TypeScript AI application SDK	Front-end and full-stack developers
CLIProxyAPI	CLIProxyAPI: wrapping Codex, Claude Code, and Gemini CLI into unified APIs	router-for-me/CLIProxyAPI	Wrap multiple AI CLIs and OAuth login states as compatible APIs	Users who want unified access to Codex, Claude Code, and Gemini CLI
CLIProxyAPI Management Center	CLIProxyAPI Management Center: a visual admin console for CLIProxyAPI	router-for-me/Cli-Proxy-API-Management-Center	Web admin UI for CLIProxyAPI configuration, accounts, logs, and OAuth	Users running CLIProxyAPI as a team gateway or account pool

Latest Additions From the Site

These projects were added after the older translated index was published. Listing them here brings this translated page in sync with the updated Chinese source.

Project	Category	Article	GitHub
OpenAI Symphony	AI Coding and Coding Agents	OpenAI Symphony	openai/symphony
GitHub Spec Kit	AI Coding and Coding Agents	GitHub Spec Kit	github/spec-kit
oh-my-codex	AI Coding and Coding Agents	oh-my-codex	Yeachan-Heo/oh-my-codex
RTK	AI Coding and Coding Agents	RTK	rtk-ai/rtk
Headroom	AI Coding and Coding Agents	Headroom	chopratejas/headroom
DeepSeek-Reasonix	AI Coding and Coding Agents	DeepSeek-Reasonix	esengine/DeepSeek-Reasonix
Understand-Anything	AI Coding and Coding Agents	Understand-Anything	Lum1104/Understand-Anything
Taste Skill	Agent Skills and Workflows	Taste Skill	Leonxlnx/taste-skill
academic-research-skills	Agent Skills and Workflows	academic-research-skills	Imbad0202/academic-research-skills
last30days-skill	Agent Skills and Workflows	last30days-skill	mvanhorn/last30days-skill
Agent-Reach	Agent Skills and Workflows	Agent-Reach	Panniantong/Agent-Reach
Personal AI Infrastructure	Agent Skills and Workflows	Personal AI Infrastructure	danielmiessler/Personal_AI_Infrastructure
career-ops	Agent Skills and Workflows	career-ops	santifer/career-ops
EverOS	RAG, Knowledge Bases, and Memory	EverOS	EverMind-AI/EverOS
MemPalace	RAG, Knowledge Bases, and Memory	MemPalace	MemPalace/mempalace
open-notebook	RAG, Knowledge Bases, and Memory	open-notebook	lfnovo/open-notebook
PaddleOCR	RAG, Knowledge Bases, and Memory	PaddleOCR	PaddlePaddle/PaddleOCR
MinerU	RAG, Knowledge Bases, and Memory	MinerU	opendatalab/MinerU
turbovec	RAG, Knowledge Bases, and Memory	turbovec	RyanCodrai/turbovec
HyperFrames	Multimodal and Content Creation	HyperFrames	heygen-com/hyperframes
OpenAI Whisper	Multimodal and Content Creation	OpenAI Whisper	openai/whisper
VibeVoice	Multimodal and Content Creation	VibeVoice	microsoft/VibeVoice
LongCat-Video	Multimodal and Content Creation	LongCat-Video	meituan-longcat/LongCat-Video
WavFlow	Multimodal and Content Creation	WavFlow	facebookresearch/WavFlow
Open-LLM-VTuber	Multimodal and Content Creation	Open-LLM-VTuber	Open-LLM-VTuber/Open-LLM-VTuber
xianyu-auto-reply	Vertical Applications and Automation	xianyu-auto-reply	zhinianboke/xianyu-auto-reply
MobiAgent	Vertical Applications and Automation	MobiAgent	IPADS-SAI/MobiAgent
Mobile-Agent	Vertical Applications and Automation	Mobile-Agent	X-PLUG/MobileAgent
Mobilerun	Vertical Applications and Automation	Mobilerun	droidrun/mobilerun
mobile-use	Vertical Applications and Automation	mobile-use	minitap-ai/mobile-use
browser-harness	AI Application Development Infrastructure	browser-harness	browser-use/browser-harness
CopilotKit	AI Application Development Infrastructure	CopilotKit	CopilotKit/CopilotKit
Vite	AI Application Development Infrastructure	Vite	vitejs/vite
Svelte	AI Application Development Infrastructure	Svelte	sveltejs/svelte
Microsoft MXC	AI Application Development Infrastructure	Microsoft MXC	microsoft/mxc
Remotion	AI Application Development Infrastructure	Remotion	remotion-dev/remotion
llama.cpp	Local Models and Inference	llama.cpp	ggml-org/llama.cpp
NVIDIA Model Optimizer	Local Models and Inference	NVIDIA Model Optimizer	NVIDIA/Model-Optimizer
English-level-up-tips	General Development, Operations, and Learning Resources	English-level-up-tips	byoungd/English-level-up-tips
ai-engineering-from-scratch	General Development, Operations, and Learning Resources	ai-engineering-from-scratch	rohitg00/ai-engineering-from-scratch
NGINX	General Development, Operations, and Learning Resources	NGINX	nginx/nginx
Trivy	General Development, Operations, and Learning Resources	Trivy	aquasecurity/trivy
Go	General Development, Operations, and Learning Resources	Go	golang/go
Jimmy	General Development, Operations, and Learning Resources	Jimmy	marph91/jimmy
Joplin	General Development, Operations, and Learning Resources	Joplin	laurent22/joplin
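Readers who want to filter or tally these rows themselves can treat the table as tab-separated text. The following is only a minimal sketch, assuming rows are copied in the four-column layout used above (Project, Category, Article, GitHub) and that the GitHub column holds an `owner/repo` slug. The helper names `parse_rows`, `count_by_category`, and `repo_url` are hypothetical and not part of any project listed here.

```python
from collections import Counter

# Three sample rows copied from the table above, tab-separated:
# Project, Category, Article, GitHub.
ROWS = """\
llama.cpp\tLocal Models and Inference\tllama.cpp\tggml-org/llama.cpp
NVIDIA Model Optimizer\tLocal Models and Inference\tNVIDIA Model Optimizer\tNVIDIA/Model-Optimizer
PaddleOCR\tRAG, Knowledge Bases, and Memory\tPaddleOCR\tPaddlePaddle/PaddleOCR
"""

COLUMNS = ("project", "category", "article", "github")


def parse_rows(text):
    """Split tab-separated lines into dicts keyed by the column names."""
    return [
        dict(zip(COLUMNS, line.split("\t")))
        for line in text.splitlines()
        if line.strip()
    ]


def count_by_category(rows):
    """Tally how many projects fall under each category."""
    return Counter(row["category"] for row in rows)


def repo_url(row):
    """Build a repository URL, assuming the GitHub column is an owner/repo slug."""
    return "https://github.com/" + row["github"]


rows = parse_rows(ROWS)
counts = count_by_category(rows)

for category, total in counts.items():
    print(f"{category}: {total}")
```

Splitting on tabs rather than commas matters here because several category names, such as "RAG, Knowledge Bases, and Memory", contain commas themselves.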

How to Use These Tables

If you are a developer, start with AI Coding and Coding Agents, Agent Skills and Workflows, and AI Application Development Infrastructure.

If you are building knowledge bases, private document QA, or enterprise RAG, start with RAG, Knowledge Bases, and Memory, especially OpenKB, PageIndex, MinerU, PaddleOCR, and turbovec.

If you work on content or interaction experiences, start with Multimodal and Content Creation. The newly added Open-LLM-VTuber is especially useful for studying local voice characters, Live2D, and LLM integration.