AI Tools Directory
Browse 42 hand-verified open-source AI tools. Every tool is a real, active GitHub repository.
Stable Diffusion WebUI
157kBrowser interface for Stable Diffusion image generation. Prompt-to-image, inpainting, upscaling.
Kubernetes
113kProduction-grade container orchestration. Automates deployment, scaling, and management of containerized applications.
LangChain
105kBuild LLM-powered applications. Chains, agents, retrieval, memory for production AI apps.
Whisper (OpenAI)
78kGeneral-purpose speech recognition. Transcribes audio to text in 100+ languages.
llama.cpp
76kLLM inference in C/C++. Run LLaMA, Mistral, Gemma models efficiently on CPU.
Open WebUI
75kSelf-hosted ChatGPT interface. Works with Ollama, OpenAI, any OpenAI-compatible API.
Ansible
64kAgentless IT automation platform. Automate configuration management, application deployment, and task orchestration.
n8n
61kFair-code workflow automation. 400+ integrations, visual builder, AI agent workflows, self-hostable.
Act
58kRun GitHub Actions locally. Test your CI/CD pipelines on your own machine before pushing.
PrivateGPT
55kAsk questions to your documents without internet. Secure RAG with local LLMs, 100% private document interaction.
FaceSwap
53.8kFace swapping and face reenactment pipeline. Swap faces in images and video with AI.
Segment Anything
49.4kMeta's image segmentation model. Segment any object in any image with prompts.
TTS (Coqui)
45.3kDeep learning text-to-speech toolkit. 1100+ languages, multiple voices, fine-tunable models.
Terraform
44kInfrastructure as Code tool. Provision and manage cloud resources across 300+ providers declaratively.
Whisper.cpp
40.5kHigh-performance speech-to-text in C/C++. Runs Whisper models locally on CPU efficiently.
ColossalAI
40kLarge-scale AI training framework. Parallel training, fine-tuning, and inference of LLMs.
Bark (Suno)
39.1kText-prompted generative audio model. Generates speech, music, sound effects from text.
LlamaIndex
39kData framework for LLM applications. Index, retrieve, and query your data for RAG.
Streamlit
38.5kTurn Python scripts into web apps. Fastest way to build data/ML app UIs with pure Python.
Gradio
37kBuild and share machine learning demos and web apps in Python. Create interactive model interfaces with a few lines of code.
Claude Code Plugins
35kOfficial Anthropic plugins for Claude Code. Frontend design, feature dev, security review, testing.
Docker Compose
35kDefine and run multi-container Docker applications. Orchestrates containers with simple YAML configuration.
OpenPose
34.1kReal-time multi-person keypoint detection. Detects body, face, hand poses from images/video.
ControlNet
33.9kControl diffusion models with spatial conditioning. Adds pose, edge, depth controls to image/video generation.
Certbot
32kAutomatically enable HTTPS on your server. EFF's tool to obtain and renew Let's Encrypt SSL certificates.
BART Summarizer
30.5kAbstractive text summarization. Generates concise summaries of long documents using AI.
CrewAI
28kFramework for orchestrating multi-agent AI teams. Define agents, tasks, and crews for autonomous workflows.
AudioCraft
23.3kMeta's audio generation library. MusicGen + AudioGen for music and sound effect generation.
Haystack
19kLLM orchestration framework for building search and RAG pipelines. Open-source NLP framework with modular components.
ChromaDB
17.5kAI-native embedding database. Store, search, and retrieve vector embeddings for semantic search.
PandasAI
14kNatural language querying for data analysis. Ask questions about your data in plain English, get Python/pandas code and results.
MoviePy
13.1kProgrammatic video editing in Python. Cut, compose, overlay, add text/audio to videos with code.
Wav2Lip
13kLip-sync videos from audio. Generates accurate lip movements matching speech audio for any video.
AnimateDiff
12.1kText-to-video animation framework. Generates smooth animations from text prompts using diffusion models.
ActivePieces
12kOpen-source workflow automation tool. Build complex automations with visual editor, 200+ integrations.
Danswer
10kEnterprise question answering over documents. Connects to Slack, Google Drive, Confluence for unified search.
PyTorch3D
9.9k3D deep learning library. Renders, manipulates, and optimizes 3D meshes, point clouds, and neural radiance fields.
PIFuHD
9.8kHigh-resolution 3D human reconstruction from single photo. Creates detailed textured avatars.
TypeChat
8.5kNatural language to typed API calls. Extracts structured data from free-form text reliably.
Riffusion
3.9kStable diffusion for real-time music generation. Generates music from text prompts via spectrograms.
SMPL-X
2.6kExpressive 3D human body model. Full body, hand, and face parametric model for avatars.
Scaper
1.3kSoundscape generation and synthesis. Mixes sound events to create realistic audio scenes.