AI Tools Directory

Browse 42 hand-verified open-source AI tools. Every tool is a real, active GitHub repository.

Vision & Image Audio & Speech Content & Text Data & Analytics Workflow & Automation Coding & Development Infrastructure & DevOps

Stable Diffusion WebUI

157k

Browser interface for Stable Diffusion image generation. Prompt-to-image, inpainting, upscaling.

Vision & Imageimagegeneration

Kubernetes

113k

Production-grade container orchestration. Automates deployment, scaling, and management of containerized applications.

Infrastructure & DevOpscontainerorchestration

LangChain

105k

Build LLM-powered applications. Chains, agents, retrieval, memory for production AI apps.

Data & Analyticsllmframework

Whisper (OpenAI)

78k

General-purpose speech recognition. Transcribes audio to text in 100+ languages.

Audio & Speechspeechtranscription

llama.cpp

76k

LLM inference in C/C++. Run LLaMA, Mistral, Gemma models efficiently on CPU.

Coding & Developmentllminference

Open WebUI

75k

Self-hosted ChatGPT interface. Works with Ollama, OpenAI, any OpenAI-compatible API.

Workflow & Automationchatui

Ansible

64k

Agentless IT automation platform. Automate configuration management, application deployment, and task orchestration.

Infrastructure & DevOpsautomationconfig-management

n8n

61k

Fair-code workflow automation. 400+ integrations, visual builder, AI agent workflows, self-hostable.

Workflow & Automationautomationworkflow

Act

58k

Run GitHub Actions locally. Test your CI/CD pipelines on your own machine before pushing.

Infrastructure & DevOpsci-cdtesting

PrivateGPT

55k

Ask questions to your documents without internet. Secure RAG with local LLMs, 100% private document interaction.

Content & Textragdocument

FaceSwap

53.8k

Face swapping and face reenactment pipeline. Swap faces in images and video with AI.

Vision & Imagefaceswap

Segment Anything

49.4k

Meta's image segmentation model. Segment any object in any image with prompts.

Vision & Imagesegmentationimage

TTS (Coqui)

45.3k

Deep learning text-to-speech toolkit. 1100+ languages, multiple voices, fine-tunable models.

Audio & Speechttsspeech

Terraform

44k

Infrastructure as Code tool. Provision and manage cloud resources across 300+ providers declaratively.

Infrastructure & DevOpsiaccloud

Whisper.cpp

40.5k

High-performance speech-to-text in C/C++. Runs Whisper models locally on CPU efficiently.

Audio & Speechspeechtranscription

ColossalAI

40k

Large-scale AI training framework. Parallel training, fine-tuning, and inference of LLMs.

Coding & Developmenttrainingllm

Bark (Suno)

39.1k

Text-prompted generative audio model. Generates speech, music, sound effects from text.

Audio & Speechaudiospeech

LlamaIndex

39k

Data framework for LLM applications. Index, retrieve, and query your data for RAG.

Data & Analyticsragretrieval

Streamlit

38.5k

Turn Python scripts into web apps. Fastest way to build data/ML app UIs with pure Python.

Data & Analyticsweb-appdashboard

Gradio

37k

Build and share machine learning demos and web apps in Python. Create interactive model interfaces with a few lines of code.

Coding & Developmentuidemo

Claude Code Plugins

35k

Official Anthropic plugins for Claude Code. Frontend design, feature dev, security review, testing.

Coding & Developmentclaude-codeplugin

Docker Compose

35k

Define and run multi-container Docker applications. Orchestrates containers with simple YAML configuration.

Infrastructure & DevOpsdockercontainer

OpenPose

34.1k

Real-time multi-person keypoint detection. Detects body, face, hand poses from images/video.

Vision & Imageposekeypoint

ControlNet

33.9k

Control diffusion models with spatial conditioning. Adds pose, edge, depth controls to image/video generation.

Vision & Imageimagevideo

Certbot

32k

Automatically enable HTTPS on your server. EFF's tool to obtain and renew Let's Encrypt SSL certificates.

Infrastructure & DevOpssslsecurity

BART Summarizer

30.5k

Abstractive text summarization. Generates concise summaries of long documents using AI.

Content & Texttextsummarization

CrewAI

28k

Framework for orchestrating multi-agent AI teams. Define agents, tasks, and crews for autonomous workflows.

Workflow & Automationagentmulti-agent

AudioCraft

23.3k

Meta's audio generation library. MusicGen + AudioGen for music and sound effect generation.

Audio & Speechaudiomusic

Haystack

19k

LLM orchestration framework for building search and RAG pipelines. Open-source NLP framework with modular components.

Content & Textragsearch

ChromaDB

17.5k

AI-native embedding database. Store, search, and retrieve vector embeddings for semantic search.

Data & Analyticsvectordatabase

PandasAI

14k

Natural language querying for data analysis. Ask questions about your data in plain English, get Python/pandas code and results.

Content & Textdataanalysis

MoviePy

13.1k

Programmatic video editing in Python. Cut, compose, overlay, add text/audio to videos with code.

Vision & Imagevideoediting

Wav2Lip

13k

Lip-sync videos from audio. Generates accurate lip movements matching speech audio for any video.

Vision & Imagelip-syncaudio-to-video

AnimateDiff

12.1k

Text-to-video animation framework. Generates smooth animations from text prompts using diffusion models.

Vision & Imagevideoanimation

ActivePieces

12k

Open-source workflow automation tool. Build complex automations with visual editor, 200+ integrations.

Workflow & Automationautomationworkflow

Danswer

10k

Enterprise question answering over documents. Connects to Slack, Google Drive, Confluence for unified search.

Content & Textragdocuments

PyTorch3D

9.9k

3D deep learning library. Renders, manipulates, and optimizes 3D meshes, point clouds, and neural radiance fields.

Vision & Image3drendering

PIFuHD

9.8k

High-resolution 3D human reconstruction from single photo. Creates detailed textured avatars.

Vision & Image3dhuman

TypeChat

8.5k

Natural language to typed API calls. Extracts structured data from free-form text reliably.

Content & Textnlpstructured-output

Riffusion

3.9k

Stable diffusion for real-time music generation. Generates music from text prompts via spectrograms.

Audio & Speechmusicgeneration

SMPL-X

2.6k

Expressive 3D human body model. Full body, hand, and face parametric model for avatars.

Vision & Image3dhuman

Scaper

1.3k

Soundscape generation and synthesis. Mixes sound events to create realistic audio scenes.

Audio & Speechaudiosoundscape

Not sure which tool? Let AI find the perfect combo