All Tools
llama.cpp
LLM inference in C/C++. Run LLaMA, Mistral, Gemma models efficiently on CPU.
View on GitHub76,000 stars
Install
git clone https://github.com/ggml-ai/llama.cppAlways review the official documentation before installing.
Tags
llminferencecpplocal
Related Tools
ColossalAI
Large-scale AI training framework. Parallel training, fine-tuning, and inference of LLMs.
40,000Gradio
Build and share machine learning demos and web apps in Python. Create interactive model interfaces with a few lines of code.
37,000Claude Code Plugins
Official Anthropic plugins for Claude Code. Frontend design, feature dev, security review, testing.
35,000