Run private, offline AI models on your own hardware
Run large language models locally with a simple CLI and REST API. Supports Llama, Mistral, Gemma, and dozens of other models out of the box.
β Pair Ollama with Open WebUI for a full browser-based chat interface
Jan is a free, open-source AI chat application that runs entirely on your local machine. Unlike cloud-based tools like ChatGPT, Jan sends no data to external servers β all inference happens on-device using local language models including LLaMA, Mistral, and other GGUF-format models. It supports Mac, Windows, and Linux and works fully offline. Jan is the go-to choice for developers and privacy-conscious users who want a self-hosted AI assistant with complete data sovereignty.
Offline-capable, self-hosted web interface for Ollama and OpenAI-compatible APIs. ChatGPT-like UI that runs entirely on your own machine.
β Connect Open WebUI to LocalAI to expose an OpenAI-compatible API for apps
Self-hosted, OpenAI-compatible API server for running AI models locally. Drop-in replacement for OpenAI β no GPU required.
Desktop app for running local LLMs with a ChatGPT-like interface. Supports GGUF models, model management, and a local API server compatible with OpenAI.
High-performance LLM inference in C++ enabling local AI on CPUs and Apple Silicon. The foundational engine powering most local AI tools.