Self-hosted AI stack β LLM inference, chat, voice, agents, workflows, RAG, and image generation. No cloud required.
Local-first, peer-to-peer AI SDK for running LLMs and more privately across all platforms
Refreshingly fast local AI server β run LLMs privately on your own GPU or NPU for free
Free, local, unlimited AI music generation β the open source Suno alternative
Python-free Rust inference server with OpenAI-compatible API for local LLMs β single binary, free forever.
The most popular web interface for running Stable Diffusion locally with one click
Lightweight tensor library powering local LLM inference on any hardware
Find the perfect LLM model for your hardware in seconds β hundreds of models, one command
Local LLM inference server for Mac, optimized for AI coding workflows
Self-hosted coding assistant on a $500 GPU, no cloud required.
A fully local AI research assistant for iterative web search and report writing.