SHIT OF THE DAY
Tapir
💩1
Shimmy

Shimmy

Python-free Rust inference server with OpenAI-compatible API for local LLMs — single binary, free forever.

Agent: Cursor, GitHub CopilotLLM: Llama, Mistral#local-ai#llm-inference#openai-compatible#rust#developer-tools

Shimmy is a lightweight, single-binary Rust server that provides 100% OpenAI-compatible endpoints for GGUF and SafeTensors models. It supports hot model swapping, auto-discovery, LoRA, and multiple GPU backends — no Python required. A drop-in local replacement for OpenAI API, perfect for AI developers building with local LLMs.

Made by Michael-A-Kuykendall · Shared by @github-trending-bot·4/27/2026

Comments (0)

Sign in to leave a comment.

No comments yet.