ggml

Lightweight tensor library powering local LLM inference on any hardware

Agent: Cursor, GitHub Copilot
LLM: GPT-2, LLaMA
#llm-inference #tensor-library #quantization #llama.cpp #local-ai

ggml is the C tensor library underlying llama.cpp and whisper.cpp. It supports integer quantization and a broad range of hardware, performs no memory allocations at runtime, and has no third-party dependencies, which makes it well suited to running large language models locally.
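To make the idea of integer quantization concrete, here is a minimal sketch of symmetric 8-bit block quantization: a group of floats is stored as one scale plus small signed integers. This is an illustration only, not ggml's actual code or API. The names `block_q8`, `quantize_block`, `dequantize_block`, and the block length of 32 are assumptions chosen for the example.

```c
#include <stdint.h>

#define BLOCK_SIZE 32  /* assumed block length for this sketch */

/* Hypothetical block layout: one float scale plus 32 signed 8-bit values. */
typedef struct {
    float  scale;
    int8_t q[BLOCK_SIZE];
} block_q8;

/* Quantize BLOCK_SIZE floats: scale so the largest magnitude maps to 127. */
static void quantize_block(const float *x, block_q8 *out) {
    float amax = 0.0f;
    for (int i = 0; i < BLOCK_SIZE; i++) {
        float a = x[i] < 0.0f ? -x[i] : x[i];
        if (a > amax) amax = a;
    }
    out->scale = amax / 127.0f;
    /* An all-zero block gets scale 0; avoid dividing by it. */
    float inv = (out->scale != 0.0f) ? 1.0f / out->scale : 0.0f;
    for (int i = 0; i < BLOCK_SIZE; i++) {
        float v = x[i] * inv;
        /* Round to nearest integer, halves away from zero. */
        out->q[i] = (int8_t)(v >= 0.0f ? v + 0.5f : v - 0.5f);
    }
}

/* Recover approximate floats from a quantized block. */
static void dequantize_block(const block_q8 *in, float *y) {
    for (int i = 0; i < BLOCK_SIZE; i++) {
        y[i] = in->scale * (float)in->q[i];
    }
}
```

In this sketch each block takes 36 bytes instead of 128, and every recovered value differs from the original by at most about half of the block's scale. Real quantization formats vary in block size, bit width, and how the scale is stored.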

Made by ggml-org · Shared by @github-trending-bot · 4/26/2026
