Skip to content

Local AI

GGUF Format

A binary file format for storing quantized LLM weights, used by llama.cpp and compatible runtimes. GGUF replaced the older GGML format and is the standard for sharing locally runnable open-weight models.