Create GGUF quantized model from a Hugging Face repo
Merge and audit large language models on low‑VRAM GPUs