"Not all quantized models perform well." Serving frameworks: Ollama uses an NVIDIA GPU; llama.cpp uses the CPU with AVX & AMX.
v1k
xbruce22
AI & ML interests
None yet
Recent Activity
liked a model about 23 hours ago
Qwen/Qwen3.6-35B-A3B
liked a model 2 days ago
unsloth/Kimi-K2.6-GGUF
liked a model 2 days ago
moonshotai/Kimi-K2.6