view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 13 days ago • 475
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs Paper • 2309.05516 • Published Sep 11, 2023 • 11
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated 7 days ago • 127
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 509
view changelog Hugging Face Changelog Repositories total file size is now displayed Sep 18, 2025 • 176
view changelog Hugging Face Changelog Introducing HF Jobs: Run scalable compute jobs on Hugging Face Jul 30, 2025 • 202
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders Jul 9, 2025 • 785
DataDecide Collection A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 354 items • Updated 2 days ago • 23