Running 107 Unlocking On-Policy Distillation for Any Model Family 📝 107 Visualize on-policy distillation for any model family
Running Featured 85 Distilling 100B+ Models 40x Faster with TRL 📝 85 TRL distillation for 100B+ teachers, 40x faster
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Mar 2 • 244
Running on CPU Upgrade Featured 3.18k The Smol Training Playbook 📚 3.18k The secrets to building world-class LLMs
Running 3.86k The Ultra-Scale Playbook 🌌 3.86k The ultimate guide to training LLM on large GPU Clusters
Running Agents 18 Chapter 1 Quiz - Transformers Fundementals 🔥 18 Test your knowledge of the Transformers Fundementals