Running Featured 25 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems 📝 25 Who needs 1T parameters? Olympiad proofs with a 4B model
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published 15 days ago • 32
tabularisai/multilingual-sentiment-analysis Text Classification • 0.1B • Updated 11 days ago • 205k • • 358
🇩🇪German SFT and DPO datasets Collection Datasets that can be used for LLM training with axolotl, trl or llama_factory. • 33 items • Updated Jan 23, 2025 • 13