Running Featured 1.28k FineWeb: decanting the web for the finest text data at scale π· 1.28k Generate high-quality text data for LLMs using FineWeb
User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale Paper β’ 2601.08225 β’ Published 22 days ago β’ 51
Running 217 FineVision: Open Data is All You Need π 217 A new open-source dataset for training VLMs
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B Text Generation β’ 31B β’ Updated Oct 10, 2025 β’ 46.5k β’ 796
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code +2 May 23, 2025 β’ 171