LLaDA2.1: Speeding Up Text Diffusion via Token Editing Paper • 2602.08676 • Published 15 days ago • 66
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published 19 days ago • 326
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger Paper • 2602.08222 • Published 15 days ago • 265
SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published 21 days ago • 60
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published 25 days ago • 101
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published Jan 8 • 227
granite-docling-258M demo 📝 Extract and convert document content from images • Running on Zero • Featured • 268
Nested Browser-Use Learning for Agentic Information Seeking Paper • 2512.23647 • Published Dec 29, 2025 • 19
DiRL: An Efficient Post-Training Framework for Diffusion Language Models Paper • 2512.22234 • Published Dec 23, 2025 • 22
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper • 2512.16093 • Published Dec 18, 2025 • 95
SaitBurak/Qwen3-4B-Thinking-2507-DeepSeek-v3.2-Speciale-Code-Distill-Q4_K_S-GGUF 4B • Updated Dec 14, 2025 • 142
InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models Paper • 2512.08829 • Published Dec 9, 2025 • 21
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation Paper • 2512.09363 • Published Dec 10, 2025 • 72