Helios: Real Real-Time Long Video Generation Model Paper • 2603.04379 • Published about 14 hours ago • 45
Code2World: A GUI World Model via Renderable Code Generation Paper • 2602.09856 • Published 23 days ago • 198
Learn Hard Problems During RL with Reference Guided Fine-tuning Paper • 2603.01223 • Published 4 days ago • 12
PersonaPlex: Voice and Role Control for Full Duplex Conversational Speech Models Paper • 2602.06053 • Published Jan 14 • 7
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 1 day ago • 61
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published 3 days ago • 123
From Scale to Speed: Adaptive Test-Time Scaling for Image Editing Paper • 2603.00141 • Published 9 days ago • 128
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Paper • 2602.24286 • Published 6 days ago • 73
Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device Paper • 2602.20161 • Published 10 days ago • 23
Test-Time Training with KV Binding Is Secretly Linear Attention Paper • 2602.21204 • Published 9 days ago • 30
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published 9 days ago • 90
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published 24 days ago • 258
view article Article Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models Dec 15, 2025 • 109
view article Article How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs Tensor Parallelism 21 days ago • 16