WorldCompass: Reinforcement Learning for Long-Horizon World Models Paper • 2602.09022 • Published 5 days ago • 20
Self-Hinting Language Models Enhance Reinforcement Learning Paper • 2602.03143 • Published 12 days ago • 28
Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation Paper • 2512.23705 • Published Dec 29, 2025 • 45
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone Paper • 2512.22615 • Published Dec 27, 2025 • 48
view article Article Why You Should Care About Partial Differential Equations (PDEs) Dec 12, 2025 • 41
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning Paper • 2510.27606 • Published Oct 31, 2025 • 30
Deep Reinforcement Learning Collection Features implementations and paces of popular RL algorithms and new paradigms on a variety of environments. • 7 items • Updated Nov 4, 2025
π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models Paper • 2510.25889 • Published Oct 29, 2025 • 66