Adam Improves Muon: Adaptive Moment Estimation with Orthogonalized Momentum Paper • 2602.17080 • Published 6 days ago • 1 • 2
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published 16 days ago • 167 • 6
4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere Paper • 2602.10094 • Published 14 days ago • 1 • 3
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper • 2602.18422 • Published 4 days ago • 22 • 5
ReIn: Conversational Error Recovery with Reasoning Inception Paper • 2602.17022 • Published 6 days ago • 1 • 2
Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers Paper • 2602.18292 • Published 4 days ago • 10 • 5
Whom to Query for What: Adaptive Group Elicitation via Multi-Turn LLM Interactions Paper • 2602.14279 • Published 9 days ago • 1 • 2
Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges Paper • 2602.13576 • Published 11 days ago • 1 • 2
VidEoMT: Your ViT is Secretly Also a Video Segmentation Model Paper • 2602.17807 • Published 5 days ago • 5 • 2
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training Paper • 2602.10693 • Published 13 days ago • 166 • 5
DeepVision-103K: A Visually Diverse, Broad-Coverage, and Verifiable Mathematical Dataset for Multimodal Reasoning Paper • 2602.16742 • Published 7 days ago • 3 • 2
Selective Training for Large Vision Language Models via Visual Information Gain Paper • 2602.17186 • Published 5 days ago • 1 • 2
EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots Paper • 2602.18071 • Published 4 days ago • 6 • 2
Learning Smooth Time-Varying Linear Policies with an Action Jacobian Penalty Paper • 2602.18312 • Published 4 days ago • 1 • 2
Spanning the Visual Analogy Space with a Weight Basis of LoRAs Paper • 2602.15727 • Published 7 days ago • 11 • 3