marcelweiss 's Collections Robotics
updated
Cosmos World Foundation Model Platform for Physical AI
Paper
• 2501.03575
• Published
• 82
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with
Video LLM
Paper
• 2501.00599
• Published
• 46
OmniManip: Towards General Robotic Manipulation via Object-Centric
Interaction Primitives as Spatial Constraints
Paper
• 2501.03841
• Published
• 56
Are VLMs Ready for Autonomous Driving? An Empirical Study from the
Reliability, Data, and Metric Perspectives
Paper
• 2501.04003
• Published
• 27
Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or
Robot Hardware
Paper
• 2505.09601
• Published
• 6
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action
Models
Paper
• 2507.23682
• Published
• 24
MolmoAct: Action Reasoning Models that can Reason in Space
Paper
• 2508.07917
• Published
• 44
Genie Envisioner: A Unified World Foundation Platform for Robotic
Manipulation
Paper
• 2508.05635
• Published
• 73
PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era
Paper
• 2509.12989
• Published
• 28
FLOWER: Democratizing Generalist Robot Policies with Efficient
Vision-Language-Action Flow Policies
Paper
• 2509.04996
• Published
• 15
InternScenes: A Large-scale Simulatable Indoor Scene Dataset with
Realistic Layouts
Paper
• 2509.10813
• Published
• 31