AnimalCLAP: Taxonomy-Aware Language-Audio Pretraining for Species Recognition and Trait Inference Paper • 2603.22053 • Published about 23 hours ago • 3
Group3D: MLLM-Driven Semantic Grouping for Open-Vocabulary 3D Object Detection Paper • 2603.21944 • Published 1 day ago • 22
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning Paper • 2603.21065 • Published 2 days ago • 60
Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models Paper • 2603.22212 • Published about 21 hours ago • 101
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification Paper • 2603.15726 • Published 8 days ago • 177
ProactiveBench: Benchmarking Proactiveness in Multimodal Large Language Models Paper • 2603.19466 • Published 5 days ago • 34
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published 5 days ago • 54
Reasoning Models Struggle to Control their Chains of Thought Paper • 2603.05706 • Published 19 days ago • 34
MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation Paper • 2603.16861 • Published 7 days ago • 6
SegviGen: Repurposing 3D Generative Model for Part Segmentation Paper • 2603.16869 • Published 7 days ago • 17
Lost in Stories: Consistency Bugs in Long Story Generation by LLMs Paper • 2603.05890 • Published 18 days ago • 91
CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges Paper • 2603.11863 • Published 12 days ago • 6
Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously Paper • 2603.12262 • Published 12 days ago • 30
HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios Paper • 2603.11975 • Published 12 days ago • 11