Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems Paper • 2604.14228 • Published 3 days ago
LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories Paper • 2604.15311 • Published 1 day ago
DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation Paper • 2604.14683 • Published 1 day ago
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published 2 days ago • 9
MM-WebAgent: A Hierarchical Multimodal Web Agent for Webpage Generation Paper • 2604.15309 • Published 1 day ago
SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments Paper • 2604.14144 • Published 2 days ago • 60
UI-Zoomer: Uncertainty-Driven Adaptive Zoom-In for GUI Grounding Paper • 2604.14113 • Published 2 days ago • 9
InfiniteScienceGym: An Unbounded, Procedurally-Generated Benchmark for Scientific Analysis Paper • 2604.13201 • Published 3 days ago • 1
Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published 2 days ago • 117
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 48
VideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenization Paper • 2604.12887 • Published 3 days ago • 4
Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation Paper • 2604.13010 • Published 3 days ago • 8
Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2604.12374 • Published 3 days ago • 26
SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences? Paper • 2604.10718 • Published 5 days ago • 3
ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents Paper • 2604.11784 • Published 4 days ago • 129
Solving Physics Olympiad via Reinforcement Learning on Physics Simulators Paper • 2604.11805 • Published 4 days ago • 16