daVinci-Dev: Agent-native Mid-training for Software Engineering Paper • 2601.18418 • Published 1 day ago • 104
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts Paper • 2601.11044 • Published 12 days ago • 34
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published 29 days ago • 65