UniVideo: Unified Understanding, Generation, and Editing for Videos Paper • 2510.08377 • Published Oct 9, 2025 • 78
FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction Paper • 2509.21657 • Published Sep 25, 2025 • 3
SOP: A Scalable Online Post-Training System for Vision-Language-Action Models Paper • 2601.03044 • Published 6 days ago • 26
SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence Paper • 2512.22334 • Published 16 days ago • 34
MOSS Transcribe Diarize Collection A unified multimodal large language model for end-to-end speaker-attributed, time-stamped transcription. • 2 items • Updated 5 days ago • 1
MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization Paper • 2601.01554 • Published 8 days ago • 51
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 6 days ago • 24
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Paper • 2512.24618 • Published 12 days ago • 129
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Paper • 2512.24615 • Published 12 days ago • 108
Qianfan-VL: Domain-Enhanced Universal Vision-Language Models Paper • 2509.18189 • Published Sep 19, 2025 • 1
NEO1_0 Collection From Pixels to Words -- Towards Native Vision-Language Primitives at Scale • 7 items • Updated Oct 17, 2025 • 5
MiroThinker-v1.5 Collection MiroMind’s Flagship Search Agent Model • 4 items • Updated 5 days ago • 19
view article Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization 7 days ago • 27