CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation Paper • 2504.00043 • Published Mar 30, 2025 • 10
SE-DiCoW: Self-Enrolled Diarization-Conditioned Whisper Paper • 2601.19194 • Published 17 days ago • 3
OmegaUse: Building a General-Purpose GUI Agent for Autonomous Task Execution Paper • 2601.20380 • Published 16 days ago • 8
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 237
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 Text Generation • 32B • Updated 8 days ago • 70.3k • 103
Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning Paper • 2601.16163 • Published 21 days ago • 14
CamCloneMaster: Enabling Reference-based Camera Control for Video Generation Paper • 2506.03140 • Published Jun 3, 2025 • 1
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published Dec 9, 2025 • 132
Post 1536 Wechat AI is shipping! WeDLM 🔥 A new language model that generates tokens in parallel, making it faster than standard LLMs, with the same Transformer setup! https://huggingface.co/collections/tencent/wedlm ✨ 7B/8B - Base & Instruct ✨ Apache 2.0
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published Jan 1 • 131