AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation Paper • 2604.08540 • Published Apr 9 • 5
Running on CPU Upgrade Agents Featured 110 Cohere Multilingual ASR 🎙 110 Transcribe audio clips to text in many languages
Running on Zero MCP 2.63k Wan2.2 14B Preview 🐌 2.63k generate a video from an image with a text prompt
Running on T4 Agents Featured 80 Trackers 🔥 80 Track objects in your video and get an annotated result