meituan/SWE-Cycle
DiningBench: A Hierarchical Multi-view Benchmark for Perception and Reasoning in the Dietary Domain
ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?