Qwen/DeepPlanning
Viewer • Updated • 2.14k • 688 • 195
None defined yet.
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models
RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation