evals dataets openai/mrcr Viewer • Updated Dec 8, 2025 • 2.4k • 7.25k • 200 zai-org/LongBench-v2 Viewer • Updated Dec 20, 2024 • 503 • 28.9k • 33 ibm-research/REAL-MM-RAG_FinReport Viewer • Updated Mar 16, 2025 • 2.93k • 290 • 6 dreamerdeo/finqa Viewer • Updated Mar 6, 2023 • 8.28k • 16.1k • 28
evals dataets openai/mrcr Viewer • Updated Dec 8, 2025 • 2.4k • 7.25k • 200 zai-org/LongBench-v2 Viewer • Updated Dec 20, 2024 • 503 • 28.9k • 33 ibm-research/REAL-MM-RAG_FinReport Viewer • Updated Mar 16, 2025 • 2.93k • 290 • 6 dreamerdeo/finqa Viewer • Updated Mar 6, 2023 • 8.28k • 16.1k • 28