datasets weaviate/agents Viewer • Updated Jun 11, 2025 • 22.7k • 201 • 13 supermemory/xAFS Updated May 15 • 276 • 9 gaia-benchmark/GAIA Viewer • Updated Oct 28, 2025 • 932 • 26.9k • 705 GloriaaaM/LLM-Agent-Harness-Survey Viewer • Updated May 14 • 1 • 1.02k • 7
LLM Evals cais/mmlu Viewer • Updated Mar 8, 2024 • 231k • 429k • 778 ZhuofengLi/web-bench Viewer • Updated Jan 19 • 3.94k • 517 OpenResearcher/web-bench Viewer • Updated May 19 • 5.5k • 4.95k • 4 blazeofchi/pdf-ocr-rl-dataset Viewer • Updated Mar 1 • 4.24k • 80 • 1
datasets weaviate/agents Viewer • Updated Jun 11, 2025 • 22.7k • 201 • 13 supermemory/xAFS Updated May 15 • 276 • 9 gaia-benchmark/GAIA Viewer • Updated Oct 28, 2025 • 932 • 26.9k • 705 GloriaaaM/LLM-Agent-Harness-Survey Viewer • Updated May 14 • 1 • 1.02k • 7
LLM Evals cais/mmlu Viewer • Updated Mar 8, 2024 • 231k • 429k • 778 ZhuofengLi/web-bench Viewer • Updated Jan 19 • 3.94k • 517 OpenResearcher/web-bench Viewer • Updated May 19 • 5.5k • 4.95k • 4 blazeofchi/pdf-ocr-rl-dataset Viewer • Updated Mar 1 • 4.24k • 80 • 1