datasets
weaviate/agents • Viewer • Updated Jun 11, 2025 • 22.7k • 204 • 13
supermemory/xAFS • Updated May 15 • 276 • 9
gaia-benchmark/GAIA • Viewer • Updated Oct 28, 2025 • 932 • 23.7k • 707
GloriaaaM/LLM-Agent-Harness-Survey • Viewer • Updated May 14 • 1 • 970 • 7
Science LLM/ML Tools
ChemCrow 🐦 • Running • 23 • Generate chemical synthesis instructions and predictions
doncamilom/OChemSegm-flan-T5-large • Updated Sep 4, 2023 • 3 • 1
LLM Evals
cais/mmlu • Viewer • Updated Mar 8, 2024 • 231k • 430k • 779
ZhuofengLi/web-bench • Viewer • Updated Jan 19 • 3.94k • 538
OpenResearcher/web-bench • Viewer • Updated May 19 • 5.5k • 5.07k • 4
blazeofchi/pdf-ocr-rl-dataset • Viewer • Updated Mar 1 • 4.24k • 81 • 1