reinforcement-learning THU-KEG/LongWriter-Zero-32B Text Generation • 33B • Updated Jul 3, 2025 • 88 • • 113
Datasetets for general finetuning argilla/distilabel-capybara-dpo-7k-binarized Viewer • Updated Jul 16, 2024 • 7.56k • 6.1k • 184 Locutusque/function-calling-chatml Viewer • Updated Jul 16, 2024 • 113k • 1.31k • 177 pints-ai/Expository-Prose-V1 Viewer • Updated Aug 12, 2024 • 6.67M • 42 • 20
reinforcement-learning THU-KEG/LongWriter-Zero-32B Text Generation • 33B • Updated Jul 3, 2025 • 88 • • 113
Datasetets for general finetuning argilla/distilabel-capybara-dpo-7k-binarized Viewer • Updated Jul 16, 2024 • 7.56k • 6.1k • 184 Locutusque/function-calling-chatml Viewer • Updated Jul 16, 2024 • 113k • 1.31k • 177 pints-ai/Expository-Prose-V1 Viewer • Updated Aug 12, 2024 • 6.67M • 42 • 20