marsggbo/xsum_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small • Updated Jun 16, 2025 • 11.3k • 47
marsggbo/wmt16_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small • Updated Jun 16, 2025 • 10k • 25
marsggbo/alpaca_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_t5-small • Updated Jun 16, 2025 • 10k • 80
marsggbo/wmt16_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat • Updated May 14, 2025 • 10k • 11
marsggbo/xsum_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat • Updated May 9, 2025 • 11.3k • 52
marsggbo/alpaca_qwen1.5MoEA2.7B_token_real_and_predicted_patterns_Qwen1.5-0.5B-chat • Updated May 8, 2025 • 10k • 4
marsggbo/xsum_mixtral8x7bInstructv0.1_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 • Updated Oct 5, 2024 • 11.3k • 82
marsggbo/wmt16_mixtral8x7bInstructv0.1_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 • Updated Oct 5, 2024 • 10k • 60
marsggbo/xsum_switch128_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 • Updated Oct 4, 2024 • 11.3k • 5
marsggbo/xsum_switch64_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 • Updated Sep 20, 2024 • 11.3k • 9
marsggbo/xsum_switch32_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 • Updated Sep 20, 2024 • 11.3k • 88
marsggbo/wmt16_switch128_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 • Updated Sep 20, 2024 • 10k • 3
marsggbo/wmt16_switch64_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 • Updated Sep 20, 2024 • 10k • 3
marsggbo/wmt16_switch32_token_real_and_predicted_patterns_t5-small_dff2048_dmodel32 • Updated Sep 20, 2024 • 10k • 51