Shang Hong Sim
shanghong
AI & ML interests
Neural decoding, neuroengineering, signal processing
Organizations
models 13
shanghong/qwen3_8b_tatqa
1B • Updated • 4
shanghong/oumi_rag_grpo
Question Answering • 4B • Updated • 2
shanghong/llama3.1_8b_stage1
8B • Updated • 1
shanghong/qwen3_8b_stage1
8B • Updated • 3
shanghong/qwen3_4b_stage1
4B • Updated • 1
shanghong/stage1
Text Generation • 8B • Updated • 11 •
shanghong/q-FrozenLake-4x4-custom
Reinforcement Learning • Updated
shanghong/q-FrozenLake-4x4-test
Reinforcement Learning • Updated
shanghong/q-FrozenLake-custommap-v2
Updated
shanghong/q-FrozenLake-custommap
Reinforcement Learning • Updated
datasets 6
shanghong/oumi-web-agent
Viewer • Updated • 9.28k • 53
shanghong/oumi_rag_grpo_data
Viewer • Updated • 5.12k • 25
shanghong/llama_index_integration_data
Viewer • Updated • 21.1M • 11
shanghong/PRM800K_phase2_balanced
Viewer • Updated • 1.38M • 12
shanghong/PRM800K_train2_base_sft
Viewer • Updated • 97.8k • 8
shanghong/PRM800K_train2
Viewer • Updated • 966k • 44