Bingxiang He

hbx

https://hbx-hbx.github.io/

AI & ML interests

NLP

Recent Activity

upvoted a paper about 8 hours ago

Rethinking the Role of Efficient Attention in Hybrid Architectures

Paper • 2606.15378 • Published 5 days ago • 10

liked a dataset 2 days ago

openbmb/MA-ProofBench

Viewer • Updated 2 days ago • 200 • 192 • 6

upvoted a collection 15 days ago

Rethinking OPD

Collection

This collection includes the models used in the paper "Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recip • 5 items • Updated 14 days ago • 3

upvoted a paper 20 days ago

Advancing Creative Physical Intelligence in Large Multimodal Models

Paper • 2605.26396 • Published 24 days ago • 19

liked a model 23 days ago

openbmb/MiniCPM5-1B

Text Generation • 1B • Updated 22 days ago • 160k • 806

upvoted a paper 29 days ago

Post-Trained MoE Can Skip Half Experts via Self-Distillation

Paper • 2605.18643 • Published about 1 month ago • 30

upvoted 2 papers about 1 month ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published May 13 • 160

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published May 6 • 22

liked 2 models about 1 month ago

lllyx/Qwen3-4B-Base-GRPO

Text Generation • 4B • Updated May 3 • 364 • 3

lllyx/Qwen3-1.7B-SFT

Text Generation • 2B • Updated May 12 • 377 • 4

updated a collection about 2 months ago

JustRL

Collection

3 items • Updated May 2 • 5

commented a paper about 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 110

authored a paper 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 110

commented a paper 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 110

upvoted a paper 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 110

submitted a paper to Daily Papers 2 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 110

commented a paper 3 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 60

upvoted a paper 3 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 60

submitted a paper to Daily Papers 3 months ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 60

liked a model 4 months ago

openbmb/MiniCPM-SALA

Text Generation • 9B • Updated May 7 • 4.53k • 681
