wang binghai
refrain-wbh
AI & ML interests
None yet
Recent Activity
commentedon a paper about 11 hours ago
The Verification Horizon: No Silver Bullet for Coding Agent Rewards upvoted a paper about 22 hours ago
The Verification Horizon: No Silver Bullet for Coding Agent Rewards upvoted a paper 5 months ago
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward ModelsOrganizations
None yet