arxiv:2606.16154
Jesse Cresswell
JesseCresswell
ยท
AI & ML interests
None yet
Recent Activity
authored a paper about 3 hours ago
A Gradient Perspective on RLVR Stability and Winner Advantage Policy Optimization submitted a paper about 13 hours ago
A Gradient Perspective on RLVR Stability and Winner Advantage Policy Optimization upvoted a paper about 18 hours ago
A Gradient Perspective on RLVR Stability and Winner Advantage Policy Optimization