arxiv:2504.19276
Chi PRO
ChilleD
AI & ML interests
Natural Language Processing.
Recent Activity
upvoted a paper about 7 hours ago
Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts
upvoted a paper 20 days ago
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories
upvoted a paper 21 days ago
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration