Models from the paper "LaSeR: Reinforcement Learning with Last-Token Self-Rewarding"
Wenkai Yang
Keven16
AI & ML interests
None yet
Recent Activity
authored a paper about 5 hours ago
Rethinking Continual Experience Internalization for Self-Evolving LLM Agents upvoted a paper about 18 hours ago
Rethinking Continual Experience Internalization for Self-Evolving LLM Agents submitted a paper about 18 hours ago
Rethinking Continual Experience Internalization for Self-Evolving LLM AgentsOrganizations
None yet