The ToolRL model trained for tool use through GRPO
Cheng Qian
chengq9
AI & ML interests
Agent, Tool Learning
Recent Activity
upvoted a paper 2 days ago
Brick-Composer: Using MLLMs for Assembly with Diverse Bricks
upvoted a paper 5 days ago
Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues