Goku: A Million-Scale Universal Dataset and Benchmark for Instruction-Based Video Editing Paper • 2606.30599 • Published 4 days ago • 5
Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts Paper • 2606.05922 • Published 30 days ago • 69