DCAgent3/terminal_bench_2_g1_diverse_tezos_top4_316_8b_20260602_100013 Viewer • Updated 16 days ago • 655 • 31 • 1
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 29 days ago • 204
Stream-T1: Test-Time Scaling for Streaming Video Generation Paper • 2605.04461 • Published May 6 • 107
Persistent Visual Memory: Sustaining Perception for Deep Generation in LVLMs Paper • 2605.00814 • Published May 1 • 21
Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning Paper • 2604.16029 • Published Apr 17 • 23
DR³-Eval: Towards Realistic and Reproducible Deep Research Evaluation Paper • 2604.14683 • Published Apr 16 • 36
HDP: A Lightweight Cryptographic Protocol for Human Delegation Provenance in Agentic AI Systems Paper • 2604.04522 • Published Apr 6 • 10