Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training Paper • 2606.11854 • Published 1 day ago • 3
TRACE: A Unified Rollout Budget Allocation Framework for Efficient Agentic Reinforcement Learning Paper • 2606.11119 • Published 2 days ago • 15
InternVideo3: Agentify Foundation Models with Multimodal Contextual Reasoning Paper • 2606.12195 • Published 2 days ago • 17
On Subquadratic Architectures: From Applications to Principles Paper • 2606.12364 • Published 1 day ago • 20
World Pilot: Steering Vision-Language-Action Models with World-Action Priors Paper • 2606.12403 • Published 2 days ago • 22
Toward Generalist Autonomous Research via Hypothesis-Tree Refinement Paper • 2606.11926 • Published 2 days ago • 67
Redesign Mixture-of-Experts Routers with Manifold Power Iteration Paper • 2606.12397 • Published 2 days ago • 74
What Should Agents Say? Action-state Communication for Efficient Multi-Agent Systems Paper • 2606.05304 • Published 9 days ago • 5
SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research Paper • 2606.09730 • Published 4 days ago • 49
SWE-Explore: Benchmarking How Coding Agents Explore Repositories Paper • 2606.07297 • Published 7 days ago • 108
Phase Marginalization for Patch-Grid Instability in Vision Transformers Paper • 2606.08132 • Published 5 days ago • 3
DuMate-DeepResearch: An Auditable Multi-Agent System with Recursive Search and Rubric-Grounded Reasoning Paper • 2606.07299 • Published 7 days ago • 6
LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 8 days ago • 59