GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors Paper • 2606.05160 • Published 1 day ago • 1
NVIDIA OmniDreams: Real-Time Generative World Model for Closed-Loop Autonomous Vehicle Simulation Paper • 2606.03159 • Published 2 days ago • 13
Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories Paper • 2606.03979 • Published 2 days ago • 16
Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking Paper • 2606.03985 • Published 2 days ago • 32
3DCodeBench: Benchmarking Agentic Procedural 3D Modeling Via Code Paper • 2606.01057 • Published 4 days ago • 5
Thinking in Blender: Staged Executable Inverse Graphics with Vision-Language Models Paper • 2606.02580 • Published 3 days ago • 1
StressDream: Steering Video World Models for Robust Policy Evaluation and Improvement Paper • 2606.00267 • Published 6 days ago • 2
Linear Scaling Video VLMs for Long Video Understanding Paper • 2605.31598 • Published 6 days ago • 11
Light Interaction: Training-Free Inference Acceleration for Interactive Video World Models Paper • 2605.31158 • Published 6 days ago • 1
Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models Paper • 2605.31603 • Published 6 days ago • 6
Representation Forcing for Bottleneck-Free Unified Multimodal Models Paper • 2605.31604 • Published 6 days ago • 53
Why Larger Models Learn More: Effects of Capacity, Interference, and Rare-Task Retention Paper • 2605.29548 • Published 7 days ago • 9
WorldMemArena: Evaluating Multimodal Agent Memory Through Action-World Interaction Paper • 2605.29341 • Published 7 days ago • 15
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published 7 days ago • 55
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 7 days ago • 135
PEAM: Parametric Embodied Agent Memory through Contrastive Internalization of Experience in Minecraft Paper • 2605.27762 • Published 9 days ago • 7
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning Paper • 2605.28774 • Published 8 days ago • 86