Dexterous Point Policy: Learning Point-based Dexterous Hand Policies from Human Demonstrations Paper • 2606.10614 • Published 22 days ago • 25
Benchmarking Visual State Tracking in Multimodal Video Understanding Paper • 2606.03920 • Published 29 days ago • 52
Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models Paper • 2603.25750 • Published Mar 20 • 36