Probing suite for the generalization boundaries of VLA models. This collection holds model checkpoints and more.
Juexiao Zhang PRO
juexzz
AI & ML interests
Computer Vision, Robotics, Robot Perception, Representation Learning
Recent Activity
upvoted a paper about 7 hours ago
RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space upvoted a paper about 2 months ago
CityRAG: Stepping Into a City via Spatially-Grounded Video Generation upvoted a paper 3 months ago
Beyond Language Modeling: An Exploration of Multimodal Pretraining