arxiv:2605.04045
Shengqiong Wu
ChocoWu
AI & ML interests
Large Language Model, Multimodal learning, Scene graph Generation
Recent Activity
liked a dataset about 1 hour ago
yanlinli/UniM authored a paper 22 days ago
Audio-Visual Intelligence in Large Foundation Models upvoted a paper 27 days ago
Audio-Visual Intelligence in Large Foundation Models