arxiv:2503.11315
Jeonghun
jh-y
AI & ML interests
Multimodal learning
Recent Activity
updated a model about 1 month ago
jh-y/dllm-vsr
published a model about 1 month ago
jh-y/dllm-vsr
authored a paper over 1 year ago
MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens
Organizations
None yet