Jeonghun

jh-y

https://sites.google.com/view/jeonghunyeo

AI & ML interests

Multimodal learning

Recent Activity

updated a model about 1 month ago

published a model about 1 month ago

authored a paper over 1 year ago

MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens

View all activity

Organizations

None yet

Papers 3

arxiv:2503.11315

arxiv:2503.06273

arxiv:2402.15151

models 1

jh-y/dllm-vsr

datasets 0

None public yet