I am a first-year Ph.D. student at KAIST, advised by Professor Joon Son Chung. My research centers on multimodal learning, with a particular focus on improving the understanding and reasoning capabilities of multimodal large language models (MLLMs).
I am also passionate about generative modeling for audio, including text-to-audio generation, and about speech-related tasks such as speech enhancement, source separation, and lip-to-speech synthesis.