Mentor: Dr. Harsha Musunuri
Manager: Dr. Guan-ming Su
Responsibilities included:
- Conducted research on Vision-Language Models (VLMs) for video understanding, with an emphasis on bridging cutting-edge research and industrial applications.
- Explored deep learning methods for multimodal representation learning, fine-tuning, and evaluation on large-scale video datasets.
- Designed and implemented experimental pipelines to assess model performance, scalability, and practical deployment potential.
- Collaborated closely with senior researchers and engineers, contributing to forward-looking innovations in AI and multimedia technologies.