Hi there! 👋 I am Yaxin Luo.
About Me
Hello! I am a First-Year Machine Learning PhD student at MBZUAI under the supervision of Prof. Zhiqiang Shen. I am also closely working with my friend Xiaofu Chen. Previously, I received my Bachelor’s degree from Technical University of Denmark supervised by Prof. Dim P. Papadopoulos. My research interests span in :
- Multimodal Foundation Model: Developing native multimodal foundation models which can perform both understanding and generation tasks from video,language. (My Long-Term never changed research interest and belief)
- Efficient deep learning: I am interested in how to train or inference large models more efficiently, especially tackling the sparsity nature of nerual networks. (Support the first goal)
- Multimodal Agents: Innovate easy to use Multimodal Agents to automate complicated real-world complex tasks. (Application of the first goal)
News
- [2025-05-16] DRAG is accepted by ACL 2025 main conference!!
- [2025-01-22] γ-MoD is accepted by ICLR 2025, see you in Singapore!
Selected Publications
( * indicate equal contribution)
For full and up-to-date publication list, please refer to my Google Scholar page.
APL: Anchor-Based Prompt Learning for One-Stage Weakly Supervised Referring Expression Comprehension
γ-MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models
DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension