Hi there! 👋 I am Yaxin Luo.

About Me

Hello! I am a First-Year Machine Learning PhD student at MBZUAI under the supervision of Prof. Zhiqiang Shen. I am also closely working with my friend Xiaofu Chen. Previously, I received my Bachelor’s degree from Technical University of Denmark supervised by Prof. Dim P. Papadopoulos. My research interests span in :

  • Multimodal Foundation Model: Developing native multimodal foundation models which can perform both understanding and generation tasks from video,language. (My Long-Term never changed research interest and belief)
  • Efficient deep learning: I am interested in how to train or inference large models more efficiently, especially tackling the sparsity nature of nerual networks. (Support the first goal)
  • Multimodal Agents: Innovate easy to use Multimodal Agents to automate complicated real-world complex tasks. (Application of the first goal)

News

  • [2025-05-16] DRAG is accepted by ACL 2025 main conference!!
  • [2025-01-22] γ-MoD is accepted by ICLR 2025, see you in Singapore!

Selected Publications

( * indicate equal contribution)

For full and up-to-date publication list, please refer to my Google Scholar page.

  • APL: Anchor-Based Prompt Learning for One-Stage Weakly Supervised Referring Expression Comprehension
    • ECCV 2024
    • Yaxin Luo,Jiayi Ji, Xiaofu Chen, Yuxin Zhang, Tianhe Ren, Gen Luo
    • PaperCode
  • γ-MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models
    • ICLR 2025
    • Yaxin Luo, Gen Luo, Jiayi Ji, Yiyi Zhou, Xiaoshuai Sun, Zhiqiang Shen, Rongrong Ji
    • PaperCode
  • DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension
    • CVPR 2025
    • Xiaofu Chen, Yaxin Luo, Gen Luo, Jiayi Ji, Henghui Ding, Yiyi Zhou
    • PaperCode