Experience

  1. Amazon Science - Prime Video logo

    Applied Scientist Research Intern

    Amazon Science - Prime Video
    Developed a large visual-language model with 2D and 3D perception reasoning, bridging low-level perception with high-level reasoning.
  2. Sony AI Research logo

    Research Intern

    Sony AI Research
    Research on music-driven diffusion-based generative AI models, focusing on using music as guidance to generate synchronized human dance movements.
  3. Mohamed bin Zayed University of Artificial Intelligence logo

    Assistant Researcher

    Mohamed bin Zayed University of Artificial Intelligence
    Explored 3D and video content generation and perception using diffusion and LLM models.
  4. Dubai Business Associates - Emirates Airlines logo

    Innovation Associate

    Dubai Business Associates - Emirates Airlines
    • Selected as one of 30 associates from 10k applicants for the prestigious mini-MBA program
    • Served as a consultant in Emirates Airlines’ Research Department
    • Implemented a data-driven AI chatbot product
    • Developed a 3-year cabin crew performance strategy presented to VPs
  5. KAUST - Vision CAIR Group logo

    Research Assistant

    KAUST - Vision CAIR Group
    • Researched semi-supervised few-shot learning, meta-learning, and 3D object recognition
    • Developed and participated in the creation of 3DCoMPat dataset and PointNeXt project
  6. Tsinghua University - Digital Manufacturing Lab logo

    Visiting Research Student

    Tsinghua University - Digital Manufacturing Lab
    Led a team to research Micro-Organic Prediction Modeling of Cladding Process using CNNs.
  7. Apache RocketMQ logo

    Student Developer (Summer of Code) @ Apache

    Apache RocketMQ
    Developed the Apache RocketMQ JDBC Connector (23 commits, 3.4k LoC) as an independent developer (Alibaba Summer of Code).

Education

  1. Mohamed Bin Zayed University of Artificial Intelligence logo

    PhD in Computer Vision

    Mohamed Bin Zayed University of Artificial Intelligence
    Research focus on Multimodal Visual Language Models (VLMs) for image, video, 3D, and 4D generation and perception.
  2. King Abdullah University of Science and Technology logo

    MSc in Computer Science

    King Abdullah University of Science and Technology
    GPA: 3.67/4.0 Thesis: Compositional and Low-shot Understanding of 3D Objects Visual Computing Center
  3. Southern University of Science and Technology logo

    BSc in Computer Science and Technology

    Southern University of Science and Technology
    GPA: 3.6/4.0 Thesis: Few-Shot Learning on 3D Object Recognition
  4. National University of Singapore logo

    School of Computing (Exchange Student)

    National University of Singapore
    Exchange student at NUS School of Computing.
  5. University of British Columbia logo

    Visiting Student (Electronics and Computer Engineering)

    University of British Columbia
    Visiting student at UBC.
Skills & Hobbies
Technical Skills
Python
Deep Learning
3D Vision
Research Areas
Computer Vision
Generative AI
Few-Shot Learning
Awards
Workshop Organizer - Compositional 3D Vision & 3DCoMPaT Challenge
CVPR 2023 ∙ June 2023
Organized a workshop on Compositional 3D Vision and the 3DCoMPaT Challenge at CVPR 2023 in Vancouver, Canada.
ZhenResidence from Zhenfund
Zhenfund ∙ July 2023
Selected for the Entrepreneurship Preparatory Camp Activities in Shanghai, China.
Super AI Youth
Jiangmen Innovation Ventures ∙ May 2021
Recognized by Jiangmen Innovation Ventures in Beijing, China.
Honor Mention, International Interdisciplinary Contest in Modeling (Top 16%)
International Interdisciplinary Contest in Modeling ∙ April 2019
Honor Mention (Top 16%).
2nd Prize, ASC World University Student Supercomputer Challenge (Top 10%)
ASC World University Student Supercomputer Challenge ∙ June 2019
2nd Prize (Top 10%).
Languages
95%
English
100%
Chinese