Yuchen Li

PhD in Computer Vision

Mohamed Bin Zayed University of Artificial Intelligence

About Me

I am a PhD researcher in Computer Vision at Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI). My research focuses on Multimodal Visual Language Models (VLMs) for image, video, 3D, and 4D generation and perception.

I am the first author of 3D CoMPaT (ECCV Oral) and 3DCoMPaT++ (TPAMI 2025, accepted). I also led the NeurIPS paper PointNeXt (1,000+ citations). My work Exploring Scaling Laws of PointNets was selected for a Spotlight talk at 3DV 2025.

I have served as a core organizer of a CVPR workshop and as a program chair and reviewer for leading AI conferences and journals such as TPAMI, IJCV, CVPR, ICCV, AAAI, TCSVT, and NeurIPS. I also contributed to Apache RocketMQ as an open-source developer.

I previously interned at Amazon Science (Prime Video, Seattle) and Sony AI (Tokyo).

Interests

Multimodal Visual Language Models (VLMs)
Image / Video / 3D / 4D Generation and Perception
Diffusion Models
3D Vision

Education

PhD in Computer Vision
Mohamed Bin Zayed University of Artificial Intelligence
MSc in Computer Science
King Abdullah University of Science and Technology
BSc in Computer Science and Technology
Southern University of Science and Technology
School of Computing (Exchange Student)
National University of Singapore
Visiting Student (Electronics and Computer Engineering)
University of British Columbia

Featured Publications

3DCoMPaT++: An Improved Large-scale 3D Vision Dataset for Compositional Recognition

Recent Publications

Yuchen Li, Guocheng Qian, Houwen Peng, Jinjie Mai, Hasan Abed Al Kader Hammoud, Mohamed Elhoseiny, Bernard Ghanem (2025). Exploring Scaling Laws of PointNets. 3DV (Spotlight).

Habib Slim, Xiang Li, Yuchen Li, Mahmoud Ahmed, Mohamed Ayman, Ujjwal Upadhyay, Ahmed Abdelreheem, Arpit Prajapati, Suhail Pothigara, Peter Wonka, Mohamed Elhoseiny (2025). 3DCoMPaT++: An Improved Large-scale 3D Vision Dataset for Compositional Recognition. TPAMI.

Yuchen Li, Ujjwal Upadhyay, Habib Slim, Tezuesh Varshney, Ahmed Abdelreheem, Arpit Prajapati, Suhail Pothigara, Peter Wonka, Mohamed Elhoseiny (2022). 3D CoMPaT Dataset: Composition of Materials on Parts of 3D Things. ECCV (Oral).

Yuchen Li, Guocheng Qian, Houwen Peng, Jinjie Mai, Hasan Abed Al Kader Hammoud, Mohamed Elhoseiny, Bernard Ghanem (2022). PointNeXt: Revisiting PointNets with Improved Training and Scaling Strategies. NeurIPS.

Ahmed Ayyad, Yuchen Li, Raden Muaz, Shadi Albarqouni, Mohamed Elhoseiny (2021). Semi-Supervised Few-Shot Learning with Prototypical RandomWalks. AAAI Workshop (Oral).

Experience

Applied Scientist Research Intern
Amazon Science - Prime Video, July 2025 – November 2025
Developed a large visual-language model with 2D and 3D perception and reasoning capabilities, bridging low-level perception and high-level reasoning.
Research Intern
Sony AI Research, June 2024 – November 2024
Researched music-driven, diffusion-based generative models that use music as guidance to generate synchronized human dance movements.
Assistant Researcher
Mohamed bin Zayed University of Artificial Intelligence, September 2023 – Present
Explored 3D and video content generation and perception using diffusion models and LLMs.
Innovation Associate
Dubai Business Associates - Emirates Airlines, September 2022 – June 2023
- Selected as one of 30 associates from 10,000 applicants for the prestigious mini-MBA program
- Served as a consultant in Emirates Airlines’ Research Department
- Implemented a data-driven AI chatbot product
- Developed a 3-year cabin crew performance strategy presented to VPs
Research Assistant
KAUST - Vision CAIR Group, August 2020 – August 2022
- Researched semi-supervised few-shot learning, meta-learning, and 3D object recognition
- Helped develop the 3D CoMPaT dataset and the PointNeXt project
Visiting Research Student
Tsinghua University - Digital Manufacturing Lab, June 2019 – July 2019
Led a team to research Micro-Organic Prediction Modeling of Cladding Process using CNNs.
Student Developer (Summer of Code) @ Apache
Apache RocketMQ, March 2019 – July 2019
Developed the Apache RocketMQ JDBC Connector (23 commits, 3.4k LoC) as an independent developer (Alibaba Summer of Code).

Education

PhD in Computer Vision
Mohamed Bin Zayed University of Artificial Intelligence, August 2023 – May 2027
Research focus on Multimodal Visual Language Models (VLMs) for image, video, 3D, and 4D generation and perception.
MSc in Computer Science
King Abdullah University of Science and Technology, August 2020 – May 2022
GPA: 3.67/4.0. Thesis: Compositional and Low-shot Understanding of 3D Objects. Visual Computing Center.
BSc in Computer Science and Technology
Southern University of Science and Technology, August 2017 – June 2021
GPA: 3.6/4.0. Thesis: Few-Shot Learning on 3D Object Recognition.
School of Computing (Exchange Student)
National University of Singapore, January 2020 – May 2020
Exchange student at NUS School of Computing.
Visiting Student (Electronics and Computer Engineering)
University of British Columbia, July 2018 – August 2018
Visiting student at UBC.