Lei (Max) Zhang - Personal Website

[New!] I am joining NVIDIA Research as a research intern in Summer 2026.

I am a second-year Ph.D. student in Computer Science & Engineering at the University of California San Diego, working with Prof. Julian McAuley on tackling challenges in today’s multimodal large language models. Before that, I earned a BS in Software Engineering from South China University of Technology and an MS in Computer Science from Zhejiang University. I am recognized as a 🎖 Notable Reviewer at ICLR.

I have also spent time as a research scientist intern at Meta GenAI, mentored by Zecheng He.

Research

Multimodal Intelligence — Advancing both the native capabilities and agentic wisdom of multimodal models to empower them with more sophisticated reasoning abilities in understanding and generation.

Scalable Training Methodology — Developing data curation, synthesis, evaluation and training strategies to scale multimodal models toward stronger and more generalizable intelligence.

Selected Publications

For a complete list of publications, please visit my Google Scholar.

ECCV 2026 #1 Paper of the day

Think in Strokes, Not Pixels: Process-driven Image Generation via Interleaved Reasoning

Lei Zhang, Junjiao Tian, Zhipeng Fan, Kunpeng Li, Jialiang Wang, Weifeng Chen, Markos Georgopoulos, Felix Juefei-Xu, Julian McAuley, Manling Li, Zecheng He

European Conference on Computer Vision (ECCV), 2026

ICLR 2025

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation

Fangxun Shu*, Yue Liao*, Lei Zhang*, Le Zhuo, Chenning Xu, Guanghao Zhang, Haonan Shi, et al.

International Conference on Learning Representations (ICLR), 2025

Tech Report

Audio-Visual LLM for Video Understanding

Fangxun Shu*, Lei Zhang*, Hao Jiang, Cihang Xie

arXiv preprint, 2024

Tech Report

Filter & Align: Leveraging Human Knowledge to Curate Image-Text Data

Lei Zhang, Fangxun Shu, Tianyang Liu, Sucheng Ren, Hao Jiang, Cihang Xie

arXiv preprint, 2024

ICCV 2023

Towards Fairness-aware Adversarial Network Pruning

Lei Zhang, Zhibo Wang, Xiaowei Dong, Yunhe Feng, Xiaoyi Pang, Zhifei Zhang, Kui Ren

IEEE International Conference on Computer Vision (ICCV), 2023

CVPR 2023 Highlight

Accelerating Dataset Distillation via Model Augmentation

Lei Zhang, Jie Zhang, Bowen Lei, Subhabrata Mukherjee, Xiang Pan, Bo Zhao, Caiwen Ding, Li Yao, Dongkuan Xu

IEEE / CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023 · Highlight

Miscellaneous

In my spare time, I am an avid reader, a dedicated foodie on a mission to find the best local eats, and a traveler at heart. Find me on Instagram for a glimpse into my adventures and quietly refined life!