Haomin Wang ☕️
Haomin Wang

Research Intern

Shanghai AI Laboratory

About Me

Hi! My name is Haomin Wang(王昊旻). Currently I’m a Ph.D. student at Shanghai Jiao Tong University supervised by Dr. Kai Chen and Dr. Hongjie Zhang, focusing on Multimodal Large Language Models and Computer Vision. Meanwhile, I’m also a research intern at Shanghai AI Laborotary. My current research interests include applying MLLMs to vector-graphic generation and enhancing the reasoning capabilities of large multimodal models.

I’m currently looking for collaborations, feel free to contact me via email: wanghaomin@pjlab.org.cn.

Download CV
Interests
  • Multimodal Large Language Models
  • Computer Vision
Education
  • Ph.D. Artificial Intelligence

    Shanghai Jiao Tong University

  • B.Eng. Software Engineering

    Nanjing University

🔥 News
  • 2025/10: 🎉 We have released InternSVG, welcome to have a try!
  • 2025/09: 🎉 VecFormer and ArchCAD-400K are accepted by NeurIPS 2025!
  • 2025/08: 🎉 Our team released InternVL 3.5, welcome to have a try!
  • 2025/04: 🎉 Our team released InternVL 3, welcome to have a try!
Recent Publications
(2025). InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models.
(2025). InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency.
(2025). InternSpatial: A Comprehensive Dataset for Spatial Reasoning in Vision-Language Models.
(2025). Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings.
(2025). InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models.

Experience

  1. Research Intern

    Shanghai AI Laboratory

Education

  1. Ph.D. Artificial Intelligence

    Shanghai Jiao Tong University
  2. B.Eng. Software Engineering

    Nanjing University
    GPA: 4.59/5.00 Rank: Top 3%