Haomin Wang ☕️
Haomin Wang

Research Intern

Shanghai AI Laboratory

About Me

Hi! My name is Haomin Wang(王昊旻). Currently I’m a Ph.D. student at Shanghai Jiao Tong University supervised by Prof. Jifeng Dai and Dr. Hongjie Zhang, focusing on Multimodal Large Language Models and Computer Vision. Meanwhile, I’m also a research intern at Shanghai AI Laborotary. My current research interests include applying MLLMs to vector-graphic generation and enhancing the reasoning capabilities of large multimodal models.

I’m currently looking for collaborations, feel free to contact me via email: wanghaomin@pjlab.org.cn.

Download CV
Interests
  • Multimodal Large Language Models
  • Computer Vision
Education
  • Ph.D. Artificial Intelligence

    Shanghai Jiao Tong University

  • B.Eng. Software Engineering

    Nanjing University

🔥 News
  • 2025/10: 🎉 We have released InternSVG, welcome to have a try!
  • 2025/09: 🎉 VecFormer and ArchCAD-400K are accepted by NeurIPS 2025!
  • 2025/08: 🎉 Our team released InternVL 3.5, welcome to have a try!
  • 2025/04: 🎉 Our team released InternVL 3, welcome to have a try!
Recent Publications
(2025). InternSpatial: A Comprehensive Dataset for Spatial Reasoning in Vision-Language Models.
(2025). Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings.
(2025). InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models.
(2025). ArchCAD-400K: An Open Large-Scale Architectural CAD Dataset and New Baseline for Panoptic Symbol Spotting.

Experience

  1. Research Intern

    Shanghai AI Laboratory

Education

  1. Ph.D. Artificial Intelligence

    Shanghai Jiao Tong University
  2. B.Eng. Software Engineering

    Nanjing University
    GPA: 4.59/5.00 Rank: Top 3%