Haomin Wang ☕️
Haomin Wang

Research Intern

Shanghai AI Laboratory

About Me

Hi! My name is Haomin Wang(王昊旻). Currently I’m a Ph.D. student at Shanghai Jiao Tong University, focusing on Multimodal Large Language Models and Computer Vision. Meanwhile, I’m a research intern at Shanghai AI Laborotary. My current research interests include applying MLLMs to vector-graphic generation and enhancing the reasoning capabilities of large multimodal models.

Download CV
Interests
  • Multimodal Large Language Models
  • Computer Vision
Education
  • Ph.D. Artificial Intelligence

    Shanghai Jiao Tong University

  • B.Eng. in Software Engineering

    Nanjing University

Recent Publications
(2025). Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings.
(2025). InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models.
(2025). ArchCAD-400K: An Open Large-Scale Architectural CAD Dataset and New Baseline for Panoptic Symbol Spotting.

Experience

  1. Research Intern

    Shanghai AI Laboratory

Education

  1. Ph.D. Artificial Intelligence

    Shanghai Jiao Tong University
  2. B.Eng. in Software Engineering

    Nanjing University
    GPA: 4.59/5.00 Rank: Top 3%