Publications

(2026). Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning.
(2025). [ICLR 2026] InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models.
(2025). InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency.
(2025). [ICLR 2026] InternSpatial: A Comprehensive Dataset for Spatial Reasoning in Vision-Language Models.
(2025). [NIPS 2025] Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings.
(2025). InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models.
(2025). [NIPS 2025] ArchCAD-400K: An Open Large-Scale Architectural CAD Dataset and New Baseline for Panoptic Symbol Spotting.