I am a second-year Ph.D. student at the State Key Laboratory of CAD&CG, Zhejiang University, advised by Prof. Xiaowei Zhou. Currently, I am also a research intern at ByteDance Seed.
My recent research focuses on two directions:
1. Developing 3D/4D foundation models for reconstruction and spatial perception (see the SpatialTracker series);
2. Building 3D/4D capabilities for multimodal large language models by combining foundation models with principles from cognitive science.