I received my Master's degree at Fudan University in 2025 under the supervision of Prof. Li Zhang, and earned my Bachelor's degree at Fudan University in 2022.
I'm interested in 3D reconstruction, generation and understanding.
The first unified framework that jointly supports understanding and generation for 3D modalities. Meanwhile, this work developed a geometry-aware vision encoder distillation strategy to enhance spatial perception.
An exploratory study on 3D scene understanding from multi-view images, introducing a spatial understanding benchmark that emphasizes numerical QA and feature matching across views, along with a multi-view 3D grounding method.
A method that models both rigid and non-rigid dynamic objects in urban scenes using Bézier curves, addressing inaccurate foreground poses and the challenges of modeling non-rigid objects.
An autonomous driving simulation system based on neural reconstruction and generation supporting scene editing, illumination adaptation, and foreground generation.
Combining the (coarse) planar rendering and the (fine) volume rendering to achieve higher rendering
quality and better generalizations. A depth teacher net that predicts dense
pseudo depth maps is used to supervise the joint rendering mechanism and boost the learning of
consistent 3D geometry.