Haozhe Xie (谢浩哲)

Research Fellow at MMLab@NTU

College of Computing and Data Science
Nanyang Technological University

Emails:

  • General Inquiries: root [at] haozhexie [dot] com
  • Academic Matters: academic [at] haozhexie [dot] com

Google Scholar / Curriculum Vitae / GitHub / Twitter

Biography

I am a Research Fellow at MMLab@NTU, Nanyang Technological University, working with Prof. Ziwei Liu. Previously, I was a Senior Research Scientist at Tencent AI Lab from 2021 to 2023.
I received my Ph.D. from the VILab at Harbin Institute of Technology in 2021, supervised by Prof. Hongxun Yao. During my Ph.D., I also interned at SenseTime Research, mentored by Dr. Wenxiu Sun.
My research focuses on computer vision, with a particular emphasis on 3D vision and robotics.

Selected Publications
MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction
  • Haitian Li*
  • Haozhe Xie*
  • Junxiang Xu
  • Beichen Wen
  • Fangzhou Hong
  • Ziwei Liu

arXiv 2603.19231

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human–Scene Interactions
  • Yukang Cao
  • Haozhe Xie
  • Fangzhou Hong
  • Long Zhuo
  • Zhaoxi Chen
  • Liang Pan
  • Ziwei Liu

arXiv 2603.15612

Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer
  • Chenyang Gu*
  • Mingyuan Zhang*
  • Haozhe Xie*
  • Zhongang Cai
  • Lei Yang
  • Ziwei Liu

arXiv 2603.19227

InfiniteDance: Scalable 3D Dance Generation Towards in-the-wild Generalization
  • Ronghui Li
  • Zhongyuan Hu
  • Siyao Li
  • Youliang Zhang
  • Haozhe Xie
  • Mingyuan Zhang
  • Jie Guo
  • Xiu Li
  • Ziwei Liu

arXiv 2603.13375

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation
  • Haozhe Xie*
  • Beichen Wen*
  • Jiarui Zheng
  • Zhaoxi Chen
  • Fangzhou Hong
  • Haiwen Diao
  • Ziwei Liu

arXiv 2601.22153

Compositional Generative Model of Unbounded 4D Cities
  • Haozhe Xie
  • Zhaoxi Chen
  • Fangzhou Hong
  • Ziwei Liu

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 48.1 (2026): 312-328

CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation
  • Yukang Cao*
  • Xinying Guo*
  • Mingyuan Zhang
  • Haozhe Xie
  • Chenyang Gu
  • Ziwei Liu

International Journal of Computer Vision (IJCV) 134.1 (2026): 29

3D Scene Generation: A Survey
  • Beichen Wen*
  • Haozhe Xie*
  • Zhaoxi Chen
  • Fangzhou Hong
  • Ziwei Liu

arXiv 2505.05474

Generative Gaussian Splatting for Unbounded 3D City Generation
  • Haozhe Xie
  • Zhaoxi Chen
  • Fangzhou Hong
  • Ziwei Liu

CVPR 2025

3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
  • Zhaoxi Chen
  • Jiaxiang Tang
  • Yuhao Dong
  • Ziang Cao
  • Fangzhou Hong
  • Yushi Lan
  • Tengfei Wang
  • Haozhe Xie
  • Tong Wu
  • Shunsuke Saito
  • Liang Pan
  • Dahua Lin
  • Ziwei Liu

CVPR 2025

DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes
  • Hengwei Bian
  • Lingdong Kong
  • Haozhe Xie
  • Liang Pan
  • Yu Qiao
  • Ziwei Liu

ICLR 2025

Multi-view Consistent 3D Panoptic Scene Understanding
  • Xianzhu Liu
  • Xin Sun
  • Haozhe Xie
  • Zonglin Li
  • Ru Li
  • Shengping Zhang

AAAI 2025

2D Semantic-Guided Semantic Scene Completion
  • Xianzhu Liu
  • Haozhe Xie
  • Shengping Zhang
  • Hongxun Yao
  • Rongrong Ji
  • Liqiang Nie
  • Dacheng Tao

International Journal of Computer Vision (IJCV) 133.3 (2025): 1306-1325

CityDreamer: Compositional Generative Model of Unbounded 3D Cities
  • Haozhe Xie
  • Zhaoxi Chen
  • Fangzhou Hong
  • Ziwei Liu

CVPR 2024

Learning Geometric Transformation for Point Cloud Completion
  • Shengping Zhang
  • Xianzhu Liu
  • Haozhe Xie
  • Liqiang Nie
  • Huiyu Zhou
  • Dacheng Tao
  • Xuelong Li

International Journal of Computer Vision (IJCV) 131.9 (2023): 2425-2445

Long-Range Feature Propagating for Natural Image Matting
  • Qinglin Liu
  • Haozhe Xie
  • Shengping Zhang
  • Bineng Zhong
  • Rongrong Ji

ACM Multimedia 2021

3D Scene and Object Reconstruction from Multiple Sources and Viewpoints
  • Haozhe Xie

PhD Thesis, Harbin Institute of Technology, 2021

Efficient Regional Memory Network for Video Object Segmentation
  • Haozhe Xie
  • Hongxun Yao
  • Shangchen Zhou
  • Shengping Zhang
  • Wenxiu Sun

CVPR 2021

GRNet: Gridding Residual Network for Dense Point Cloud Completion
  • Haozhe Xie
  • Hongxun Yao
  • Shangchen Zhou
  • Jiageng Mao
  • Shengping Zhang
  • Wenxiu Sun

ECCV 2020

Pix2Vox++: Multi-scale Context-aware 3D Object Reconstruction from Single and Multiple Images
  • Haozhe Xie
  • Hongxun Yao
  • Shengping Zhang
  • Shangchen Zhou
  • Wenxiu Sun

International Journal of Computer Vision (IJCV) 128.12 (2020): 2919-2935

Pix2Vox: Context-aware 3D Reconstruction from Single and Multi-view Images
  • Haozhe Xie
  • Hongxun Yao
  • Xiaoshuai Sun
  • Shangchen Zhou
  • Shengping Zhang

ICCV 2019

Research Experience

Research Fellow
Mar 2023 - Present | MMLab@NTU, Nanyang Technological University
Working with Prof. Ziwei Liu.

Senior Research Scientist
Aug 2021 - Mar 2023 | Tencent AI Lab
Working with Dr. Hong Shang. Outstanding Contributor (2022H1) & Excellent Individual (2022H2)

Research Intern
Mar 2019 - Nov 2020 | SenseTime Research
Mentored by Dr. Wenxiu Sun. Outstanding Intern (2019H2)

Invited Talks
Academic Services
Teaching
  • NTU AI6126: Advanced Computer Vision, Teaching Assistant, Spring 2025.
  • HIT CS32261: Audio-Visual Signal Processing, Teaching Assistant, Fall 2018.