I am a Research Scientist at Google . I earned my Ph.D. in Electrical and Computer Engineering at University of California, Los Angeles (UCLA) in 2026, where I was gratefully advised by Prof. Achuta Kadambi. Previously, I obtained my M.S. in Electrical Engineering from Columbia University in 2021, where I was fortunate to be advised by Prof. John Wright. I received my B.E. in Electronic Information Engineering from University of Electronic Science and Technology of China (UESTC) in 2019.

My research focuses on 3D computer vision and multimodal generative modeling, advancing world models with spatial intelligence that integrate geometry, semantics, dynamics, and interactivity. My recent work extends visual reconstruction, generation, and reasoning from 2D into the 3D/4D domain, enabling physically grounded spatial intelligence through coherent scene representation and multimodal modeling for applications in mixed reality, robotics, and agentic AI. During my Ph.D., I also had the privilege of collaborating with Prof. Leonidas Guibas at Stanford University and Prof. Atlas Wang at the University of Texas at Austin.

My industry experience includes research internships at Apple Apple (2025) and Google (2024), as well as leading a research collaboration with (2025).

🔥 News

2026.02: Two papers accepted to CVPR 2026.
2025.11: VLM4D was selected as Best of ICCV by Voxel51 (the invited talk recording can be found here)! 🎉
2025.06: One paper accepted to ICCV 2025.
2025.05: Awarded Dissertation Year Award from UCLA.
2025.04: X-Dyna was selected as a Highlight paper at CVPR 2025 (2.98% of 13008 submissions)! 🎉
2025.02: Two papers accepted to CVPR 2025.
2025.02: 4K4DGen (4D version of DreamScene360) was selected as a Spotlight at ICLR 2025 (5.1% of 11565 submissions)! 🎉
2025.01: One paper accepted to ICLR 2025.
2024.10: Awarded J.B. Fourier Scholar in Vision and Graphics from UCLA.
2024.09: One paper accepted to NeurIPS 2024.
2024.07: One paper accepted to ECCV 2024.
2024.04: Feature 3DGS was selected as a Highlight paper at CVPR 2024 (2.8% of 11532 submissions)! 🎉
2024.02: One paper accepted to CVPR 2024.
2023.02: One paper accepted to CVPR 2023.
2021.09: Awarded Graduate Dean’s Scholar Award from UCLA.
2020.06: Awarded MS Honors Student from Columbia University.

📝 Selected Publications

* indicates equal contribution

ICCV 2025

VLM4D: Towards Spatiotemporal Awareness in Vision Language Models

Shijie Zhou*, Alexander Vilesov*, Xuehai He*, Ziyu Wan, Shuwang Zhang, Aditya Nagachandra, Di Chang, Dongdong Chen, Xin Eric Wang, Achuta Kadambi

Paper | Project

We propose the first benchmark explicitly designed to rigorously evaluate the spatiotemporal reasoning capabilities of Vision-Language Models (VLMs).

CVPR 2025

Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields

Shijie Zhou*, Hui Ren*, Yijia Weng, Shuwang Zhang, Zhen Wang, Dejia Xu, Zhiwen Fan, Suya You, Zhangyang Wang, Leonidas Guibas, Achuta Kadambi

Paper | Project

Building 4D interactive scenes with agentic AI from monocular videos, by dynamically distilling model-conditioned features and integrating 2D foundation models with LLMs in feedback loops.

ECCV 2024

DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting

Shijie Zhou*, Zhiwen Fan*, Dejia Xu*, Haoran Chang, Pradyumna Chari, Tejas Bharadwaj, Suya You, Zhangyang Wang, Achuta Kadambi

Paper | Project

We introduce a 3D scene generation pipeline that creates immersive scenes with full 360$^{\circ}$ coverage from text prompts of any level of specificity.

CVPR 2024

Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields

Shijie Zhou, Haoran Chang*, Sicheng Jiang*, Zhiwen Fan, Zehao Zhu, Dejia Xu, Pradyumna Chari, Suya You, Zhangyang Wang, Achuta Kadambi

Paper | Project (CVPR 2024 Highlight)

Feature 3DGS 🪄, distills feature fields from 2D foundation models, opening the door to a brand new semantic, editable, and promptable explicit 3D scene representation.

CVPR 2023

ALTO: Alternating Latent Topologies for Implicit 3D Reconstruction

Zhen Wang*, Shijie Zhou*, Jeong Joon Park, Despoina Paschalidou, Suya You, Gordon Wetzstein, Leonidas Guibas, Achuta Kadambi

Paper | Project

Rethinking latent topologies for fast and detailed implicit 3D reconstructions.

ICLR 2025 4K4DGen: Panoramic 4D Generation at 4K Resolution, Renjie Li, Panwang Pan, Bangbang Yang, Dejia Xu, Shijie Zhou, Xuanyang Zhang, Zeming Li, Achuta Kadambi, Zhangyang Wang, Zhengzhong Tu, Zhiwen Fan (Spotlight)

CVPR 2025 X-Dyna: Expressive Dynamic Human Image Animation, Di Chang, Hongyi Xu, You Xie, Yipeng Gao, Zhengfei Kuang, Shengqu Cai, Chenxu Zhang, Guoxian Song, Chao Wang, Yichun Shi, Zeyuan Chen, Shijie Zhou, Linjie Luo, Gordon Wetzstein, Mohammad Soleymani (Highlight)

NeurIPS 2024 Large Spatial Model: End-to-end Unposed Images to Semantic 3D, Zhiwen Fan, Jian Zhang, Wenyan Cong, Peihao Wang, Renjie Li, Kairun Wen, Shijie Zhou, Achuta Kadambi, Zhangyang Wang, Danfei Xu, Boris Ivanovic, Marco Pavone, Yue Wang

🎖 Honors and Awards

2025 Dissertation Year Award, UCLA
2024 J.B. Fourier Scholar in Vision and Graphics, UCLA
2021 Graduate Dean’s Scholar Award, UCLA
2020 MS Honors Student, Columbia University
2019 Outstanding Graduate, University of Electronic Science and Technology of China
2018 James Watt Scholar, University of Glasgow

💻 Work Experience

2025.04 - 2025.09, Research Intern at Apple
2024.06 - 2024.11, Student Researcher at
2023.06 - 2023.09, Visiting Academic at USC Institute for Creative Technologies

📖 Teaching

Teaching Assistant @ UCLA: ECE188 Computer Vision, ECE113 Digital Signal Processing
Teaching Assistant @ Columbia: EECS6690 Statistical Learning, ELEN6885 Reinforcement Learning
Teaching Assistant @ UoG & UESTC: 1008 Microelectronic Systems, 3010 Team Design Project and Skills

🖋️ Service

Conference Reviewer:
SIGGRAPH 2025, SIGGRAPH Asia 2025, CVPR 2025/2026, ICCV 2025, ECCV 2024/2026, NeurIPS 2025/2024, ICLR 2025, ICML 2025, 3DV 2025/2026, BMVC 2026
Journal Reviewer:
International Journal of Computer Vision, Transactions on Machine Learning Research, IEEE Transactions on Image Processing, IEEE Transactions on Multimedia, Pattern Recognition
Workshop Reviewer:
End-to-End 3D Learning @ ICCV 2025, AI for 3D Generation @ CVPR 2024, AI for 3D Content Creation @ ICCV 2023