Shaocong Dong

I am a first-year CSE Ph.D. student of The Hong Kong University of Science and Technology, and my advisor is Prof. Dan Xu. I received both my Bachelor's and Master's degrees from Beijing Institute of Technology, supervised by Prof. Jianan Li. I spent a wonderful time in Qcraft and SenseTime as a research intern, mentored by Boyin Zhang and Dr. Zhanpeng Huang, respectively.

My current research interest lies in 3D vision, especially for high-quality, controllable 3D Generation with Diffusion Model and NeRF.

Wechat  /  Google Scholar  /  Github
Email: sdongae at cse dot ust dot hk or dongshaocong at outlook dot com

profile photo
Research

I have been working on Diffusion Model for 3D generation. My previous work also included 3D perception on Point Clouds. (*: Equal Contribution †: Corresponding Author)

Interactive3D: Create What You Want by Interactive 3D Generation
Shaocong Dong*, Lihe Ding*, Zhanpeng Huang, Zibin Wang, Tianfan Xue†, Dan Xu†
CVPR, 2024
project page / paper / video

Interactive3D grants users precise control over the generative process through extensive 3D interaction capabilities.

Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors
Lihe Ding*, Shaocong Dong*, Zhanpeng Huang, Zibin Wang†, Yiyuan Zhang, Kaixiong Gong, Dan Xu, Tianfan Xue
arXiv, 2023
project page / paper / video

We intergrate both 3D and 2D diffusion models with powerful priors into a unified framework with bidirectional guidance.

FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-world Point Clouds
Lihe Ding*, Shaocong Dong*, Tingfa Xu†, Xinli Xu, Jie Wang, Jianan Li
ECCV, 2022   (Oral Presentation)
project page / paper / video

We establish new lidar-scanned scene flow datasets and propose a fast and hierarchical network for real-world scene flow estimation.

MsSVT: Mixed-scale Sparse Voxel Transformer for 3D Object Detection on Point Clouds
Shaocong Dong*, Lihe Ding*, Haiyang Wang, Tingfa Xu†, Xinli Xu, Jie Wang, Ziyang Bian, Ying Wang, Jianan Li
NeurIPS, 2022
paper / code

We propose the first powerful 3D window-based transformer backbone on sparse 3D voxels leveraging mixed-scale information.

CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds
Haiyang Wang*, Lihe Ding*, Shaocong Dong, Shaoshuai Shi†, Aoxue Li, Jianan Li, Zhenguo Li, Liwei Wang
NeurIPS, 2022
paper / code

We propose a novel class-aware 3D proposal generation strategy and an efficient fully sparse convolutional 3D refinement module for vote-based Indoor 3D Detection.

FusionRCNN: LiDAR-Camera Fusion for Two-stage 3D Object Detection
Xinli Xu, Shaocong Dong, Lihe Ding, Jie Wang, Tingfa Xu, Jianan Li
arXiv, 2022
paper / code

We propose a novel multi-modality two-stage approach to effectively and efficiently fuse point clouds and camera images in the Regions of Interest(RoI).

Education
PhD of CSE @ The Hong Kong University of Science and Technology
Sep. 2023 - Now
Advisor: Prof.Dan Xu
MSc in Opt-Electronics information Science and Engineeering @ Beijing Institute of Technology
Sep. 2020 - Jun. 2023
Advisor: Prof.Jianan Li
BSc in Opt-Electronics information Science and Engineeering @ Beijing Institute of Technology
Sep. 2016 - Jun. 2020
Experience
Metaverse Video R&D at SenseTime
Research Intern
Text-to-3D Generation using both 2D and 3D priors
May, 2023 - Sep, 2023
Mentor: Dr. Zhanpeng Huang
3DAR Group
Institute for Interdisciplinary Information Sciences (IIIS) at Tsinghua University
Research Assistant
3D Generation with diffusion model and implicit fuction
May, 2022 - Sep, 2022
Advisor: Prof. Li Yi
3D Visual Computing and Machine Intelligence (3DVICI) Lab
Qcraft
Research Intern
Beijing, China
May, 2021 - Mar, 2022
Mentor: Boyin Zhang
Perception & Machine Learning Group

Template from JonBarron