Weihao Yuan – Weihao Yuan Homepage

I am Weihao Yuan (原玮浩), currently an algorithm expert at Alibaba. I obtained my PhD in Robotics Institute, Department of Electronic & Computer Engineering, Hong Kong University of Science and Technology (HKUST), supervised by Prof. Michael Yu Wang and co-supervised by Prof. Qifeng Chen. I visited the Robotics, Perception and Learning Lab, Royal Institute of Technology (KTH) for 6 months, supervised by Prof. Danica Kragic . I received my B.S. degree from College of Control Science and Engineering, Zhejiang University, majoring in Automation and minoring in Finance.

My research areas include Deep Reinforcement Learning, Embodied AI, 3D Reconstruction & Generation, 3D Human

News:

2025-06-26
Two paper accepted to International Conference on Computer Vision (ICCV 2025):
- LHM: Animatable Human Reconstruction from a Single Image in One Second
- Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration

2025-04-20
One paper accepted to SIGGRAPH 2025:
- LAM: Large Avatar Model for One-shot Animatable Gaussian Head

2025-02-27
Two paper accepted to IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2025):
- Motions as Queries: One-Stage Multi-Person Holistic Human Motion Capture
- AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction

2025-01-23
One paper accepted to International Conference on Learning Representations (ICLR 2025):
- LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning

2024-10-10
Two papers accepted to Neural Information Processing Systems (NeurIPS):
- GIC: Gaussian-Informed Continuum for Physical Property Identification and Simulation Oral
- MoGenTS: Motion Generation based on Spatial-Temporal Joint Modeling

2024-07-30
One paper accepted to SIGGRAPH Asia & ACM Transactions on Graphics (TOG):
- MVImgNet2.0: A Larger-scale Dataset of Multi-view Images

2024-07-10
One paper accepted to IEEE Robotics and Automation Letters (RA-L):
- OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation

2024-07-05
Three papers accepted to European Conference on Computer Vision (ECCV 2024):
- Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition - An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes Using Pre-Trained Text-2-Image Models - High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding

2024-02-27
Three papers accepted to IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2024):
- IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images Highlight- RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3DHighlight- GPLD3D: Latent Diffusion of 3D Shape Generative Models by Enforcing Geometric and Physical Priors Oral

2023-03-06
One paper accepted to IEEE Robotics and Automation Letters (RA-L):
- DRO: Deep Recurrent Optimizer for Video to Depth

2023-01-21
Two papers accepted to International Conference on Learning Representations (ICLR 2023):
- 3D Former: Monocular Scene Reconstruction with 3D SDF Transformers - DIM-SLAM: DENSE RGB SLAM WITH NEURAL IMPLICIT MAPS

2022-03-03
Two papers accepted to IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022):
- NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation - RCP: Recurrent Closest Point for Scene Flow Estimation on 3D Point Clouds Oral

2021-07-01
One paper accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2021):
- Stereo Matching by Self-supervision of Multiscopic Vision

2020-07-01
Two papers accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020):
- Self-supervised Object Tracking with Cycle-consistent Siamese Networks - Multi-Object Rearrangement with Monte Carlo Tree Search: A Case Study on Planar Nonprehensile Sorting

2020-01-24
One paper accepted to IEEE Robotics and Automation Letters (RA-L):
- MFuseNet: Robust Depth Estimation with Learned Multiscopic Fusion

2020-01-22
One paper accepted to IEEE International Conference on Robotics and Automation (ICRA 2020):
- MFuseNet: Robust Depth Estimation with Learned Multiscopic Fusion

2019-09-09
One paper accepted to Conference on Robot Learning (CoRL 2019) as oral presentation! (Acceptation rate: 5.3%):
- Toward Learning to Detect and Predict Contact Events on Vision-based Tactile Sensor

2019-06-06
One paper accepted to Robotics and Autonomous Systems (RAS):
- End-to-End Nonprehensile Rearrangement with Deep Reinforcement Learning and Simulation-to-Reality Transfer

2019-02-01
One paper accepted by IEEE International Conference on Robotics and Automation (ICRA 2019), May 20-24, 2019, Montreal, Canada:
- Reinforcement Learning in Topology-based Representation for Human Body Movement with Whole Arm Manipulation

2018-03-27
I have begun the visiting in KTH Royal Institute of Technology, Storkholm, Sweden, as a visiting researcher for next 6 months, working with Prof. Danica Kragic and Dr. Johannes Andreas Stork.

2018-01-12
One paper accepted to IEEE International Conference on Robotics and Automation (ICRA 2018), May 21-25, 2018, Brisbane, Australia.
- Rearrangement with Nonprehensile Manipulation Using Deep Reinforcement Learning