Weihang Li

Hi, I'm a PhD student with TUM CAMP, PRS, MCML supervised by Prof. Benjamin Busam and Prof. Nassir Navab. During my study, I conducted research at CAMP, PRS with Prof. Olaf Wysocki, HKUST-GZ with Prof. Haoang Li and CVG with Prof. Daniel Cremers.

My research interests lie in the interplay between 3D computer vision and robotics, with a focus on object pose, 3D/4D reconstruction, world action model and Embodied AI.

Email  /  Google Scholar  /  Github  /  Linkedin

profile photo
Service

Research
OPT-Pose teaser
Object Pose Transformer: Unifying Unseen Object Pose Estimation
Weihang Li, Lorenzo Garattoni, Fabien Despinoy, Nassir Navab, Benjamin Busam

Computer Vision and Pattern Recognition Conference(CVPR), 2026
arXiv / Project Page / Code

A unified feed-forward transformer for unseen object pose estimation that jointly predicts depth, point maps, camera parameters, and NOCS to recover both absolute and relative poses.

HouseCat-TRICKY
HouseCat-TRICKY: HouseCat6D Object Pose Estimation Challenge with Specular and Transparent Objects
Weihang Li et al.

International Conference on Computer Vision (ICCVW), 2025
Paper / Challenge Page / Code

Our paper HouseCat-TRICKY has been accepted to ICCVW 2025. The benchmark and evaluation focus on challenging transparent and reflective object categories.

Results 2 for twoviewsfm
Results 1 for twoviewsfm
Texture2LoD3: Enabling LoD3 Building Reconstruction With Panoramic Images
Wenzhao Tang*, Weihang Li*, Xiucheng Liang, Olaf Wysocki, Filip Biljecki, Christoph Holst, Boris Jutzi

Computer Vision and Pattern Recognition Conference Workshop on Urban Scene Modeling (CVPRW), 2025
arXiv / Project Page / Code

Texture2LoD3 proposes leveraging ubiquitous street-level images and low-level building models for accurate ortho-texturing (left): Enabling accurate semantic segmentation (center) and facade-rich LoD3 reconstruction (right).

Results 2 for twoviewsfm
Results 1 for twoviewsfm
GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation
Weihang Li*, Hongli Xu*, Junwen Huang*, HyunJun Jung*, Peter KT Yu, Nassir Navab, Benjamin Busam

Computer Vision and Pattern Recognition Conference (CVPR), 2025
arXiv / Project Page

A semantic shape reconstruction module that recovers complete object geometry from partial observations with a global context-enhanced feature fusion mechanism that leverages category-level semantic and shape priors for robust pose prediction

Results 2 for twoviewsfm
Results 1 for twoviewsfm
DynSUP: Dynamic Gaussian Splatting from An Unposed Image Pair
Weihang Li*, Weirong Chen*, Shenhan Qian, Benjamin Busam, Daniel Cremers, Haoang Li

arXiv , 2024
arXiv / Project Page / Code

A novel method to achieve Gaussian splatting from an un-posed image pair in dynamic environments.

Results 2 for twoviewsfm
Results 1 for twoviewsfm
SCRREAM: SCan, Register, REnder And Map: A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark
HyunJun Jung, Weihang Li, Shun-Cheng Wu, William Bittner, Nikolas Brasch, Jifei Song, Eduardo Pérez-Pellitero, Zhensong Zhang, Arthur Moreau, Nassir Navab, Benjamin Busam

In Proceedings of the Neural Information Processing Systems (NeurIPS), 2024
arXiv / Project Page / Code

A framework to annotate accurate and dense 3d indoor scenes with a benchmark on novel view synthesis and SLAM

Results 2 for twoviewsfm
Results 1 for twoviewsfm
Knowledge-based Programming by Demonstration using semantic action models for industrial assembly
Junsheng Ding, Haifan, Zhang, Weihang Li, Liangwei Zhou, Alexander Perzylo

International Conference on Intelligent Robots and Systems (IROS), 2024
Paper / Project Page / Code / Video

A knowledge-based Programming by Demonstration (kb-PbD) paradigm to facilitate robot programming in small and medium-sized enterprises (SMEs).

Awards
  • [06-2024] Our team received an Honorable Mention Award in the S23DR Challenge at CVPR 2024.
  • [10-2025] Mentored student Haoliang to win 1st place in the WLCOP Challenge at ICCV 2025.

Experiences

TUM Photogrammetry & Remote Sensing Mentor: Olaf Wysocki 04/2024 - 03/2025
fortiss Robotics Lab Mentors: Junsheng Ding, Alexander Perzylo 10/2022 - 09/2023

Teaching
    Teaching Assistant: Introduction to Machine Learning, TUM, 2024.
    Teaching Assistant: Advanced Topics in 3D Computer Vision, TUM, 2025.
    Teaching Assistant: Modern Computer Vision Methods, TUM, 2025.
Last updated: April 2026