Weihang Li

Hi, I'm a PhD student with TUM CAMP, PRS, MCML supervised by Prof. Benjamin Busam and Prof. Nassir Navab. During my study, I conducted research at CAMP, PRS with Prof. Olaf Wysocki, HKUST-GZ with Prof. Haoang Li and CVG with Prof. Daniel Cremers.

My research interests lie in the interplay between 3D computer vision and robotics, with a focus on object pose, 3D/4D reconstruction, world action model and Embodied AI.

Email / Google Scholar / Github / Linkedin

Service

Conference Reviewer: CVPR, ICCV, ECCV, BMVC,ICRA, IROS, NeurIPS.
Workshop/Challenge Organizer: Co-organized Category-Level Object Pose Estimation in the Wild and Transparent & Reflective Objects in the Wild Challenge (ICCV 2025).

Research

	Object Pose Transformer: Unifying Unseen Object Pose Estimation Weihang Li, Lorenzo Garattoni, Fabien Despinoy, Nassir Navab, Benjamin Busam Computer Vision and Pattern Recognition Conference(CVPR), 2026 🏆 State-of-the-art performance on unseen-object absolute and relative pose estimation benchmarks arXiv / Project Page / Code A unified feed-forward transformer for unseen object pose estimation that jointly predicts depth, point maps, camera parameters, and NOCS to recover both absolute and relative poses.
	Flose: Generative 6D Pose Estimation via Conditional Flow Matching Amir Hamza, Davide Boscaini, Weihang Li, Benjamin Busam, Fabio Poiesi arXiv, 2026 🏆 Rank 1st on BOP Leaderboard - Model-based 6D Localization of Seen Objects arXiv / Project Page / Code Flose formulates model-based 6D pose estimation as conditional flow matching, combining geometry and appearance features with RANSAC-based registration for robust seen-object localization.
	HouseCat-TRICKY: HouseCat6D Object Pose Estimation Challenge with Specular and Transparent Objects Weihang Li et al. International Conference on Computer Vision (ICCVW), 2025 Paper / Challenge Page / Code The HouseCat-TRICKY benchmark and evaluation focus on challenging transparent and reflective object categories.
	Texture2LoD3: Enabling LoD3 Building Reconstruction With Panoramic Images Wenzhao Tang, Weihang Li, Xiucheng Liang, Olaf Wysocki, Filip Biljecki, Christoph Holst, Boris Jutzi Computer Vision and Pattern Recognition Conference Workshop on Urban Scene Modeling (CVPRW), 2025 arXiv / Project Page / Code Texture2LoD3 proposes leveraging ubiquitous street-level images and low-level building models for accurate ortho-texturing (left): Enabling accurate semantic segmentation (center) and facade-rich LoD3 reconstruction (right).
	GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation Weihang Li, Hongli Xu, Junwen Huang, HyunJun Jung, Peter KT Yu, Nassir Navab, Benjamin Busam Computer Vision and Pattern Recognition Conference (CVPR), 2025 arXiv / Project Page A semantic shape reconstruction module that recovers complete object geometry from partial observations with a global context-enhanced feature fusion mechanism that leverages category-level semantic and shape priors for robust pose prediction
	DynSUP: Dynamic Gaussian Splatting from An Unposed Image Pair Weihang Li, Weirong Chen, Shenhan Qian, Benjamin Busam, Daniel Cremers, Haoang Li IEEE Transactions on Image Processing (TIP), 2026 arXiv / Project Page / Code A novel method to achieve Gaussian splatting from an un-posed image pair in dynamic environments.
	SCRREAM: SCan, Register, REnder And Map: A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark HyunJun Jung, Weihang Li, Shun-Cheng Wu, William Bittner, Nikolas Brasch, Jifei Song, Eduardo Pérez-Pellitero, Zhensong Zhang, Arthur Moreau, Nassir Navab, Benjamin Busam In Proceedings of the Neural Information Processing Systems (NeurIPS), 2024 arXiv / Project Page / Code A framework to annotate accurate and dense 3d indoor scenes with a benchmark on novel view synthesis and SLAM
	Knowledge-based Programming by Demonstration using semantic action models for industrial assembly Junsheng Ding, Haifan, Zhang, Weihang Li, Liangwei Zhou, Alexander Perzylo International Conference on Intelligent Robots and Systems (IROS), 2024 Paper / Project Page / Code / Video A knowledge-based Programming by Demonstration (kb-PbD) paradigm to facilitate robot programming in small and medium-sized enterprises (SMEs).

Awards

[06-2024] Our team received an Honorable Mention Award in the S23DR Challenge at CVPR 2024.
[10-2025] Mentored student Haoliang to win 1st place in the WLCOP Challenge at ICCV 2025.

Teaching

Teaching Assistant

Last updated: April 2026