Category-Level 3D Correspondence in Camera Space via Morphable Object Priors
Abstract
Category-level 3D correspondence is learned from single images through a shared morphable object prior, enabling semantic 3D object understanding without explicit correspondence supervision.
Understanding 3D objects from images is fundamental to robotics and AR/VR applications. While recent work has made progress in category-level pose estimation, current representations fail to capture the fine-grained semantics needed for reasoning about object parts, functions, and interactions. In this work, we study category-level 3D correspondence in camera space -- predicting, from a single image, 3D locations that remain consistent across instances within a category -- and show that it can emerge without explicit correspondence supervision by learning a shared morphable object prior. To enable research in this direction, we introduce HouseCorr3D, the first large-scale benchmark for monocular category-level 3D correspondence with 178k images across 50 household object categories, 280 unique instances, and 3D keypoint annotations directly on CAD models. Crucially, HouseCorr3D provides amodal correspondence labels for occluded regions and explicit symmetry annotations, addressing key limitations of existing datasets. We further propose Morpheus, a method that learns morphable category-level shape priors by disentangling canonical shape, deformation, and object pose. Through this shared canonical grounding, semantically meaningful 3D correspondences in camera space emerge implicitly. These emerging 3D correspondences set a new state of the art on HouseCorr3D, demonstrating that semantic 3D object understanding can arise without direct correspondence supervision. Data and code are publicly available at https://github.com/GenIntel/HouseCorr3D.
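To make the idea of a shared canonical grounding concrete, here is a minimal sketch in plain Python. It is not the Morpheus implementation. It assumes, purely for illustration, that each instance is described by a shared set of canonical points, a per-point deformation offset, and a rigid pose (rotation and translation). The function names `rotation_z` and `to_camera_space`, the three-point template, and all numeric values are hypothetical. The point is only that index k of one instance corresponds to index k of another because both derive from the same canonical point.

```python
import math

def rotation_z(theta):
    """3x3 rotation about the z-axis, as nested lists (illustrative pose only)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def to_camera_space(canonical, deformation, rotation, translation):
    """Map canonical points to camera space: x_k = R (c_k + d_k) + t.

    This factorisation into shape, deformation, and pose is an assumption
    made for the sketch, not the formulation used in the paper.
    """
    camera_points = []
    for c_k, d_k in zip(canonical, deformation):
        # Instance-specific shape in the object frame.
        shaped = [c_k[i] + d_k[i] for i in range(3)]
        # Rigid pose takes the object frame into the camera frame.
        camera_points.append([
            sum(rotation[r][i] * shaped[i] for i in range(3)) + translation[r]
            for r in range(3)
        ])
    return camera_points

# Hypothetical shared template with three canonical points.
canonical = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]

# Two made-up instances of the same category: different deformation and pose.
instance_a = to_camera_space(
    canonical,
    [[0.0, 0.0, 0.0], [0.2, 0.0, 0.0], [0.0, 0.0, 0.0]],
    rotation_z(0.0),
    [0.0, 0.0, 2.0],
)
instance_b = to_camera_space(
    canonical,
    [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0], [0.0, 0.5, 0.0]],
    rotation_z(math.pi / 2),
    [0.5, 0.0, 3.0],
)

# Row k of instance_a and row k of instance_b are corresponding 3D points
# in their respective camera frames, since both come from canonical point k.
for k, (p_a, p_b) in enumerate(zip(instance_a, instance_b)):
    print(k, [round(v, 3) for v in p_a], [round(v, 3) for v in p_b])
```

In this toy setting no correspondence label is ever used: the pairing between the two instances follows from the shared index into the canonical template.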
Community
We introduce HouseCorr3D, a large-scale benchmark for category-level 3D correspondence with 178k images across 50 household object categories. The dataset uniquely includes amodal correspondence labels for occluded regions and explicit symmetry annotations. We propose Morpheus, which learns morphable shape priors to enable 3D correspondence without explicit supervision, achieving SOTA results on HouseCorr3D.
Feel free to star our repo to get notified as soon as the data and code come out!