Publication | Closed Access
GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation
405
Citations
43
References
2021
Year
Unknown Venue
EngineeringMachine LearningHuman Pose Estimation3D Pose Estimation3D Computer VisionImage AnalysisPattern RecognitionRobot LearningComputational GeometrySingle Rgb ImageGeometric ModelingMachine VisionDeep LearningPose Estimation3D Object RecognitionComputer Vision3D VisionNatural SciencesObject Pose EstimationMulti-view GeometryMonocular 6DScene Modeling
6D pose estimation from a single RGB image is a fundamental task in computer vision. The current top-performing deep learning-based methods rely on an indirect strategy, i.e., first establishing 2D-3D correspondences between the coordinates in the image plane and object coordinate system, and then applying a variant of the PnP/RANSAC algorithm. However, this two-stage pipeline is not end-to-end trainable, thus is hard to be employed for many tasks requiring differentiable poses. On the other hand, methods based on direct regression are currently inferior to geometry-based methods. In this work, we perform an in-depth investigation on both direct and indirect methods, and propose a simple yet effective Geometry-guided Direct Regression Network (GDR-Net) to learn the 6D pose in an end-to-end manner from dense correspondence-based intermediate geometric representations. Extensive experiments show that our approach remarkably outperforms state-of-the-art methods on LM, LM-O and YCB-V datasets. Code is available at https://git.io/GDR-Net.
| Year | Citations | |
|---|---|---|
Page 1
Page 1