Publication | Closed Access
Monocular Localization in HD Maps by Combining Semantic Segmentation and Distance Transform
32
Citations
25
References
2020
Year
Unknown Venue
EngineeringHd MapsRobust Long-term LocalizationMonocular LocalizationLocalizationDistance Transform3D Computer VisionImage AnalysisPattern RecognitionRobot LearningHd MapCartographyMachine VisionObject DetectionVehicle LocalizationComputer ScienceStructure From MotionDeep LearningVisual LocalizationComputer Vision3D VisionOdometryMulti-view GeometryScene Modeling
Easy, yet robust long-term localization is still an open topic in research. Existing approaches require either dense maps, expensive sensors, specialized map features or proprietary detectors.We propose using semantic segmentation on a monocular camera to localize directly in a HD map as used for automated driving. This combines lightweight, yet powerful HD maps with the simplicity of monocular vision and the flexibility of neural networks.The major challenges arising from this combination are data association and robustness against misdetections. Association is solved efficiently by applying distance transform on binary per-class images. This provides not only a fast lookup table for a smooth gradient as needed for pose-graph optimization, but also dynamic association by default.A sliding-window pose graph optimization combines single image detections with vehicle odometry, smoothing results and helping overcome even misclassifications in consecutive frames.Evaluation against a highly accurate 6D visual localization shows that our approach can achieve accuracy levels as required for automated driving, being one of the most lightweight and flexible methods to do so.
| Year | Citations | |
|---|---|---|
Page 1
Page 1