Publication | Closed Access
Fully Sparse Fusion for 3D Object Detection
36
Citations
52
References
2024
Year
EngineeringMachine LearningInstance Segmentation PartPoint Cloud Processing3D Computer VisionImage AnalysisData SciencePattern RecognitionDense DetectorsComputational ImagingSparse FusionMachine VisionObject DetectionComputer ScienceDeep Learning3D Object RecognitionComputer Vision3D VisionInstance Segmentation
Currently prevalent multi-modal 3D detection methods rely on dense detectors that usually use dense Bird's-Eye-View (BEV) feature maps. However, the cost of such BEV feature maps is quadratic to the detection range, making it not scalable for long-range detection. Recently, LiDAR-only fully sparse architecture has been gaining attention for its high efficiency in long-range perception. In this paper, we study how to develop a multi-modal fully sparse detector. Specifically, our proposed detector integrates the well-studied 2D instance segmentation into the LiDAR side, which is parallel to the 3D instance segmentation part in the LiDAR-only baseline. The proposed instance-based fusion framework maintains full sparsity while overcoming the constraints associated with the LiDAR-only fully sparse detector. Our framework showcases state-of-the-art performance on the widely used nuScenes dataset, Waymo Open Dataset, and the long-range Argoverse 2 dataset. Notably, the inference speed of our proposed method under the long-range perception setting is 2.7× faster than that of other state-of-the-art multimodal 3D detection methods.
| Year | Citations | |
|---|---|---|
Page 1
Page 1