Publication | Open Access
DyFusion: Cross-Attention 3D Object Detection with Dynamic Fusion
Citations: 25
References: 37
Year: 2024
Engineering, Machine Learning, Insufficient Data Augmentation, Point Cloud Processing, Point Cloud, 3D Computer Vision, Image Analysis, Data Science, Pattern Recognition, Robot Learning, Sensor Fusion, Machine Vision, Object Detection, Fusion Algorithms, Computer Science, Autonomous Driving, Medical Image Computing, Deep Learning, 3D Object Recognition, Computer Vision, 3D Vision, Dynamic Fusion
In autonomous driving, LiDAR and camera sensors are indispensable, providing the observational data needed for precise 3D object detection. Existing fusion algorithms exploit the complementary data from both sensors. However, these methods typically concatenate raw point cloud data with pixel-level image features, a process that introduces errors and loses critical information embedded in each modality. To mitigate this loss of feature information, this paper proposes a Cross-Attention Dynamic Fusion (CADF) strategy that dynamically fuses the two heterogeneous data sources. To address insufficient data augmentation across these two modalities, we further propose a Synchronous Data Augmentation (SDA) strategy designed to enhance training efficiency. We evaluated our method on the KITTI and nuScenes datasets. Our top-performing model attained 82.52% mAP on the KITTI test benchmark, outperforming other state-of-the-art methods.
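The abstract does not specify how CADF is built, so the following is only a generic sketch of the idea it names: point features query image features through scaled dot-product cross-attention, and a learned gate then blends the two per channel instead of concatenating them. The function name `cross_attention_fuse`, the sigmoid gate, the weight shapes, and the random inputs are all assumptions made for illustration, not the paper's architecture.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def cross_attention_fuse(point_feats, image_feats, w_q, w_k, w_v, w_gate):
    """Hypothetical fusion step (an assumption, not the paper's CADF module).

    point_feats: (N, d) LiDAR features, used as queries.
    image_feats: (M, d) camera features, used as keys and values.
    w_q, w_k, w_v: (d, d) projection matrices.
    w_gate: (2d, d) weights of the assumed sigmoid gate.
    """
    q = point_feats @ w_q
    k = image_feats @ w_k
    v = image_feats @ w_v

    # Scaled dot-product attention: each point attends over all image features.
    scores = q @ k.T / np.sqrt(q.shape[-1])
    attn = softmax(scores, axis=-1)   # (N, M), each row sums to 1
    attended = attn @ v               # (N, d) image context per point

    # Assumed "dynamic" part: a per-channel gate decides how much of each
    # modality to keep, rather than concatenating the two feature sets.
    gate_in = np.concatenate([point_feats, attended], axis=-1)
    gate = 1.0 / (1.0 + np.exp(-(gate_in @ w_gate)))
    fused = gate * point_feats + (1.0 - gate) * attended
    return fused, attn


# Toy usage with random features and weights (illustrative sizes only).
rng = np.random.default_rng(0)
n_points, n_pixels, dim = 5, 7, 4
point_feats = rng.normal(size=(n_points, dim))
image_feats = rng.normal(size=(n_pixels, dim))
w_q, w_k, w_v = (rng.normal(size=(dim, dim)) for _ in range(3))
w_gate = rng.normal(size=(2 * dim, dim))

fused, attn = cross_attention_fuse(point_feats, image_feats, w_q, w_k, w_v, w_gate)
print(fused.shape, attn.shape)
```

In this sketch the fused output keeps the shape of the point features, so it could in principle feed a LiDAR-style detection head unchanged; whether the paper does this is not stated in the abstract.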