Publication | Closed Access
Real-Time Multi-Modal Active Vision for Object Detection on UAVs Equipped With Limited Field of View LiDAR and Camera
32
Citations
25
References
2023
Year
EngineeringField RoboticsActive VisionPoint Cloud ProcessingPoint CloudImage AnalysisUnmanned SystemUav PlatformSystems EngineeringRobot LearningVision SensorView LidarMachine VisionTime-of-flight CameraAutomatic Target RecognitionObject DetectionLimited FieldComputer ScienceComputer Vision3D VisionAerospace EngineeringUnmanned Aerial Systems
This letter aims to solve the challenging problems in multi-modal active vision for object detection on unmanned aerial vehicles (UAVs) with a monocular camera and a limited Field of View (FoV) LiDAR. The point cloud acquired from the low-cost LiDAR is firstly converted into a 3-channel tensor via motion compensation, accumulation, projection, and up-sampling processes. The generated 3-channel point cloud tensor and RGB image are fused into a 6-channel tensor using an early fusion strategy for object detection based on a Gaussian YOLO network structure. To solve the low computational resource problem and improve the real-time performance, the velocity information of the UAV is further fused with the detection results based on an extended Kalman Filter (EKF). A perception-aware model predictive control (MPC) is designed to achieve active vision on our UAV. According to our performance evaluation, our pre-processing step improves other literature methods running time by a factor of 10 while maintaining acceptable detection performance. Furthermore, our fusion architecture reaches 94.6 mAP on the test set, outperforming the individual sensor networks by roughly 5%. We also described an implementation of the overall algorithm on a UAV platform and validated it in real-world experiments.
| Year | Citations | |
|---|---|---|
Page 1
Page 1