Publication | Closed Access
Research on autonomous formation of Multi-UAV based on MADDPG algorithm
16
Citations
8
References
2022
Year
Multi-UAV autonomous formation task is one of the important research hotspots in UAV application. A typical mission scenario is constructed for the autonomous formation, maintenance and obstacle avoidance tasks of multi-UAV. Based on Multiple-Agents Deep Deterministic Policy Gradient (MADDPG) algorithm, we designed a hybrid reward distribution mechanism for autonomous formation task, which effectively solved the problem of uneven global reward distribution and individual "selfish strategy", and designed a dynamic communication strategy inside the UAV formation to reduce the computational complexity. It can effectively improve the communication efficiency between UAVs. After training, the UAV can effectively avoid the no-fly zone and efficiently perform autonomous formation flight task. The introduction of the hybrid reward mechanism improves the stability of the UAV autonomous formation task and has certain application prospects.
| Year | Citations | |
|---|---|---|
Page 1
Page 1