Publication | Closed Access
Multi-Agent Reinforcement Learning-Based Decision Making for Twin-Vehicles Cooperative Driving in Stochastic Dynamic Highway Environments
22
Citations
25
References
2023
Year
Artificial IntelligenceMulti-agent Reinforcement LearningEngineeringAgent Decision-makingEducationAutonomous SystemsIntelligent SystemsReinforcement Learning (Educational Psychology)Multi-agent LearningIntelligent Traffic ManagementIntelligent Autonomous SystemsReinforcement Learning (Computer Engineering)Systems EngineeringTransportation EngineeringMulti-agent PlanningComputer ScienceAutonomous DrivingTwin-vehicles Cooperative DrivingDeep Reinforcement LearningGeneralization AdaptationRoad Traffic Control
In the past decade, reinforcement learning (RL) has achieved encouraging results in autonomous driving, especially in well-structured and regulated highway environments. However, few researches pay attention to RL-based multiple-vehicles cooperative driving, which is much more challenging because of dynamic real-time interactions and transient scenarios. This article proposes a Multi-Agent Reinforcement Learning (MARL) based twin-vehicles cooperative driving decision making method which achieves the generalization adaptation of the RL method in highly dynamic highway environments and enhances the flexibility and effectiveness of collaborative decision making system. The proposed fair cooperative MARL method pays equal attention to the individual intelligence and the cooperative performance, and employs a stable estimation method to reduce the propagation of overestimated joint <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$Q$</tex-math></inline-formula> -values between agents. Thus, the twin-vehicles system strikes a balance between maintaining formation and free overtaking in dynamic highway environments, to intelligently adapt to different scenarios, such as heavy traffic, loose traffic, even some emergency. Targeted experiments show that our method has strong cooperative performance, also further increases the possibility of creating a harmonious driving environment.
| Year | Citations | |
|---|---|---|
Page 1
Page 1