Concepedia

Publication | Closed Access

Safe Reinforcement Learning for Autonomous Vehicle Using Monte Carlo Tree Search

92

Citations

28

References

2021

Year

Abstract

Reinforcement learning has gradually demonstrated its decision-making ability in autonomous driving. Reinforcement learning is learning how to map states to actions by interacting with environment so as to maximize the long-term reward. Within limited interactions, the learner will get a suitable driving policy according to the designed reward function. However there will be a lot of unsafe behaviors during training in traditional reinforcement learning. This paper proposes a RL-based method combined with RL agent and Monte Carlo tree search algorithm to reduce unsafe behaviors. The proposed safe reinforcement learning framework mainly consists of two modules: risk state estimation module and safe policy search module. Once the future state will be risky calculated by the risk state estimation module using current state information and the action outputted by the RL agent, the MCTS based safe policy search module will activate to guarantee a safer exploration by adding an additional reward for risk actions. We test the approach in several random overtake scenarios, resulting in faster convergence and safer behaviors compared to traditional reinforcement learning.

References

YearCitations

2015

28.8K

2016

15.5K

2000

8.7K

2000

5.3K

2000

4.4K

2016

3.1K

2012

2.9K

2008

1.5K

2007

1.2K

2015

1.2K

Page 1