Concepedia

Publication | Open Access

Modeling Car-Following Behaviors and Driving Styles with Generative Adversarial Imitation Learning

44

Citations

42

References

2020

Year

TLDR

Accurately simulating drivers’ car‑following behavior is essential for driving assistance and autonomous systems, and recent reinforcement learning approaches have shown promise, but defining reward functions manually remains challenging. This paper proposes a novel car‑following model based on generative adversarial imitation learning. The model learns from driver demonstrations using a generative adversarial imitation learning framework with gated recurrent units in an actor‑critic network, trained on millimeter‑wave radar and CAN data and classifying drivers into two styles via K‑means clustering on time‑headway features. Five‑fold cross‑validation shows the model reproduces trajectories and styles more accurately than IDM and RNN, achieving the lowest average spacing error (19.40 %) and speed validation error (5.57 %) and the lowest KL divergences for style clustering.

Abstract

Building a human-like car-following model that can accurately simulate drivers’ car-following behaviors is helpful to the development of driving assistance systems and autonomous driving. Recent studies have shown the advantages of applying reinforcement learning methods in car-following modeling. However, a problem has remained where it is difficult to manually determine the reward function. This paper proposes a novel car-following model based on generative adversarial imitation learning. The proposed model can learn the strategy from drivers’ demonstrations without specifying the reward. Gated recurrent units was incorporated in the actor-critic network to enable the model to use historical information. Drivers’ car-following data collected by a test vehicle equipped with a millimeter-wave radar and controller area network acquisition card was used. The participants were divided into two driving styles by K-means with time-headway and time-headway when braking used as input features. Adopting five-fold cross-validation for model evaluation, the results show that the proposed model can reproduce drivers’ car-following trajectories and driving styles more accurately than the intelligent driver model and the recurrent neural network-based model, with the lowest average spacing error (19.40%) and speed validation error (5.57%), as well as the lowest Kullback-Leibler divergences of the two indicators used for driving style clustering.

References

YearCitations

Page 1