Lane Change Decision-making through Deep Reinforcement Learning with Rule-based Constraints

Author(s):  
Junjie Wang ◽  
Qichao Zhang ◽  
Dongbin Zhao ◽  
Yaran Chen
2021 ◽  
Vol 31 (3) ◽  
pp. 1-26
Author(s):  
Aravind Balakrishnan ◽  
Jaeyoung Lee ◽  
Ashish Gaurav ◽  
Krzysztof Czarnecki ◽  
Sean Sedwards

Reinforcement learning (RL) is an attractive way to implement high-level decision-making policies for autonomous driving, but learning directly from a real vehicle or a high-fidelity simulator is variously infeasible. We therefore consider the problem of transfer reinforcement learning and study how a policy learned in a simple environment using WiseMove can be transferred to our high-fidelity simulator, WiseSim. WiseMove is a framework to study safety and other aspects of RL for autonomous driving, while WiseSim accurately reproduces the dynamics and software stack of our real vehicle. We find that the accurately modelled perception errors in WiseSim contribute the most to the transfer problem. Even when these errors are modelled only naively in WiseMove, the resulting RL policy performs better in WiseSim than a hand-crafted rule-based policy. Applying domain randomization to the environment in WiseMove yields an even better policy. The final RL policy reduces failures due to perception errors from 10% to 2.75%. We also observe that the RL policy relies significantly less on velocity than the rule-based policy does, having learned that its measurement is unreliable.
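The abstract does not include code, but the domain-randomization idea it describes can be sketched briefly: resample the perception-noise level at the start of every training episode so the policy cannot overfit to any one noise setting. The environment, field names, and noise model below are illustrative assumptions, not the paper's implementation.

```python
import random

class SimpleLaneEnv:
    """Toy stand-in for a simple driving environment (hypothetical)."""
    def reset(self):
        self.true_velocity = 20.0  # m/s, ground-truth ego velocity
        return {"velocity": self.true_velocity, "lane": 0}

    def step(self, action):
        obs = {"velocity": self.true_velocity, "lane": action}
        return obs, 0.0, False  # observation, reward, done

class DomainRandomizedEnv:
    """Wrapper that injects a freshly randomized level of perception
    noise each episode, so a policy trained on it learns not to rely
    on exact velocity readings."""
    def __init__(self, env, max_noise_std=2.0, seed=0):
        self.env = env
        self.max_noise_std = max_noise_std
        self.rng = random.Random(seed)

    def reset(self):
        # Resample the noise scale at the start of every episode.
        self.noise_std = self.rng.uniform(0.0, self.max_noise_std)
        return self._corrupt(self.env.reset())

    def step(self, action):
        obs, reward, done = self.env.step(action)
        return self._corrupt(obs), reward, done

    def _corrupt(self, obs):
        noisy = dict(obs)
        noisy["velocity"] += self.rng.gauss(0.0, self.noise_std)
        return noisy
```

A policy trained across many such episodes sees velocity readings of varying reliability, which is one plausible route to the reduced velocity reliance the abstract reports.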


Author(s):  
Junfeng Zhang ◽  
Qing Xue

In a tactical wargame, the decisions of the artificial intelligence (AI) commander are critical to the final combat result. Because of the fog of war, AI commanders face unknown and invisible battlefield information and an incomplete understanding of the situation, which makes it difficult to form appropriate tactical strategies. Traditional knowledge-rule-based decision-making lacks flexibility and autonomy, and how to make flexible, autonomous decisions in complex battlefield situations remains a difficult problem. This paper aims to solve the AI commander's decision-making problem using deep reinforcement learning (DRL). We develop a tactical wargame as the research environment, which contains a built-in scripted AI and supports a machine-versus-machine combat mode. On this basis, we design an end-to-end actor-critic framework for commander decision-making, in which a convolutional neural network represents the battlefield situation and reinforcement learning is used to try different tactical strategies. Finally, we carry out a combat experiment between a DRL-based agent and a rule-based agent in a jungle terrain scenario. The result shows that the AI commander adopting the actor-critic method successfully learns how to achieve a higher score in the tactical wargame, and the DRL-based agent has a higher winning ratio than the rule-based agent.
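The actor-critic update at the core of such a framework can be shown in miniature. The paper's agent uses a convolutional network over the battlefield map; in this sketch a table of per-state logits and values stands in for the networks, so only the advantage actor-critic update rule itself is illustrated. The toy task and all names are assumptions for demonstration.

```python
import numpy as np

def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()

class ActorCritic:
    """Schematic advantage actor-critic with tabular actor and critic."""

    def __init__(self, n_states, n_actions, lr_actor=0.1, lr_critic=0.1, gamma=0.95):
        self.logits = np.zeros((n_states, n_actions))  # actor parameters
        self.values = np.zeros(n_states)               # critic parameters
        self.lr_actor, self.lr_critic, self.gamma = lr_actor, lr_critic, gamma

    def act(self, state, rng):
        # Sample an action from the softmax policy.
        return rng.choice(self.logits.shape[1], p=softmax(self.logits[state]))

    def update(self, s, a, r, s_next, done):
        # Critic: one-step TD target and advantage estimate.
        target = r + (0.0 if done else self.gamma * self.values[s_next])
        advantage = target - self.values[s]
        self.values[s] += self.lr_critic * advantage
        # Actor: policy-gradient step; grad of log-softmax is (one_hot - p).
        p = softmax(self.logits[s])
        grad = -p
        grad[a] += 1.0
        self.logits[s] += self.lr_actor * advantage * grad

# Toy task: one state, action 1 yields reward 1, action 0 yields 0.
rng = np.random.default_rng(0)
ac = ActorCritic(n_states=1, n_actions=2)
for _ in range(500):
    a = ac.act(0, rng)
    ac.update(0, a, r=1.0 if a == 1 else 0.0, s_next=0, done=True)
```

After training, the actor's logits favor the rewarding action; replacing the tables with a CNN over the situation map gives the end-to-end variant the abstract describes.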


Author(s):  
Ruolan Zhang ◽  
Masao Furusho

Abstract. Because of the quality of and errors in the data itself, historical automatic identification system (AIS) data is insufficient for predicting navigation risk at sea, but it is adequate for training decision-making neural networks. This paper presents a real AIS ship-navigation environment with rule-based and neural-based decision processes using frame motion, and trains the decision network with a deep reinforcement learning algorithm. Rule-based decision-making has applications in adaptive systems, expert systems, and decision support systems; it also covers general ship navigation, which is regulated by the Convention on the International Regulations for Preventing Collisions at Sea (COLREGs). However, fully unmanned ship navigation on the open sea, without any remote control, cannot be achieved by a rule-based decision-making system alone. With growing amounts of data, complex sea environments, and varied collision scenarios, agent-based decision-making has come to play an important role in transportation, and for ships a combination of rule-based and neural-based decision-making is the only option. Satisfying the development requirements of autonomous decision-making has become progressively more challenging. This study uses deep reinforcement learning to evaluate decision-making efficiency under different AIS data input shapes. The results show that the decision neural network trained with AIS data has good robustness and a high ability to achieve collision avoidance. Furthermore, the same methodology provides instructive guidance for processing radar, camera, and electronic navigational chart (ENC) data to address different risk-perception tasks in different scenarios. These results have important implications for fully unmanned navigation.
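The "AIS data input shape" question comes down to how variable-length AIS traffic is packed into a fixed-shape observation for the decision network. As a rough sketch of one such encoding (the field names, normalization constants, and shape are assumptions, not the paper's design):

```python
import numpy as np

def ais_to_frame(records, own_ship, max_ships=8, n_features=4):
    """Encode nearby AIS records into a fixed-shape observation for a
    decision network (illustrative only).

    Each record: (lat, lon, speed_over_ground, course_over_ground).
    Positions are re-expressed relative to own ship and targets are
    sorted by distance, so the nearest ships occupy the first rows;
    unused rows stay zero-padded.
    """
    frame = np.zeros((max_ships, n_features), dtype=np.float32)
    rel = []
    for lat, lon, sog, cog in records:
        dlat, dlon = lat - own_ship[0], lon - own_ship[1]
        rel.append((dlat * dlat + dlon * dlon, dlat, dlon, sog, cog))
    rel.sort(key=lambda t: t[0])  # nearest targets first
    for i, (_, dlat, dlon, sog, cog) in enumerate(rel[:max_ships]):
        # Crude normalization so all features are on comparable scales.
        frame[i] = (dlat, dlon, sog / 30.0, cog / 360.0)
    return frame
```

Varying `max_ships` or the feature set changes the input shape fed to the network, which is the kind of comparison the study's evaluation performs.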


2020 ◽  
Vol 131 ◽  
pp. 103568
Author(s):  
Amarildo Likmeta ◽  
Alberto Maria Metelli ◽  
Andrea Tirinzoni ◽  
Riccardo Giol ◽  
Marcello Restelli ◽  
...  
