Proposal for Selecting a Cooperation Partner in Distributed Control of Traffic Signals using Deep Reinforcement Learning

Designing efficient transportation systems is crucial to saving time and money for drivers and for the economy as a whole. Traffic signals are among the most important components of traffic systems. Currently, most traffic signal systems are configured using fixed timing plans, which are based on limited vehicle count data. Past research has introduced and designed intelligent traffic signals; however, machine learning and deep learning have only recently been used in systems that aim to optimize the timing of traffic signals in order to reduce travel time. Reinforcement learning (RL), a very promising field in artificial intelligence, is a data-driven method that has shown promising results in optimizing traffic signal timing plans to reduce traffic congestion. However, model-based and centralized methods are impractical here because of the high-dimensional state-action space of complex urban traffic networks. In this paper, a model-free approach is used to optimize signal timing for multiple complex four-phase signalized intersections. We propose a multi-agent reinforcement learning (MARL) framework based on deep reinforcement learning that aims to optimize traffic flow using data from within each signalized intersection and data coming from other intersections. The proposed model combines state-of-the-art techniques such as Double Deep Q-Network and Hindsight Experience Replay (HER). HER allows our framework to learn quickly in sparse-reward settings. We tested and evaluated the proposed model in the Simulation of Urban MObility (SUMO) simulator. Our results show that the proposed method is effective in reducing congestion in both peak and off-peak times.
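
The Double Deep Q-Network technique named in this abstract is usually characterized by its learning target: the online network selects the next action and a separate target network evaluates it. The Python sketch below illustrates that target only. The function name, argument names, and discount value are illustrative assumptions, not details taken from the paper.

```python
from typing import Sequence


def double_dqn_target(
    reward: float,
    next_q_online: Sequence[float],
    next_q_target: Sequence[float],
    gamma: float = 0.99,  # assumed discount factor, not from the paper
    done: bool = False,
) -> float:
    """Return the Double DQN target for one transition.

    next_q_online: Q-values of the next state from the online network.
    next_q_target: Q-values of the next state from the target network.
    """
    if done:
        # No bootstrapping from a terminal state.
        return reward
    # The online network chooses the action ...
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    # ... and the target network supplies the value of that action.
    return reward + gamma * next_q_target[best_action]
```

Decoupling action selection from action evaluation in this way is the standard motivation for Double DQN, since it reduces the overestimation that arises when a single network does both.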

Distributed control of leader-follower systems under adversarial inputs using reinforcement learning

2017 IEEE Symposium Series on Computational Intelligence (SSCI) ◽ 10.1109/ssci.2017.8280840 ◽ 2017 ◽ Cited By ~ 1

Author(s): Rohollah Moghadam ◽ Qinglai Wei ◽ Hamidreza Modares

Keyword(s): Reinforcement Learning ◽ Distributed Control

Reinforcement Learning for Joint Control of Traffic Signals in a Transportation Network

IEEE Transactions on Vehicular Technology ◽ 10.1109/tvt.2019.2962514 ◽ 2020 ◽ Vol 69 (2) ◽ pp. 1375-1387 ◽ Cited By ~ 1

Author(s): Jincheol Lee ◽ Jiyong Chung ◽ Keemin Sohn

Keyword(s): Reinforcement Learning ◽ Transportation Network ◽ Traffic Signals ◽ Joint Control

A Distributed Control Method for Urban Networks Using Multi-Agent Reinforcement Learning Based on Regional Mixed Strategy Nash-Equilibrium

IEEE Access ◽ 10.1109/access.2020.2968937 ◽ 2020 ◽ Vol 8 ◽ pp. 19750-19766 ◽ Cited By ~ 1

Author(s): Zhaowei Qu ◽ Zhaotian Pan ◽ Yongheng Chen ◽ Xin Wang ◽ Haitao Li

Keyword(s): Reinforcement Learning ◽ Nash Equilibrium ◽ Distributed Control ◽ Mixed Strategy ◽ Control Method ◽ Urban Networks ◽ Mixed Strategy Nash Equilibrium ◽ Multi Agent ◽ Strategy Nash Equilibrium

Adaptive Traffic Signal Control Model on Intersections Based on Deep Reinforcement Learning

Journal of Advanced Transportation ◽ 10.1155/2020/6505893 ◽ 2020 ◽ Vol 2020 ◽ pp. 1-14

Author(s): Duowei Li ◽ Jianping Wu ◽ Ming Xu ◽ Ziheng Wang ◽ Kezhen Hu

Keyword(s): Reinforcement Learning ◽ Waiting Time ◽ Traffic Signals ◽ Control Model ◽ Traffic Signal ◽ Signal Control ◽ Traffic Signal Control ◽ Average Waiting Time ◽ Adaptive Traffic Signal Control ◽ Proposed Model

Controlling traffic signals to alleviate increasing traffic pressure is a concept that has received public attention for a long time. However, existing systems and methodologies for controlling traffic signals are insufficient for addressing the problem. To this end, we build an adaptive traffic signal control model in the traffic microsimulator “Simulation of Urban Mobility” (SUMO) using modern deep reinforcement learning. The model is based on a deep Q-network algorithm and explicitly represents the elements of the problem: agents, environments, and actions. The real-time state of traffic at one or more intersections, including the number of vehicles and the average speed, is used as the input to the model. To reduce the average waiting time, the agents select the traffic signal phase and duration to be implemented, in both single-intersection and multi-intersection cases. Cooperation between agents enables the model to improve overall performance in a large road network. By testing with data sets covering three different traffic conditions, we show that the proposed model outperforms the other methods tested (Q-learning, longest queue first, and Webster fixed timing control) in all cases. The proposed model reduces both the average waiting time and the travel time, and its advantage grows as the traffic environment becomes more complex.
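
The abstract describes a state made of vehicle counts and average speeds, and actions that pair a signal phase with a duration. The Python sketch below shows one way such a state vector and an epsilon-greedy choice over (phase, duration) pairs could look. The phase names, the duration values, the helper names, and the use of epsilon-greedy exploration are all hypothetical choices made for illustration; the paper's actual encoding and exploration scheme are not given in this listing.

```python
import random
from itertools import product

# Hypothetical action set: every (phase, green duration in seconds) pair.
PHASES = ["NS_green", "EW_green"]
DURATIONS = [10, 20, 30]
ACTIONS = list(product(PHASES, DURATIONS))


def build_state(vehicle_counts, mean_speeds):
    """Concatenate per-approach vehicle counts and mean speeds into one vector."""
    return list(vehicle_counts) + list(mean_speeds)


def select_action(q_values, epsilon, rng):
    """Epsilon-greedy choice: explore with probability epsilon, else act greedily.

    q_values holds one estimated value per entry of ACTIONS.
    Returns an index into ACTIONS.
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])
```

In a full agent, `q_values` would come from the deep Q-network evaluated on the output of `build_state`, and the chosen index would be mapped through `ACTIONS` to the phase and duration sent to the simulator.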

Event-Triggered Distributed Control of Nonlinear Interconnected Systems Using Online Reinforcement Learning With Exploration

IEEE Transactions on Cybernetics ◽ 10.1109/tcyb.2017.2741342 ◽ 2018 ◽ Vol 48 (9) ◽ pp. 2510-2519 ◽ Cited By ~ 23

Author(s): Vignesh Narayanan ◽ Sarangapani Jagannathan

Keyword(s): Reinforcement Learning ◽ Distributed Control ◽ Interconnected Systems ◽ Event Triggered

Reinforcement learning vs. rule-based adaptive traffic signal control: A Fourier basis linear function approximation for traffic signal control

AI Communications ◽ 10.3233/aic-201580 ◽ 2021 ◽ pp. 1-15

Author(s): Theresa Ziemke ◽ Lucas N. Alegre ◽ Ana L.C. Bazzan

Keyword(s): Reinforcement Learning ◽ State Space ◽ Function Approximation ◽ Traffic Signals ◽ The State ◽ Signal Control ◽ Traffic Signal Control ◽ Rule Based ◽ Fourier Basis ◽ Linear Function Approximation

Reinforcement learning is an efficient, widely used machine learning technique that performs well when the state and action spaces have a reasonable size. This is rarely the case in control problems such as traffic signal control, where the state space can be very large. To deal with the curse of dimensionality, a coarse discretization of the state space can be employed, but this is effective only up to a point. One way to mitigate the problem is to use techniques that generalize over the state space, such as function approximation. In this paper, linear function approximation is used. Specifically, SARSA(λ) with Fourier basis features is implemented to control traffic signals in the agent-based transport simulation MATSim. The results are compared not only to trivial controllers such as fixed-time control, but also to state-of-the-art rule-based adaptive methods. It is concluded that SARSA(λ) with Fourier basis features is able to outperform such methods, especially in scenarios with varying traffic demands or unexpected events.
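
For readers unfamiliar with Fourier basis features, the sketch below shows the standard construction: for a state scaled to the unit cube, each feature is the cosine of π times the dot product of the state with an integer coefficient vector. It is a minimal Python illustration under assumptions. The function names and the choice to enumerate all coefficient vectors up to a given order are not taken from the paper, which may scale states or prune coefficients differently.

```python
import itertools
import math


def fourier_features(state, order):
    """Fourier basis features for a state whose components lie in [0, 1].

    Produces one feature cos(pi * c . state) for every integer coefficient
    vector c with entries in 0..order, so (order + 1) ** len(state) in total.
    """
    coefficient_vectors = itertools.product(range(order + 1), repeat=len(state))
    return [
        math.cos(math.pi * sum(c_i * s_i for c_i, s_i in zip(c, state)))
        for c in coefficient_vectors
    ]


def linear_value(weights, features):
    """Linear function approximation: the value is the dot product w . phi."""
    return sum(w * f for w, f in zip(weights, features))
```

With linear approximation, a learner such as SARSA(λ) keeps one weight vector per action and updates it along these features, which is what lets it generalize across nearby states instead of storing a table entry for each one.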
