Spatial-Temporal Traffic Flow Control on Motorways Using Distributed Multi-Agent Reinforcement Learning

The prevailing variable speed limit (VSL) systems as an effective strategy for traffic control on motorways have the disadvantage that they only work with static VSL zones. Under changing traffic conditions, VSL systems with static VSL zones may perform suboptimally. Therefore, the adaptive design of VSL zones is required in traffic scenarios where congestion characteristics vary widely over space and time. To address this problem, we propose a novel distributed spatial-temporal multi-agent VSL (DWL-ST-VSL) approach capable of dynamically adjusting the length and position of VSL zones to complement the adjustment of speed limits in current VSL control systems. To model DWL-ST-VSL, distributed W-learning (DWL), a reinforcement learning (RL)-based algorithm for collaborative agent-based self-optimization toward multiple policies, is used. Each agent uses RL to learn local policies, thereby maximizing travel speed and eliminating congestion. In addition to local policies, through the concept of remote policies, agents learn how their actions affect their immediate neighbours and which policy or action is preferred in a given situation. To assess the impact of deploying additional agents in the control loop and the different cooperation levels on the control process, DWL-ST-VSL is evaluated in a four-agent configuration (DWL4-ST-VSL). This evaluation is done via SUMO microscopic simulations using collaborative agents controlling four segments upstream of the congestion in traffic scenarios with medium and high traffic loads. DWL also allows for heterogeneity in agents’ policies; cooperating agents in DWL4-ST-VSL implement two speed limit sets with different granularity. DWL4-ST-VSL outperforms all baselines (W-learning-based VSL and simple proportional speed control), which use static VSL zones. Finally, our experiments yield insights into the new concept of VSL control. This may trigger further research on using advanced learning-based technology to design a new generation of adaptive traffic control systems to meet the requirements of operating in a nonstationary environment and at the leading edge of emerging connected and autonomous vehicles in general.

Download Full-text

Assessing the mobility benefits of proactive optimal variable speed limit control during recurrent and non-recurrent congestion

Canadian Journal of Civil Engineering ◽

10.1139/cjce-2013-0427 ◽

2015 ◽

Vol 42 (7) ◽

pp. 477-489 ◽

Cited By ~ 4

Author(s):

Ying Luo ◽

M. Hadiuzzaman ◽

Jie Fang ◽

Tony Z. Qiu

Keyword(s):

Traffic Control ◽

Propagation Speed ◽

Speed Limit ◽

Variable Speed ◽

Variable Speed Limit ◽

Detection Delay ◽

Traffic Performance ◽

Total Travel Time ◽

Variable Speed Limit Control ◽

The Impact

Over the past few decades, several active traffic control methods have been proposed to improve freeway efficiency at bottleneck locations. Variable speed limit (VSL) is one of these effective controls. Previous studies have evaluated VSL control, but primarily during recurrent congestion only. This study focuses on evaluating the performance of VSL control for both recurrent and non-recurrent congestion. To assess the effectiveness of a previously proposed VSL control in a real-world situation, this study has three evaluation objectives: (1) examine the control performance when recurrent and (or) non-recurrent congestion occurs; (2) assess the effectiveness of the control when a queue encounters the VSL sign; and (3) consider the impact of system detection delay in VSL control. Comparative experiments for Whitemud Drive in Edmonton, Alberta, Canada, are simulated in the VISSIM platform, and traffic performance is compared among scenarios with and without control. The simulation results show that VSL improves mobility for both recurrent and non-recurrent congestion. The VSL control reduces total travel time, and improves total travel distance and total flow. Furthermore, it slows down the shockwave propagation speed, improves the average speed on most of the freeway segments, and reduces the duration of traffic recovery.

Download Full-text

Urban Traffic Control in Software Defined Internet of Things via a Multi-Agent Deep Reinforcement Learning Approach

IEEE Transactions on Intelligent Transportation Systems ◽

10.1109/tits.2020.3023788 ◽

2020 ◽

pp. 1-13

Author(s):

Jiachen Yang ◽

Jipeng Zhang ◽

Huihui Wang

Keyword(s):

Reinforcement Learning ◽

Internet Of Things ◽

Traffic Control ◽

Urban Traffic ◽

Learning Approach ◽

Multi Agent ◽

Urban Traffic Control

Download Full-text

Multi-agent Reinforcement Learning for Control Systems: Challenges and Proposals

Intelligent Data Engineering and Automated Learning – IDEAL 2015 - Lecture Notes in Computer Science ◽

10.1007/978-3-319-24834-9_3 ◽

2015 ◽

pp. 18-25

Author(s):

Manuel Graña ◽

Borja Fernandez-Gauna

Keyword(s):

Reinforcement Learning ◽

Control Systems ◽

Multi Agent

Download Full-text

Reducing Bus Bunching with Asynchronous Multi-Agent Reinforcement Learning

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/60 ◽

2021 ◽

Author(s):

Jiawei Wang ◽

Lijun Sun

Keyword(s):

Reinforcement Learning ◽

Traffic Control ◽

Inductive Learning ◽

Traffic Conditions ◽

Passenger Demand ◽

Smart Card Data ◽

Proposed Model ◽

Bus Bunching ◽

Multi Agent ◽

Bus Services

The bus system is a critical component of sustainable urban transportation. However, due to the significant uncertainties in passenger demand and traffic conditions, bus operation is unstable in nature and bus bunching has become a common phenomenon that undermines the reliability and efficiency of bus services. Despite recent advances in multi-agent reinforcement learning (MARL) on traffic control, little research has focused on bus fleet control due to the tricky asynchronous characteristic---control actions only happen when a bus arrives at a bus stop and thus agents do not act simultaneously. In this study, we formulate route-level bus fleet control as an asynchronous multi-agent reinforcement learning (ASMR) problem and extend the classical actor-critic architecture to handle the asynchronous issue. Specifically, we design a novel critic network to effectively approximate the marginal contribution for other agents, in which graph attention neural network is used to conduct inductive learning for policy evaluation. The critic structure also helps the ego agent optimize its policy more efficiently. We evaluate the proposed framework on real-world bus services and actual passenger demand derived from smart card data. Our results show that the proposed model outperforms both traditional headway-based control methods and existing MARL methods.

Download Full-text

The Impact of Automation on Air Traffic Control Systems

Automation and Systems Issues in Air Traffic Control ◽

10.1007/978-3-642-76556-8_1 ◽

1991 ◽

pp. 3-19 ◽

Cited By ~ 5

Author(s):

V. David Hopkin

Keyword(s):

Control Systems ◽

Traffic Control ◽

Air Traffic Control ◽

Air Traffic ◽

The Impact

Download Full-text

Urban Traffic Control Using Distributed Multi-agent Deep Reinforcement Learning

PRICAI 2019: Trends in Artificial Intelligence - Lecture Notes in Computer Science ◽

10.1007/978-3-030-29894-4_28 ◽

2019 ◽

pp. 337-349

Author(s):

Shunya Kitagawa ◽

Ahmed Moustafa ◽

Takayuki Ito

Keyword(s):

Reinforcement Learning ◽

Traffic Control ◽

Urban Traffic ◽

Multi Agent ◽

Urban Traffic Control

Download Full-text

Extended Variable Speed Limit control using Multi-agent Reinforcement Learning

2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC) ◽

10.1109/itsc45102.2020.9294639 ◽

2020 ◽

Author(s):

Kresimir Kusic ◽

Ivana Dusparic ◽

Maxime Gueriau ◽

Martin Greguric ◽

Edouard Ivanjko

Keyword(s):

Reinforcement Learning ◽

Speed Limit ◽

Variable Speed ◽

Variable Speed Limit ◽

Multi Agent ◽

Variable Speed Limit Control

Download Full-text

A Study on the Merits for Coordinated Use of Ramp Metering and Variable Speed Limit Traffic Control Systems

The Open Transportation Journal ◽

10.2174/1874447801812010230 ◽

2018 ◽

Vol 12 (1) ◽

pp. 230-245

Author(s):

Mehdi Fallah Tafti

Keyword(s):

Control Systems ◽

Traffic Control ◽

Network Performance ◽

Synergistic Effects ◽

Ramp Metering ◽

Speed Limit ◽

Variable Speed ◽

Microscopic Simulation ◽

Variable Speed Limit ◽

Simulated Network

Aim:The aim of this research was to investigate the merits for further improvements of traffic operation on freeways and expressways through coordinated use of Ramp Metering and Variable Speed limit (VSL) control systems.Methods:In this research, the widely used ALINEA Ramp Metering strategy was coordinated with a rule-based VSL strategy so that the total flow entered from the upstream freeway and entry ramp is maintained below the merge downstream capacity. The developed algorithm was then examined on a freeway network comprising two merge and one diverge sections, using VISSIM microscopic simulation model. The performance of the simulated network was examined under three scenarios namely, No-control, Ramp Metering only and Ramp Metering plus VSL controls. The network performance under each scenario was then assessed and compared using three measures of performance namely, average travel time, overall delay and freeway throughput. The ANOVA test was used to analyze and compare the impacts of specified scenarios.Results:The results indicated that the best performance is achieved under coordinated Ramp Metering plus VSL scenario as it produced a significantly better performance in comparison with the other two scenarios.Conclusion:The results can be attributed to the synergistic effects of coordinated and integrated use of these control systems on the freeway network and therefore, coordination of such systems is recommended.

Download Full-text