scholarly journals A Day-to-Day Route Choice Model Based on Reinforcement Learning

2014 ◽  
Vol 2014 ◽  
pp. 1-19 ◽  
Author(s):  
Fangfang Wei ◽  
Shoufeng Ma ◽  
Ning Jia

Day-to-day traffic dynamics are generated by individual traveler’s route choice and route adjustment behaviors, which are appropriate to be researched by using agent-based model and learning theory. In this paper, we propose a day-to-day route choice model based on reinforcement learning and multiagent simulation. Travelers’ memory, learning rate, and experience cognition are taken into account. Then the model is verified and analyzed. Results show that the network flow can converge to user equilibrium (UE) if travelers can remember all the travel time they have experienced, but which is not necessarily the case under limited memory; learning rate can strengthen the flow fluctuation, but memory leads to the contrary side; moreover, high learning rate results in the cyclical oscillation during the process of flow evolution. Finally, both the scenarios of link capacity degradation and random link capacity are used to illustrate the model’s applications. Analyses and applications of our model demonstrate the model is reasonable and useful for studying the day-to-day traffic dynamics.

Author(s):  
Anthony Chen ◽  
Panatda Kasikitwiwat ◽  
Zhaowang Ji

Recently, there has been renewed interest in improving the logit-based route choice model because of the importance of the route choice model in intelligent transportation systems applications, particularly the applications of advanced traveler information systems. The paired combinatorial logit (PCL) model and its equivalent mathematical programming formulation for the route choice problem have been studied. An algorithm based on the partial linearization method is presented for solving the PCL stochastic user equilibrium problem. Detailed examples are provided to explain how this hierarchical logit model resolves the overlapping problem through the similarity index while still accounting for both congestion and stochastic effects in the mathematical programming formulation.


2011 ◽  
Vol 130-134 ◽  
pp. 3716-3720
Author(s):  
Yi Ran Cheng ◽  
Yin Han ◽  
Xin Kai Jiang ◽  
Jia Lei Gu

Considering the un-deterministic transportation networks, the paper proposes the change of the route choice decisions under the stochastic transportation networks. The route choice behavior is described as a choice for a time shortest route which is subject to a time-reliability level. The paper also considered this new route choice behavior in the stochastic user equilibrium model, and proposed stochastic user equilibrium model based on the optimized reliability travel time route choice behavior in the stochastic networks. The equivalence and uniqueness of the solution of the model are demonstrated. Numerical results of a small network show that the proposed model can reflect the real traveler’s route choice behavior in stochastic transportation networks.


2021 ◽  
Vol 2021 ◽  
pp. 1-15
Author(s):  
Lingjuan Chen ◽  
Yu Wang ◽  
Dongfang Ma

Accurate prediction of travellers’ day-to-day departure time and route choice is critical in advanced traffic management systems. There have been several related works about route choice with the assumption that the departure time for individual travellers is known beforehand. With real-time traffic state information provided by navigation systems and previous historical experience, travellers will dynamically update their departure time, which is neglected in existing works. In this study, we aim to describe travellers’ spatial-temporary choice behaviour taking navigation information into account and propose a bounded-rational day-to-day dynamic learning and adjustment model. The new model contains three steps. First, the real-time navigation guidance on each discrete day is obtained, and the self-learned experience of travellers’ choices with navigation information is presented; then, the day-to-day revision process of the choices is derived to maximize departure and route choice prospect; next, by aggregating each individual’s behaviour and calculating route choice probability, a bounded-rational continuous day-to-day dynamic model is provided. Numerical experiments suggest that the proposed model converges to a spatial-temporal oscillating equilibrium not a fixed-point stable status, and the final equilibrium trend is different from classical user equilibrium. The findings of the study are helpful to improve the prediction accuracy of traffic state in urban street networks.


Sign in / Sign up

Export Citation Format

Share Document