Discretionary Lane Change Decision Making using Reinforcement Learning with Model-Based Exploration

Reinforcement learning and decision-making (RLDM) provide a quantitative framework and computational theories with which we can disentangle psychiatric conditions into the basic dimensions of neurocognitive functioning. RLDM offer a novel approach to assessing and potentially diagnosing psychiatric patients, and there is growing enthusiasm for both RLDM and computational psychiatry among clinical researchers. Such a framework can also provide insights into the brain substrates of particular RLDM processes, as exemplified by model-based analysis of data from functional magnetic resonance imaging (fMRI) or electroencephalography (EEG). However, researchers often find the approach too technical and have difficulty adopting it for their research. Thus, a critical need remains to develop a user-friendly tool for the wide dissemination of computational psychiatric methods. We introduce an R package called hBayesDM (hierarchical Bayesian modeling of Decision-Making tasks), which offers computational modeling of an array of RLDM tasks and social exchange games. The hBayesDM package offers state-of-the-art hierarchical Bayesian modeling, in which both individual and group parameters (i.e., posterior distributions) are estimated simultaneously in a mutually constraining fashion. At the same time, the package is extremely user-friendly: users can perform computational modeling, output visualization, and Bayesian model comparisons, each with a single line of code. Users can also extract the trial-by-trial latent variables (e.g., prediction errors) required for model-based fMRI/EEG. With the hBayesDM package, we anticipate that anyone with minimal knowledge of programming can take advantage of cutting-edge computational-modeling approaches to investigate the processes underlying, and the interactions between, multiple decision-making (e.g., goal-directed, habitual, and Pavlovian) systems. In this way, we expect that the hBayesDM package will contribute to the dissemination of advanced modeling approaches and enable a wide range of researchers to easily perform computational psychiatric research within different populations.
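
To make "trial-by-trial latent variables (e.g., prediction errors)" concrete, here is a minimal Python sketch of a delta-rule (Rescorla-Wagner style) learner. It is not the hBayesDM API, and it does no hierarchical Bayesian estimation. The function name `prediction_errors` and the parameters `alpha` (learning rate) and `q0` (initial value) are assumptions made for illustration. It only shows how a prediction-error series of the kind used as a model-based fMRI/EEG regressor can be derived from choices and rewards once the learning rate is fixed.

```python
from typing import List, Sequence, Tuple


def prediction_errors(
    choices: Sequence[int],
    rewards: Sequence[float],
    n_options: int = 2,
    alpha: float = 0.5,   # hypothetical learning rate, fixed here rather than estimated
    q0: float = 0.0,      # hypothetical initial value for every option
) -> Tuple[List[float], List[float]]:
    """Return (trial-by-trial prediction errors, final option values)."""
    if len(choices) != len(rewards):
        raise ValueError("choices and rewards must have the same length")

    values = [q0] * n_options
    errors: List[float] = []
    for choice, reward in zip(choices, rewards):
        # Prediction error: obtained reward minus the current expectation.
        delta = reward - values[choice]
        # Delta rule: move the chosen option's value toward the reward.
        values[choice] += alpha * delta
        errors.append(delta)
    return errors, values


if __name__ == "__main__":
    errors, values = prediction_errors(choices=[0, 0, 1], rewards=[1.0, 1.0, 0.0])
    print(errors, values)
```

In a real analysis the learning rate would be estimated per subject (and, in a hierarchical model, constrained by group-level parameters) rather than fixed as it is here.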

Reinforcement Learning with Data Augmentation for Lane Change Decision-Making

Journal of Institute of Control Robotics and Systems ◽ 10.5302/j.icros.2021.21.0064 ◽ 2021 ◽ Vol 27 (8) ◽ pp. 572-577

Author(s): Min-Seong Kim ◽ Gyuho Eoh ◽ Tae-Hyoung Park

Keyword(s): Decision Making ◽ Reinforcement Learning ◽ Data Augmentation ◽ Lane Change

Reinforcement Learning based Lane Change Decision-Making with Imaginary Sampling

2019 IEEE Symposium Series on Computational Intelligence (SSCI) ◽ 10.1109/ssci44817.2019.9003029 ◽ 2019

Author(s): Dong Li ◽ Dongbin Zhao ◽ Qichao Zhang

Keyword(s): Decision Making ◽ Reinforcement Learning ◽ Lane Change

Lane Change Decision-making through Deep Reinforcement Learning with Rule-based Constraints

2019 International Joint Conference on Neural Networks (IJCNN) ◽ 10.1109/ijcnn.2019.8852110 ◽ 2019

Cited By ~ 5

Author(s): Junjie Wang ◽ Qichao Zhang ◽ Dongbin Zhao ◽ Yaran Chen

Keyword(s): Decision Making ◽ Reinforcement Learning ◽ Lane Change ◽ Rule Based

Highway Lane Change Decision-Making via Attention-Based Deep Reinforcement Learning

IEEE/CAA Journal of Automatica Sinica ◽ 10.1109/jas.2021.1004395 ◽ 2022 ◽ Vol 9 (3) ◽ pp. 567-569

Author(s): Junjie Wang ◽ Qichao Zhang ◽ Dongbin Zhao

Keyword(s): Decision Making ◽ Reinforcement Learning ◽ Lane Change

Model based planners reflect on their model-free propensities

PLoS Computational Biology ◽ 10.1371/journal.pcbi.1008552 ◽ 2021 ◽ Vol 17 (1) ◽ pp. e1008552

Author(s): Rani Moran ◽ Mehdi Keramati ◽ Raymond J. Dolan

Keyword(s): Decision Making ◽ Reinforcement Learning ◽ Drug Abuse ◽ Learning Theory ◽ Planning Model ◽ Present Evidence ◽ Model Based ◽ Model Free ◽ Short And Long Term

Dual-reinforcement learning theory proposes that behaviour is under the tutelage of a retrospective, value-caching, model-free (MF) system and a prospective-planning, model-based (MB) system. This architecture raises a question as to the degree to which, when devising a plan, a MB controller takes account of influences from its MF counterpart. We present evidence that such a sophisticated self-reflective MB planner incorporates an anticipation of the influences its own MF proclivities exert on the execution of its planned future actions. Using a novel bandit task, wherein subjects were periodically allowed to design their environment, we show that reward assignments were constructed in a manner consistent with a MB system taking account of its MF propensities. Thus, in the task, participants assigned higher rewards to bandits that were momentarily associated with stronger MF tendencies. Our findings have implications for a range of decision-making domains that includes drug abuse, pre-commitment, and the tension between short- and long-term decision horizons in economics.
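
The contrast between a value-caching MF system and a prospective MB system can be illustrated with a generic Python sketch. This is a textbook-style illustration, not the model or the bandit task used in the paper. The function names `mf_update` and `mb_value`, the learning rate `alpha`, and the toy transition model are all assumptions introduced here.

```python
from typing import Dict


def mf_update(cached_value: float, reward: float, alpha: float = 0.1) -> float:
    """Model-free: retrospectively nudge a cached value toward the last reward."""
    return cached_value + alpha * (reward - cached_value)


def mb_value(transition_probs: Dict[str, float], state_values: Dict[str, float]) -> float:
    """Model-based: prospectively compute an action's expected value
    from a (hypothetical) transition model and estimated outcome values."""
    return sum(prob * state_values[state] for state, prob in transition_probs.items())


if __name__ == "__main__":
    # MF: the cached value only changes after reward has been experienced.
    print(mf_update(cached_value=0.0, reward=1.0, alpha=0.5))

    # MB: the value is recomputed from the model, without new experience.
    print(mb_value({"A": 0.75, "B": 0.25}, {"A": 1.0, "B": 0.0}))
```

The MF estimate depends on the history of experienced rewards, whereas the MB estimate changes as soon as the model or the outcome values change. The self-reflective planner described in the abstract would additionally have to anticipate how MF tendencies bias its future choices, which this sketch does not attempt.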
