Hierarchical reinforcement learning and decision making

Human-robot interactive decision-making is becoming increasingly ubiquitous, and explainability is an influential factor in determining reliance on autonomy. However, it is not reasonable to trust systems beyond our comprehension, and typical machine learning and data-driven decision-making are black-box paradigms that impede explainability. Therefore, it is critical to establish computationally efficient decision-making mechanisms enhanced by explainability-aware strategies. To this end, we propose Trustworthy Decision-Making (TDM), an explainable neuro-symbolic approach that integrates symbolic planning into hierarchical reinforcement learning. The TDM framework provides subtask-level explainability through causally related, understandable subtasks. TDM also demonstrates the advantage of integrating symbolic planning with reinforcement learning, reaping the benefits of both. Experimental results validate the effectiveness of the proposed method while improving the explainability of the decision-making process.

Hierarchical Reinforcement Learning for Autonomous Decision Making and Motion Planning of Intelligent Vehicles

IEEE Access ◽

10.1109/access.2020.3034225 ◽

2020 ◽

Vol 8 ◽

pp. 209776-209789

Author(s):

Yang Lu ◽

Xin Xu ◽

Xinglong Zhang ◽

Lilin Qian ◽

Xing Zhou

Keyword(s):

Decision Making ◽

Reinforcement Learning ◽

Motion Planning ◽

Intelligent Vehicles ◽

Hierarchical Reinforcement Learning ◽

Autonomous Decision

Human-Like Decision Making: Document-level Aspect Sentiment Classification via Hierarchical Reinforcement Learning

10.18653/v1/d19-1560 ◽

2019 ◽

Author(s):

Jingjing Wang ◽

Changlong Sun ◽

Shoushan Li ◽

Jiancheng Wang ◽

Luo Si ◽

...

Keyword(s):

Decision Making ◽

Reinforcement Learning ◽

Sentiment Classification ◽

Hierarchical Reinforcement Learning ◽

Document Level ◽

Level Aspect

Deep hierarchical reinforcement learning in a Markov game applied to fishery management decision making

2020 IEEE Symposium Series on Computational Intelligence (SSCI) ◽

10.1109/ssci47803.2020.9308606 ◽

2020 ◽

Author(s):

Poiron-Guidoni Nicolas ◽

Bisgambiglia Paul-Antoine

Keyword(s):

Decision Making ◽

Reinforcement Learning ◽

Fishery Management ◽

Management Decision ◽

Hierarchical Reinforcement Learning ◽

Markov Game ◽

Management Decision Making

Model-based hierarchical reinforcement learning and human action control

Philosophical Transactions of the Royal Society B Biological Sciences ◽

10.1098/rstb.2013.0480 ◽

2014 ◽

Vol 369 (1655) ◽

pp. 20130480 ◽

Cited By ~ 63

Author(s):

Matthew Botvinick ◽

Ari Weinstein

Keyword(s):

Decision Making ◽

Reinforcement Learning ◽

Hierarchical Model ◽

Human Action ◽

Action Control ◽

Computational Framework ◽

Hierarchical Reinforcement Learning ◽

Model Based ◽

Human Decision ◽

Areas Of Interest

Recent work has reawakened interest in goal-directed or ‘model-based’ choice, where decisions are based on prospective evaluation of potential action outcomes. Concurrently, there has been growing attention to the role of hierarchy in decision-making and action control. We focus here on the intersection between these two areas of interest, considering the topic of hierarchical model-based control. To characterize this form of action control, we draw on the computational framework of hierarchical reinforcement learning, using this to interpret recent empirical findings. The resulting picture reveals how hierarchical model-based mechanisms might play a special and pivotal role in human decision-making, dramatically extending the scope and complexity of human behaviour.

Hierarchical reinforcement learning and decision making for intelligent machines

Proceedings of the 1994 IEEE International Conference on Robotics and Automation ◽

10.1109/robot.1994.351014 ◽

2002 ◽

Cited By ~ 2

Author(s):

P. Lima ◽

G. Saridis

Keyword(s):

Decision Making ◽

Reinforcement Learning ◽

Hierarchical Reinforcement Learning ◽

Intelligent Machines

Hierarchical reinforcement learning for self-driving decision-making without reliance on labelled driving data

IET Intelligent Transport Systems ◽

10.1049/iet-its.2019.0317 ◽

2020 ◽

Vol 14 (5) ◽

pp. 297-305 ◽

Cited By ~ 5

Author(s):

Jingliang Duan ◽

Shengbo Eben Li ◽

Yang Guan ◽

Qi Sun ◽

Bo Cheng

Keyword(s):

Decision Making ◽

Reinforcement Learning ◽

Hierarchical Reinforcement Learning

PEORL: Integrating Symbolic Planning and Hierarchical Reinforcement Learning for Robust Decision-Making

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/675 ◽

2018 ◽

Cited By ~ 12

Author(s):

Fangkai Yang ◽

Daoming Lyu ◽

Bo Liu ◽

Steven Gustafson

Keyword(s):

Decision Making ◽

Reinforcement Learning ◽

Autonomous Agents ◽

Dynamic Environment ◽

Unified Framework ◽

Task Execution ◽

Policy Search ◽

Hierarchical Reinforcement Learning ◽

Improve Planning ◽

Symbolic Planning

Reinforcement learning and symbolic planning have both been used to build intelligent autonomous agents. Reinforcement learning relies on learning from interactions with the real world, which often requires an infeasibly large amount of experience. Symbolic planning relies on manually crafted symbolic knowledge, which may not be robust to domain uncertainties and changes. In this paper, we present a unified framework, PEORL, that integrates symbolic planning with hierarchical reinforcement learning (HRL) to cope with decision-making in dynamic environments with uncertainties. Symbolic plans are used to guide the agent's task execution and learning, and the learned experience is fed back to symbolic knowledge to improve planning. This method leads to rapid policy search and robust symbolic plans in complex domains. The framework is tested on benchmark domains of HRL.
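
The plan-execute-feedback loop described in this abstract can be illustrated with a toy sketch. This is not the PEORL algorithm itself: the planner here is plain Dijkstra search over a hand-written graph, and the names `plan`, `learn`, `prior_costs`, and `true_costs` are hypothetical. Executing a subtask is reduced to looking up its true cost, standing in for a learned sub-policy.

```python
import heapq


def plan(costs, start, goal):
    """Return the cheapest sequence of subtasks (edges) from start to goal."""
    frontier = [(0.0, start, [])]
    seen = set()
    while frontier:
        total, state, path = heapq.heappop(frontier)
        if state == goal:
            return path
        if state in seen:
            continue
        seen.add(state)
        for (src, dst), cost in costs.items():
            if src == state:
                heapq.heappush(frontier, (total + cost, dst, path + [(src, dst)]))
    return None


def learn(prior_costs, true_costs, start, goal, episodes=20, lr=0.5):
    """Alternate planning and execution, feeding experienced costs back to the planner."""
    costs = dict(prior_costs)
    for _ in range(episodes):
        for subtask in plan(costs, start, goal):
            # Assumption: executing a subtask reveals its true cost directly.
            experienced = true_costs[subtask]
            costs[subtask] += lr * (experienced - costs[subtask])
    return costs, plan(costs, start, goal)


# Hand-crafted symbolic knowledge underestimates the cost of subtask B -> G.
prior_costs = {("A", "B"): 1.0, ("B", "G"): 1.0, ("A", "C"): 2.0, ("C", "G"): 2.0}
true_costs = {("A", "B"): 1.0, ("B", "G"): 10.0, ("A", "C"): 2.0, ("C", "G"): 2.0}

learned_costs, final_plan = learn(prior_costs, true_costs, "A", "G")
print(final_plan)
```

In this toy domain the first plan goes through B, the costly experience raises the estimate for B -> G, and later plans switch to the route through C.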

Tactical Decision-Making in Autonomous Driving by Reinforcement Learning with Uncertainty Estimation

2020 IEEE Intelligent Vehicles Symposium (IV) ◽

10.1109/iv47402.2020.9304614 ◽

2020 ◽

Author(s):

Carl-Johan Hoel ◽

Krister Wolff ◽

Leo Laine

Keyword(s):

Decision Making ◽

Reinforcement Learning ◽

Autonomous Driving ◽

Uncertainty Estimation ◽

Tactical Decision

Individual differences in experienced and observational decision-making illuminate interactions between reinforcement learning and declarative memory

Scientific Reports ◽

10.1038/s41598-021-85322-2 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Batel Yifrah ◽

Ayelet Ramaty ◽

Genela Morris ◽

Avi Mendelsohn

Keyword(s):

Decision Making ◽

Reinforcement Learning ◽

Declarative Memory ◽

Contextual Information ◽

Memory Performance ◽

Relevant Information ◽

Subjective Memory ◽

Types Of Information ◽

Reinforcement Learning Models ◽

Implicit And Explicit

Decision making can be shaped both by trial-and-error experiences and by memory of unique contextual information. Moreover, these types of information can be acquired either by means of active experience or by observing others behave in similar situations. The interactions between reinforcement learning parameters that inform decision updating and memory formation of declarative information in experienced and observational learning settings are, however, unknown. In the current study, participants took part in a probabilistic decision-making task involving situations that either yielded similar outcomes to those of an observed player or opposed them. By fitting alternative reinforcement learning models to each subject, we discerned participants who learned similarly from experience and observation from those who assigned different weights to learning signals from these two sources. Participants who assigned different weights to their own experience versus those of others displayed enhanced memory performance as well as subjective memory strength for episodes involving significant reward prospects. Conversely, memory performance of participants who did not prioritize their own experience over others did not seem to be influenced by reinforcement learning parameters. These findings demonstrate that interactions between implicit and explicit learning systems depend on the means by which individuals weigh relevant information conveyed via experience and observation.
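
One way to picture "different weights to learning signals from these two sources" is a delta-rule learner with separate learning rates for experienced and observed outcomes. The sketch below is an assumption for illustration, not the models fitted in the study; `DualRateLearner`, `alpha_exp`, and `alpha_obs` are hypothetical names, and the trial sequence is made up.

```python
from dataclasses import dataclass


@dataclass
class DualRateLearner:
    alpha_exp: float    # learning rate for the participant's own outcomes
    alpha_obs: float    # learning rate for outcomes observed in another player
    value: float = 0.5  # current estimate of the reward probability

    def update(self, reward: float, observed: bool) -> float:
        """Apply a delta-rule update, choosing the rate by the outcome's source."""
        alpha = self.alpha_obs if observed else self.alpha_exp
        self.value += alpha * (reward - self.value)
        return self.value


def run(learner, trials):
    """Feed a sequence of (reward, observed) trials and return the final value."""
    for reward, observed in trials:
        learner.update(reward, observed)
    return learner.value


# Made-up trials: (reward, whether the outcome was observed rather than experienced).
trials = [(1.0, False), (0.0, True), (1.0, False), (1.0, True)]

equal_weights = run(DualRateLearner(alpha_exp=0.3, alpha_obs=0.3), trials)
own_weighted = run(DualRateLearner(alpha_exp=0.3, alpha_obs=0.1), trials)
print(equal_weights, own_weighted)
```

Setting `alpha_exp` equal to `alpha_obs` gives a learner that treats both sources alike, while unequal rates give one that prioritizes one source, so comparing the fit of the two variants per subject is one plausible way to separate the two groups.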
