CHQ: A Multi-Agent Reinforcement Learning Scheme for Partially Observable Markov Decision Processes

IEICE Transactions on Information and Systems ◽

10.1093/ietisy/e88-d.5.1004 ◽

2005 ◽

Vol E88-D (5) ◽

pp. 1004-1011 ◽

Author(s):

H. OSADA

Keyword(s):

Reinforcement Learning ◽

Markov Decision Processes ◽

Decision Processes ◽

Learning Scheme ◽

Markov Decision ◽

Multi Agent ◽

Partially Observable Markov ◽

Partially Observable

Download Full-text

CHQ: a multi-agent reinforcement learning scheme for partially observable markov decision processes

Proceedings. IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2004. (IAT 2004). ◽

10.1109/iat.2004.1342918 ◽

2004 ◽

Author(s):

H. Osada ◽

S. Fujita

Keyword(s):

Reinforcement Learning ◽

Markov Decision Processes ◽

Decision Processes ◽

Learning Scheme ◽

Markov Decision ◽

Multi Agent ◽

Partially Observable Markov ◽

Partially Observable

Download Full-text

A pulse neural network reinforcement learning algorithm for partially observable Markov decision processes

Systems and Computers in Japan ◽

10.1002/scj.10645 ◽

2005 ◽

Vol 36 (3) ◽

pp. 42-52 ◽

Author(s):

Koichiro Takita ◽

Masafumi Hagiwara

Keyword(s):

Neural Network ◽

Reinforcement Learning ◽

Markov Decision Processes ◽

Learning Algorithm ◽

Decision Processes ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable ◽

Reinforcement Learning Algorithm

Download Full-text

Cooperation and coordination between fuzzy reinforcement learning agents in continuous state partially observable Markov decision processes

FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315) ◽

10.1109/fuzzy.1999.793014 ◽

1999 ◽

Author(s):

H.R. Berenji ◽

D. Vengerov

Keyword(s):

Reinforcement Learning ◽

Markov Decision Processes ◽

Decision Processes ◽

Learning Agents ◽

Continuous State ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable

Download Full-text

Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems: Part 1—Fundamentals and Applications in Games, Robotics and Natural Language Processing

Machine Learning and Knowledge Extraction ◽

10.3390/make3030029 ◽

2021 ◽

Vol 3 (3) ◽

pp. 554-581

Author(s):

Xuanchen Xiang ◽

Simon Foo

Keyword(s):

Natural Language Processing ◽

Reinforcement Learning ◽

Natural Language ◽

Markov Decision Processes ◽

Language Processing ◽

Decision Processes ◽

Recent Advances ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable

The first part of a two-part series of papers provides a survey on recent advances in Deep Reinforcement Learning (DRL) applications for solving partially observable Markov decision processes (POMDP) problems. Reinforcement Learning (RL) is an approach to simulate the human’s natural learning process, whose key is to let the agent learn by interacting with the stochastic environment. The fact that the agent has limited access to the information of the environment enables AI to be applied efficiently in most fields that require self-learning. Although efficient algorithms are being widely used, it seems essential to have an organized investigation—we can make good comparisons and choose the best structures or algorithms when applying DRL in various applications. In this overview, we introduce Markov Decision Processes (MDP) problems and Reinforcement Learning and applications of DRL for solving POMDP problems in games, robotics, and natural language processing. A follow-up paper will cover applications in transportation, communications and networking, and industries.

Download Full-text

Guided Soft Actor Critic: A Guided Deep Reinforcement Learning Approach for Partially Observable Markov Decision Processes

IEEE Access ◽

10.1109/access.2021.3131772 ◽

2021 ◽

pp. 1-1

Author(s):

Mehmet Haklidir ◽

Hakan Temeltas

Keyword(s):

Reinforcement Learning ◽

Markov Decision Processes ◽

Decision Processes ◽

Learning Approach ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable

Download Full-text

A Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes

IEEE Access ◽

10.1109/access.2018.2854283 ◽

2018 ◽

Vol 6 ◽

pp. 49089-49102 ◽

Author(s):

Tuyen P. Le ◽

Ngo Anh Vien ◽

TaeChoong Chung

Keyword(s):

Reinforcement Learning ◽

Markov Decision Processes ◽

Learning Algorithm ◽

Decision Processes ◽

Hierarchical Reinforcement Learning ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable ◽

Reinforcement Learning Algorithm

Download Full-text

A State Space Filter for Reinforcement Learning in Partially Observable Markov Decision Processes

Transactions of the Society of Instrument and Control Engineers ◽

10.9746/sicetr.45.41 ◽

2009 ◽

Vol 45 (1) ◽

pp. 41-50 ◽

Author(s):

Masato Nagayoshi ◽

Hajime Murao ◽

Hisashi Tamaki

Keyword(s):

Reinforcement Learning ◽

State Space ◽

Markov Decision Processes ◽

Decision Processes ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable ◽

Download Full-text

Fuzzy reinforcement learning control for decentralized partially observable Markov decision processes

2011 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2011) ◽

10.1109/fuzzy.2011.6007675 ◽

2011 ◽

Author(s):

Rajneesh Sharma ◽

Matthijs T. J. Spaan

Keyword(s):

Reinforcement Learning ◽

Markov Decision Processes ◽

Learning Control ◽

Decision Processes ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable

Download Full-text

Oracular Partially Observable Markov Decision Processes: A Very Special Case

Proceedings 2007 IEEE International Conference on Robotics and Automation ◽

10.1109/robot.2007.363691 ◽

2007 ◽

Author(s):

Nicholas Armstrong-Crews ◽

Manuela Veloso

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable ◽

Download Full-text

Active Chemical Sensing With Partially Observable Markov Decision Processes

10.1063/1.3156617 ◽

2009 ◽

Author(s):

Rakesh Gosangi ◽

Ricardo Gutierrez-Osuna ◽

Matteo Pardo ◽

Giorgio Sberveglieri

Keyword(s):

Markov Decision Processes ◽

Chemical Sensing ◽

Decision Processes ◽

Active Chemical ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable

Download Full-text