Partially Observable Markov
Recently Published Documents


TOTAL DOCUMENTS: 261 (five years: 51)

H-INDEX: 22 (five years: 2)

Author(s):  
Hanna Kurniawati

Planning under uncertainty is critical to robotics. The partially observable Markov decision process (POMDP) is a mathematical framework for such planning problems. POMDPs are powerful because of their careful quantification of the nondeterministic effects of actions and the partial observability of the states. But for the same reason, they are notorious for their high computational complexity and have been deemed impractical for robotics. However, over the past two decades, the development of sampling-based approximate solvers has led to tremendous advances in POMDP-solving capabilities. Although these solvers do not generate the optimal solution, they can compute good POMDP solutions that significantly improve the robustness of robotics systems within reasonable computational resources, thereby making POMDPs practical for many realistic robotics problems. This article presents a review of POMDPs, emphasizing computational issues that have hindered their practicality in robotics and ideas in sampling-based solvers that have alleviated such difficulties, together with lessons learned from applying POMDPs to physical robots. Expected final online publication date for the Annual Review of Control, Robotics, and Autonomous Systems, Volume 5 is May 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
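For a quick illustration of the sampling-based idea this review centers on (not any specific solver it covers), the sketch below approximates a POMDP belief with a particle set and updates it after an action and an observation. The two-state tiger-style model, the probabilities, and the function names are illustrative assumptions.

```python
import random

# Minimal sketch of a sampled (particle-based) belief update for a POMDP.
# The two-state "tiger-left / tiger-right" model and all probabilities below
# are illustrative assumptions, not the models used by the reviewed solvers.

STATES = ["tiger-left", "tiger-right"]

def transition(state, action):
    # "listen" leaves the world unchanged; opening a door resets the tiger.
    if action == "listen":
        return state
    return random.choice(STATES)

def observation_prob(obs, state, action):
    # Listening is correct 85% of the time; other actions are uninformative.
    if action != "listen":
        return 0.5
    correct = (obs == "hear-left") == (state == "tiger-left")
    return 0.85 if correct else 0.15

def update_belief(particles, action, obs):
    """Weight particles by observation likelihood, then resample."""
    moved = [transition(s, action) for s in particles]
    weights = [observation_prob(obs, s, action) for s in moved]
    total = sum(weights)
    if total == 0:  # observation impossible under every particle
        return moved
    return random.choices(moved, weights=weights, k=len(particles))

belief = [random.choice(STATES) for _ in range(1000)]
belief = update_belief(belief, "listen", "hear-left")
print(belief.count("tiger-left") / len(belief))  # roughly 0.85
```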


2021, Vol. 72, pp. 819-847
Author(s):  
Steven Carr, Nils Jansen, Ufuk Topcu

Partially observable Markov decision processes (POMDPs) are models for sequential decision-making under uncertainty and incomplete information. Machine learning methods typically train recurrent neural networks (RNNs) as effective representations of POMDP policies that can efficiently process sequential data. However, it is hard to verify whether a POMDP driven by such an RNN-based policy satisfies safety constraints, for instance those given by temporal logic specifications. We propose a novel method that combines techniques from machine learning with the field of formal methods: training an RNN-based policy and then automatically extracting a so-called finite-state controller (FSC) from the RNN. Such FSCs offer a convenient way to verify temporal logic constraints: applied to a POMDP, they induce a Markov chain, and probabilistic verification methods can efficiently check whether this induced Markov chain satisfies a temporal logic specification. If the Markov chain does not satisfy the specification, verification yields, as a byproduct, diagnostic information about the POMDP states that are critical for the specification. The method exploits this diagnostic information either to adjust the complexity of the extracted FSC or to improve the policy by focused retraining of the RNN. The method synthesizes policies that satisfy temporal logic specifications for POMDPs with up to millions of states, three orders of magnitude larger than those handled by comparable approaches.
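As a rough sketch of the verification step described above (using assumed toy ingredients, not the authors' implementation), the snippet below shows how a small FSC composed with a POMDP induces a Markov chain over (state, memory node) pairs; a real pipeline would extract the FSC from the trained RNN and hand the induced chain to a probabilistic model checker.

```python
from itertools import product

# Minimal sketch of how a finite-state controller (FSC) induces a Markov chain
# on a POMDP. The tiny POMDP, the FSC, and all names are illustrative
# assumptions chosen only to make the construction explicit.

# POMDP ingredients: T[(s, a)] = P(s' | s, a) and O[s'] = P(o | s').
states = ["s0", "s1"]
T = {("s0", "a"): {"s0": 0.7, "s1": 0.3},
     ("s1", "a"): {"s0": 0.4, "s1": 0.6}}
O = {"s0": {"o0": 0.9, "o1": 0.1},
     "s1": {"o0": 0.2, "o1": 0.8}}

# FSC: each memory node picks an action and moves to a next node per observation.
fsc_action = {"n0": "a", "n1": "a"}
fsc_next = {("n0", "o0"): "n0", ("n0", "o1"): "n1",
            ("n1", "o0"): "n0", ("n1", "o1"): "n1"}

# Induced Markov chain over product states (POMDP state, FSC node).
chain = {}
for s, n in product(states, fsc_action):
    a = fsc_action[n]
    dist = {}
    for s2, p_t in T[(s, a)].items():
        for o, p_o in O[s2].items():
            key = (s2, fsc_next[(n, o)])
            dist[key] = dist.get(key, 0.0) + p_t * p_o
    chain[(s, n)] = dist

print(chain[("s0", "n0")])
```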


Author(s):  
Iadine Chades, Luz V. Pascal, Sam Nicol, Cameron S. Fletcher, Jonathan Ferrer Mestres

2021, Vol. 3 (3), pp. 554-581
Author(s):  
Xuanchen Xiang, Simon Foo

The first part of this two-part series of papers surveys recent advances in applying Deep Reinforcement Learning (DRL) to partially observable Markov decision process (POMDP) problems. Reinforcement Learning (RL) is an approach to simulating the human learning process, whose key idea is to let an agent learn by interacting with a stochastic environment. Because the agent requires only limited access to information about the environment, this approach can be applied efficiently in most fields that require self-learning. Although efficient algorithms are widely used, an organized investigation remains essential: it enables good comparisons and helps in choosing the best structures or algorithms when applying DRL to various applications. In this overview, we introduce Markov Decision Process (MDP) problems and Reinforcement Learning, and review applications of DRL for solving POMDP problems in games, robotics, and natural language processing. A follow-up paper will cover applications in transportation, communications and networking, and industries.
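To make the partial-observability point concrete, here is a minimal sketch, not tied to any surveyed method, of a recurrent policy that keeps a hidden state summarizing the observation history instead of the unobserved state; the network sizes, environment interface, and names are illustrative assumptions.

```python
import torch
import torch.nn as nn

class RecurrentPolicy(nn.Module):
    """Policy that conditions on the observation history via a GRU hidden state."""
    def __init__(self, obs_dim, n_actions, hidden=64):
        super().__init__()
        self.rnn = nn.GRU(obs_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, n_actions)

    def forward(self, obs, h=None):
        # obs: (batch, 1, obs_dim) -- one observation step at a time.
        out, h = self.rnn(obs, h)
        logits = self.head(out[:, -1])
        return logits, h

# Acting loop: sample actions while carrying the hidden state, which plays the
# role of an approximate belief over the state the agent cannot observe.
policy = RecurrentPolicy(obs_dim=4, n_actions=2)
h = None
obs = torch.zeros(1, 1, 4)      # placeholder first observation
for _ in range(5):
    logits, h = policy(obs, h)
    action = torch.distributions.Categorical(logits=logits).sample()
    obs = torch.randn(1, 1, 4)  # stand-in for the environment's next observation
```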


2021
Author(s):  
Shirin Akbarinasaji

Background: Bug tracking systems receive many bug reports daily. Although the software quality team aims to identify and resolve these bugs, it is never able to fix all of the reported bugs in the issue tracking system before the release deadline, and postponing bug fixes has consequences. Prioritization of bug reports helps the software manager decide which bugs to fix and which to postpone. Typically, bug reports are prioritized based on severity, priority, time and effort to fix, customer pressure, etc. Aim: Previous studies have shown that these factors may not be appropriate for prioritization, so relying on them to automate bug prioritization might be misleading. In this dissertation, we aim to prioritize bug reports with respect to the consequence of not fixing them, in terms of their relative importance in the issue tracking system. Method: To measure the relative importance of bugs in the issue tracking system, we propose constructing a dependency graph from the dependency-blocking information reported in the issue tracking system. Two metrics, namely depth and degree, are used to measure the relative importance of the bugs. However, there is uncertainty in the dependency graph structure because the dependency information is discovered manually and gradually. Owing to this uncertainty, prioritizing bugs in descending order of depth and degree may be misleading. To handle the uncertainty, we propose a novel approach based on a partially observable Markov decision process (POMDP) and partially observable Monte Carlo planning (POMCP). Result: To check the feasibility of the proposed approach, we analyzed seven years of data from an open source project, Firefox, and a commercial project. We compared the proposed policy with the developer policy, maximum policy, and random policy. Conclusion: The results suggest that software practitioners do not consider the relative importance of bugs in their current practice. The proposed framework can be combined with practitioners' expertise to prioritize bugs more effectively, taking the depth and degree of bugs into account. In practice, the POMDP framework with the POMCP planner can help practitioners sequentially select bugs to minimize the connectivity of the dependency graph.
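As a minimal sketch of the dependency-graph metrics described above, assuming "degree" counts the bugs a bug directly blocks and "depth" is the longest blocking chain starting at a bug (both readings and the toy data are illustrative assumptions, not the dissertation's definitions), one could compute and rank as follows.

```python
from functools import lru_cache

# Blocking graph: bug id -> list of bugs it blocks. Sample data is illustrative.
blocks = {
    "B1": ["B2", "B3"],
    "B2": ["B4"],
    "B3": [],
    "B4": [],
}

def degree(bug):
    # Number of bugs this bug directly blocks.
    return len(blocks.get(bug, []))

@lru_cache(maxsize=None)
def depth(bug):
    # Length of the longest blocking chain starting at this bug.
    children = blocks.get(bug, [])
    if not children:
        return 0
    return 1 + max(depth(c) for c in children)

# Rank bugs by (depth, degree): fixing B1 releases the longest blocking chain.
ranking = sorted(blocks, key=lambda b: (depth(b), degree(b)), reverse=True)
print(ranking)  # ['B1', 'B2', 'B3', 'B4'] under this toy data
```

The POMDP/POMCP layer in the dissertation then accounts for the fact that this graph is only partially known at any time, which a deterministic ranking like the one above ignores.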

