A constraint partially observable semi-Markov decision process for the attack–defence relationships in various critical infrastructures

Background: Bug tracking systems receive many bug reports daily. Although the software quality team aims to identify and resolve these bugs, it is never able to fix all of the bugs reported in the issue tracking system before the release deadline. Postponing a bug fix, however, may have consequences. Prioritizing bug reports helps the software manager decide which bugs to fix and which to postpone. Typically, bug reports are prioritized by severity, priority, time and effort required for fixing, customer pressure, etc.
Aim: Previous studies have shown that these factors may not be appropriate for prioritization, so relying on them to automate bug prioritization might be misleading. In this dissertation, we aim to prioritize bug reports by the consequence of not fixing them, expressed as their relative importance in the issue tracking system.
Method: To measure the relative importance of bugs in the issue tracking system, we propose constructing a dependency graph from the dependency-blocking information reported in the issue tracking system. Two metrics, depth and degree, are used to measure the relative importance of the bugs. However, the structure of the dependency graph is uncertain because the dependency information is discovered manually and gradually. Owing to this uncertainty, prioritizing bugs in descending order of depth and degree may be misleading. To handle the uncertainty, we propose a novel approach that combines a partially observable Markov decision process (POMDP) with partially observable Monte Carlo planning (POMCP).
Result: To check the feasibility of the proposed approach, we analyzed seven years of data from an open source project, Firefox, and a commercial project. We compared the proposed policy with the developer policy, maximum policy, and random policy.
Conclusion: The results suggest that software practitioners do not consider the relative importance of bugs in their current practice. The proposed framework can be combined with practitioners’ expertise to prioritize bugs more effectively and take the depth and degree of bugs into account. In practice, the POMDP framework with the POMCP planner can help practitioners sequentially select bugs to minimize the connectivity of the dependency graph.
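
The depth and degree metrics named in the abstract can be illustrated with a minimal sketch. The abstract does not give formal definitions, so the following are assumptions: degree is taken as the number of bugs a bug directly blocks, and depth as the length of the longest blocking chain that starts at that bug. The `blocks` data and all function names are hypothetical. The sketch shows only the plain descending-order ranking, which the abstract warns may be misleading when the graph is uncertain; it does not implement the POMDP or POMCP part.

```python
from functools import lru_cache

# Hypothetical blocking data: bug id -> ids of the bugs it directly blocks.
blocks = {
    "A": ["B", "C"],
    "B": ["D"],
    "C": ["D"],
    "D": [],
    "E": [],
}


def degree(bug):
    """Number of bugs directly blocked by `bug` (assumed definition)."""
    return len(blocks.get(bug, []))


@lru_cache(maxsize=None)
def depth(bug):
    """Longest blocking chain starting at `bug` (assumed definition).

    Assumes the dependency graph is acyclic; a cycle would recurse forever.
    """
    children = blocks.get(bug, [])
    if not children:
        return 0
    return 1 + max(depth(child) for child in children)


def rank_bugs():
    """Order bugs by descending depth, then descending degree."""
    return sorted(blocks, key=lambda b: (depth(b), degree(b)), reverse=True)


if __name__ == "__main__":
    for bug in rank_bugs():
        print(bug, "depth:", depth(bug), "degree:", degree(bug))
```

In this toy graph, bug A ranks first because it blocks two bugs directly and heads the longest chain. If a blocking edge has not yet been reported, both metrics can change, which is the uncertainty the abstract's POMDP formulation is meant to address.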

Partially Observable Markov Decision Process (POMDP) Technologies for Sign Language Based Human-Computer Interaction

Lecture Notes in Computer Science - Universal Access in Human-Computer Interaction. Applications and Services ◽

10.1007/978-3-642-02713-0_61 ◽

2009 ◽

pp. 577-586

Author(s):

Sylvie C. W. Ong ◽

David Hsu ◽

Wee Sun Lee ◽

Hanna Kurniawati

Keyword(s):

Human Computer Interaction ◽

Sign Language ◽

Markov Decision Process ◽

Decision Process ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable ◽

Computer Interaction

Partially Observable Markov Decision Process Approximations for Adaptive Sensing

Discrete Event Dynamic Systems ◽

10.1007/s10626-009-0071-x ◽

2009 ◽

Vol 19 (3) ◽

pp. 377-422 ◽

Cited By ~ 46

Author(s):

Edwin K. P. Chong ◽

Christopher M. Kreucher ◽

Alfred O. Hero

Keyword(s):

Markov Decision Process ◽

Decision Process ◽

Adaptive Sensing ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable

Replacement policy for a partially observable Markov decision process model using fuzzy data

Computers & Industrial Engineering ◽

10.1016/0360-8352(93)90314-n ◽

1993 ◽

Vol 25 (1-4) ◽

pp. 435-438

Author(s):

Chang Eun Kim ◽

Mitsuo Gen

Keyword(s):

Markov Decision Process ◽

Decision Process ◽

Process Model ◽

Replacement Policy ◽

Fuzzy Data ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable

Monte-Carlo-based partially observable Markov decision process approximations for adaptive sensing

2008 9th International Workshop on Discrete Event Systems ◽

10.1109/wodes.2008.4605941 ◽

2008 ◽

Cited By ~ 9

Author(s):

Edwin K. P. Chong ◽

Christopher M. Kreucher ◽

Alfred O. Hero

Keyword(s):

Monte Carlo ◽

Markov Decision Process ◽

Decision Process ◽

Adaptive Sensing ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable

COG-DICE: An Algorithm for Solving Continuous-Observation Dec-POMDPs

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/638 ◽

2017 ◽

Author(s):

Madison Clark-Turner ◽

Christopher Amato

Keyword(s):

Markov Decision Process ◽

Real World ◽

Decision Process ◽

Extended Version ◽

Continuous Observation ◽

Solution Methods ◽

Markov Decision ◽

Multi Agent ◽

Partially Observable Markov ◽

Partially Observable

The decentralized partially observable Markov decision process (Dec-POMDP) is a powerful model for representing multi-agent problems with decentralized behavior. Unfortunately, current Dec-POMDP solution methods cannot solve problems with continuous observations, which are common in many real-world domains. To that end, we present a framework for representing and generating Dec-POMDP policies that explicitly include continuous observations. We apply our algorithm to a novel tagging problem and an extended version of a common benchmark, where it generates policies that meet or exceed the values of equivalent discretized domains without the need for finding an adequate discretization.

Autonomous Robotic Manipulation as a Partially Observable Markov Decision Process

Robotics & Automation Engineering Journal ◽

10.19080/raej.2018.02.555580 ◽

2017 ◽

Vol 2 (1) ◽

Author(s):

Lei Cui

Keyword(s):

Markov Decision Process ◽

Decision Process ◽

Robotic Manipulation ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable

Universal Reinforcement Learning Algorithms: Survey and Experiments

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/194 ◽

2017 ◽

Author(s):

John Aslanides ◽

Jan Leike ◽

Marcus Hutter

Keyword(s):

Reinforcement Learning ◽

Open Source ◽

Markov Decision Process ◽

Decision Process ◽

Empirical Investigation ◽

State Of The Art ◽

Learning Algorithms ◽

Markov Decision ◽

Reference Implementation ◽

Partially Observable

Many state-of-the-art reinforcement learning (RL) algorithms typically assume that the environment is an ergodic Markov Decision Process (MDP). In contrast, the field of universal reinforcement learning (URL) is concerned with algorithms that make as few assumptions as possible about the environment. The universal Bayesian agent AIXI and a family of related URL algorithms have been developed in this setting. While numerous theoretical optimality results have been proven for these agents, there has been no empirical investigation of their behavior to date. We present a short and accessible survey of these URL algorithms under a unified notation and framework, along with results of some experiments that qualitatively illustrate some properties of the resulting policies, and their relative performance on partially-observable gridworld environments. We also present an open-source reference implementation of the algorithms which we hope will facilitate further understanding of, and experimentation with, these ideas.
