An Algebraic Graphical Model for Decision with Uncertainties, Feasibilities, and Utilities

2007, Vol. 29, pp. 421-489
Author(s): C. Pralet, G. Verfaillie, T. Schiex

Numerous formalisms and dedicated algorithms have been designed in recent decades to model and solve decision-making problems. Some formalisms, such as constraint networks, can express "simple" decision problems, while others are designed to take into account uncertainties, infeasible decisions, and utilities. Even within a single formalism, several variants are often proposed to model different types of uncertainty (probability, possibility, ...) or utility (additive or not). In this article, we introduce an algebraic graphical model that encompasses a large number of such formalisms: (1) we first adapt previous structures from Friedman, Chu, and Halpern for representing uncertainty, utility, and expected utility in order to deal with generic forms of sequential decision making; (2) on these structures, we then introduce composite graphical models that express information via variables linked by "local" functions, thanks to conditional independence; (3) on these graphical models, we finally define a simple class of queries which can represent various scenarios in terms of observability and controllability. A natural decision-tree semantics for such queries is complemented by an equivalent operational semantics, which induces generic algorithms. The proposed framework, called the Plausibility-Feasibility-Utility (PFU) framework, not only provides a better understanding of the links between existing formalisms, but also covers as-yet-unpublished frameworks (such as possibilistic influence diagrams) and unifies formalisms such as quantified Boolean formulas and influence diagrams. Our generic backtrack and variable elimination algorithms are a first step towards unified algorithms.
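The variable elimination idea mentioned in the abstract can be sketched informally: local functions over small scopes are combined with an algebraic combination operator, and variables are eliminated one at a time with an elimination operator (for instance, sum for environment variables and max for decision variables in the expected-utility case). The Python sketch below is only an illustration of that idea under simplified assumptions (scalar-valued factors, a fixed elimination order, invented toy tables); it is not the PFU algorithm from the paper.

```python
from itertools import product

def eliminate(factors, var, domains, combine, reduce_op):
    """Combine all factors mentioning `var`, then eliminate `var` with `reduce_op`."""
    touching = [f for f in factors if var in f["scope"]]
    rest = [f for f in factors if var not in f["scope"]]
    scope = sorted({v for f in touching for v in f["scope"]} - {var})

    def value(assignment):
        vals = []
        for x in domains[var]:
            full = dict(assignment, **{var: x})
            acc = None
            for f in touching:
                v = f["table"][tuple(full[s] for s in f["scope"])]
                acc = v if acc is None else combine(acc, v)
            vals.append(acc)
        out = vals[0]
        for v in vals[1:]:
            out = reduce_op(out, v)
        return out

    table = {pt: value(dict(zip(scope, pt)))
             for pt in product(*(domains[s] for s in scope))}
    return rest + [{"scope": scope, "table": table}]

# Toy query: plausibilities combined by *, environment variable e eliminated
# by +, decision variable d eliminated by max (all values are invented).
domains = {"d": [0, 1], "e": [0, 1]}
factors = [
    {"scope": ["d", "e"], "table": {(0, 0): 1.0, (0, 1): 3.0, (1, 0): 4.0, (1, 1): 0.5}},
    {"scope": ["e"],      "table": {(0,): 0.6, (1,): 0.4}},
]
factors = eliminate(factors, "e", domains, combine=lambda a, b: a * b, reduce_op=lambda a, b: a + b)
factors = eliminate(factors, "d", domains, combine=lambda a, b: a * b, reduce_op=max)
print(factors[0]["table"][()])   # value of the max-expected-utility-style query (2.6)
```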

Author(s): Layton Hayes, Prashant Doshi, Swaraj Pawar, Hari Teja Tatavarti

The sum-product network (SPN) has been extended to model sequence data with the recurrent SPN (RSPN), and to decision-making problems with sum-product-max networks (SPMNs). In this paper, we build on the concepts introduced by these extensions and present state-based recurrent SPMNs (S-RSPMNs) as a generalization of SPMNs to sequential decision-making problems in which the state may not be perfectly observed. As with recurrent SPNs, S-RSPMNs utilize a repeatable template network to model sequences of arbitrary length. We present an algorithm for learning compact template structures by identifying unique belief states and the transitions between them through a state-matching process that utilizes augmented data. To our knowledge, this is the first data-driven approach that learns graphical models for planning under partial observability that can be solved efficiently. S-RSPMNs retain the linear solution complexity of SPMNs, and we demonstrate significant improvements in the compactness of the representation and in the run time of structure learning and inference in sequential domains.
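The linear solution complexity mentioned above comes from a single bottom-up pass over the network: sum nodes take a weighted average of their children, product nodes multiply them, and max nodes select the best decision branch. The Python sketch below is a much-simplified, hypothetical illustration of that pass on an invented toy network; it omits evidence, information sets, and the recurrent template machinery of S-RSPMNs.

```python
def evaluate(node):
    """Return the (expected-utility-style) value of a node in a tiny SPMN, bottom-up."""
    kind = node["type"]
    if kind == "leaf":                      # utility / likelihood leaf
        return node["value"]
    child_vals = [evaluate(c) for c in node["children"]]
    if kind == "sum":                       # chance node: weighted average of children
        return sum(w * v for w, v in zip(node["weights"], child_vals))
    if kind == "product":                   # independent factors: multiply children
        out = 1.0
        for v in child_vals:
            out *= v
        return out
    if kind == "max":                       # decision node: pick the best branch
        return max(child_vals)
    raise ValueError(kind)

# Invented two-action toy network, for illustration only.
toy_spmn = {
    "type": "max",
    "children": [
        {"type": "sum", "weights": [0.7, 0.3],
         "children": [{"type": "leaf", "value": 5.0},
                      {"type": "leaf", "value": 1.0}]},
        {"type": "sum", "weights": [0.2, 0.8],
         "children": [{"type": "leaf", "value": 10.0},
                      {"type": "leaf", "value": 0.0}]},
    ],
}
print(evaluate(toy_spmn))                   # 3.8 from the first branch vs. 2.0 from the second
```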


Author(s): Yinghui Pan, Jing Tang, Biyang Ma, Yifeng Zeng, Zhong Ming

With the availability of a significant amount of data, data-driven decision making becomes an alternative way of solving complex multiagent decision problems. Instead of using domain knowledge to explicitly build decision models, the data-driven approach learns decisions (probably optimal ones) from the available data. This removes the knowledge bottleneck of traditional knowledge-driven decision making, which requires strong support from domain experts. In this paper, we study data-driven decision making in the context of interactive dynamic influence diagrams (I-DIDs), a general framework for multiagent sequential decision making under uncertainty. We propose a data-driven framework for solving I-DID models and focus on learning the behavior of other agents in the problem domain. The challenge lies in learning, from limited data, the complete policy trees that will be embedded in the I-DID model. We propose two new methods for developing complete policy trees for the other agents in the I-DID. The first method uses a simple clustering process, while the second employs sophisticated statistical checks. We analyze the proposed algorithms theoretically and evaluate them empirically on two problem domains.
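As a rough illustration of the first, clustering-style method, the sketch below groups another agent's observed behaviour by action-observation history and takes the majority action at each node to form a policy tree. Everything here (the function, the traces, the domain names) is hypothetical and far simpler than the clustering and statistical checks used in the paper.

```python
from collections import Counter

def build_policy_tree(traces, depth=0, horizon=2):
    """traces: list of episodes, each a list of (action, observation) pairs.
    Returns a nested dict {'action': a, 'children': {observation: subtree}}."""
    live = [ep for ep in traces if len(ep) > depth]
    if not live or depth >= horizon:
        return None
    # Majority action at this history (a crude stand-in for clustering behaviour).
    action = Counter(ep[depth][0] for ep in live).most_common(1)[0][0]
    children = {}
    for obs in {ep[depth][1] for ep in live}:
        sub = [ep for ep in live if ep[depth] == (action, obs)]
        child = build_policy_tree(sub, depth + 1, horizon)
        if child:
            children[obs] = child
    return {"action": action, "children": children}

# Invented traces of another agent's behaviour in a two-step toy domain.
traces = [
    [("listen", "growl-left"), ("open-right", "reward")],
    [("listen", "growl-left"), ("open-right", "reward")],
    [("listen", "growl-right"), ("open-left", "reward")],
]
print(build_policy_tree(traces, horizon=2))
```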


Author(s): Luis Enrique Sucar

In this chapter we will cover the fundamentals of probabilistic graphical models, in particular Bayesian networks and influence diagrams, which are the basis for some of the techniques and applications described in the rest of the book. First we will give a general introduction to probabilistic graphical models, including the motivation for using these models, a brief history, and a general description of the main types of models. We will also include a brief review of the basics of probability theory. The core of the chapter will be the next three sections, devoted to (i) Bayesian networks, (ii) dynamic Bayesian networks, and (iii) influence diagrams. For each, we will introduce the models and their properties and give some examples. We will briefly describe the main inference techniques for the three types of models. For Bayesian and dynamic Bayesian networks we will discuss learning, including structure and parameter learning, and describe the main types of approaches. At the end of the section on influence diagrams we will briefly introduce sequential decision problems as a link to the chapter on MDPs and POMDPs. We conclude the chapter with a summary and pointers to further reading on each topic.
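To make the first of these models concrete, the sketch below hand-builds a three-variable Bayesian network (the familiar rain/sprinkler/wet-grass example) and answers a query by brute-force enumeration; the conditional probability values are invented, and the inference algorithms treated in the chapter are far more efficient than this enumeration.

```python
from itertools import product

# Structure: Rain -> Sprinkler, Rain -> WetGrass, Sprinkler -> WetGrass.
P_rain = {True: 0.2, False: 0.8}
P_sprinkler = {True: {True: 0.01, False: 0.99},    # P(Sprinkler | Rain)
               False: {True: 0.4, False: 0.6}}
P_wet = {(True, True): 0.99, (True, False): 0.8,   # P(WetGrass=True | Rain, Sprinkler)
         (False, True): 0.9, (False, False): 0.0}

def joint(rain, sprinkler, wet):
    """P(rain, sprinkler, wet) as a product of the local CPTs (chain rule)."""
    p_wet = P_wet[(rain, sprinkler)]
    return P_rain[rain] * P_sprinkler[rain][sprinkler] * (p_wet if wet else 1 - p_wet)

def posterior_rain_given_wet():
    """P(Rain=True | WetGrass=True), summing out Sprinkler by enumeration."""
    num = sum(joint(True, s, True) for s in (True, False))
    den = sum(joint(r, s, True) for r, s in product((True, False), repeat=2))
    return num / den

print(round(posterior_rain_given_wet(), 3))        # about 0.36 with these invented CPTs
```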


Author(s): Ming-Sheng Ying, Yuan Feng, Sheng-Gang Ying

The Markov decision process (MDP) offers a general framework for modelling sequential decision making where outcomes are random. In particular, it serves as a mathematical framework for reinforcement learning. This paper introduces an extension of MDPs, namely quantum MDPs (qMDPs), that can serve as a mathematical model of decision making about quantum systems. We develop dynamic programming algorithms for policy evaluation and for finding optimal policies for qMDPs in the finite-horizon case. The results obtained in this paper provide useful mathematical tools for reinforcement learning techniques applied to the quantum world.
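For orientation, the sketch below shows finite-horizon dynamic programming (backward induction) on a small classical MDP, the construction that the qMDP algorithms generalise; the quantum version works with density operators and super-operators instead, which is not attempted here, and the toy transition and reward tables are invented.

```python
def backward_induction(states, actions, P, R, horizon):
    """P[s][a] -> {s': prob}, R[s][a] -> immediate reward.
    Returns the value function V[t][s] and policy pi[t][s] for t = 0..horizon-1."""
    V = [{s: 0.0 for s in states} for _ in range(horizon + 1)]   # V[horizon] = 0
    pi = [{} for _ in range(horizon)]
    for t in range(horizon - 1, -1, -1):          # work backwards from the last stage
        for s in states:
            best_a, best_q = None, float("-inf")
            for a in actions:
                q = R[s][a] + sum(p * V[t + 1][s2] for s2, p in P[s][a].items())
                if q > best_q:
                    best_a, best_q = a, q
            V[t][s], pi[t][s] = best_q, best_a
    return V, pi

# Invented two-state, two-action MDP.
states, actions = ["s0", "s1"], ["stay", "go"]
P = {"s0": {"stay": {"s0": 1.0}, "go": {"s1": 0.9, "s0": 0.1}},
     "s1": {"stay": {"s1": 1.0}, "go": {"s0": 1.0}}}
R = {"s0": {"stay": 0.0, "go": -1.0},
     "s1": {"stay": 2.0, "go": 0.0}}
V, pi = backward_induction(states, actions, P, R, horizon=3)
print(V[0], pi[0])    # stage-0 values and actions, e.g. 'go' from s0, 'stay' in s1
```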

