Battlefield Agent Decision-Making Based on Markov Decision Process

Author(s):
Jia Zhang, Xiang Wang, Fang Deng, Bin Xin, ...

Battlefield decision-making is an important part of modern information warfare. It analyses and integrates battlefield information, reduces operators’ workload, and helps them make decisions quickly in complex battlefield environments. This paper presents a dynamic battlefield decision-making method based on Markov Decision Processes (MDP). With this method, operators can obtain decision support quickly even under incomplete information. To improve the credibility of decisions, dynamic adaptability, and intelligence, softmax regression and random forests are introduced to improve the MDP model. Simulations show that the method is intuitive and practical, and has remarkable advantages in solving dynamic decision problems under incomplete information.
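The abstract leaves the model unspecified; the following is a hypothetical sketch of one plausible reading, in which a random forest learns the transition law from logged transitions and a softmax over action values produces a graded recommendation. The state/action encodings, rewards, and data are all made up, and note that the paper's "softmax regression" is a classifier, so using softmax as value weighting here is an interpretation, not the paper's method:

```python
# Hypothetical sketch only; the paper's exact formulation is not given
# in the abstract. A random forest estimates P(s' | s, a) from logged
# transitions, and a softmax over action values yields a graded
# recommendation for the operator.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

n_states, n_actions, gamma = 20, 4, 0.9
rng = np.random.default_rng(0)

# Synthetic transition log: (state, action) -> next state.
S = rng.integers(0, n_states, size=(1000, 1))
A = rng.integers(0, n_actions, size=(1000, 1))
S_next = rng.integers(0, n_states, size=1000)
forest = RandomForestClassifier(n_estimators=50, random_state=0)
forest.fit(np.hstack([S, A]), S_next)

def transition_probs(s, a):
    """Estimated P(s' | s, a) from the random forest."""
    p = np.zeros(n_states)
    p[forest.classes_] = forest.predict_proba([[s, a]])[0]
    return p

P = np.array([[transition_probs(s, a) for a in range(n_actions)]
              for s in range(n_states)])            # shape (S, A, S')
R = rng.normal(size=(n_states, n_actions))          # placeholder rewards

# Value iteration on the learned model.
V = np.zeros(n_states)
for _ in range(200):
    Q = R + gamma * P @ V                           # shape (S, A)
    V = Q.max(axis=1)

def recommend(s, tau=1.0):
    """Softmax over action values: a graded decision recommendation."""
    z = np.exp((Q[s] - Q[s].max()) / tau)
    return z / z.sum()

print(recommend(0))
```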

10.28945/2750
2004
Author(s):
Abdullah Gani, Omar Zakaria, Nor Badrul Anuar Jumaat

This paper presents an application of the Markov Decision Process (MDP) to the provision of traffic prioritisation in best-effort networks. MDP was used because it is a standard, general formalism for modelling stochastic, sequential decision problems. The implementation of traffic prioritisation involves a series of decision-making processes by which packets are marked and classified before being despatched to their destinations. The application of MDP was driven by the objective of ensuring that higher-priority packets are not delayed by lower-priority ones. MDP is believed to be applicable to improving traffic prioritisation arbitration.
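As an illustration only (the abstract does not specify the state, action, or reward design), a toy two-class packet-scheduling MDP solved by value iteration might look like the following; the queue capacity, arrival probabilities, and the 10:1 holding-cost ratio are all assumptions:

```python
# Toy sketch: two priority classes, state = (high-queue, low-queue) lengths,
# action = which queue to serve this slot. Not the paper's model.
import itertools

Q_MAX = 3                       # per-class queue capacity (assumed)
states = list(itertools.product(range(Q_MAX + 1), repeat=2))   # (hi, lo)
actions = (0, 1)                # 0: serve the high-priority queue, 1: the low
P_HI, P_LO = 0.3, 0.5           # assumed per-slot arrival probabilities
GAMMA = 0.95

def step_distribution(state, action):
    """Transition law: serve one packet, then Bernoulli arrivals."""
    hi, lo = state
    if action == 0 and hi > 0:
        hi -= 1
    elif action == 1 and lo > 0:
        lo -= 1
    dist = {}
    for ah in (0, 1):
        for al in (0, 1):
            ns = (min(hi + ah, Q_MAX), min(lo + al, Q_MAX))
            pr = (P_HI if ah else 1 - P_HI) * (P_LO if al else 1 - P_LO)
            dist[ns] = dist.get(ns, 0.0) + pr
    return dist

def reward(state):
    """Holding cost: a waiting high-priority packet costs 10x more."""
    hi, lo = state
    return -(10.0 * hi + lo)

def q_value(s, a, V):
    return reward(s) + GAMMA * sum(p * V[ns]
                                   for ns, p in step_distribution(s, a).items())

# Value iteration over the small state space.
V = {s: 0.0 for s in states}
for _ in range(300):
    V = {s: max(q_value(s, a, V) for a in actions) for s in states}

policy = {s: max(actions, key=lambda a: q_value(s, a, V)) for s in states}
print(policy[(2, 2)])   # expect 0: serve the high-priority queue first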


1999
Vol 32 (2)
pp. 4852-4857
Author(s):
Shalabh Bhatnagar, Michael C. Fu, Steven I. Marcus, Ying He

2021
pp. 1-16
Author(s):
Pegah Alizadeh, Emiliano Traversi, Aomar Osmani

Markov Decision Process models (MDPs) are a powerful tool for planning tasks and sequential decision-making problems. In this work we deal with MDPs with imprecise rewards, often used in situations where the data are uncertain. In this context, we provide algorithms for finding the policy that minimizes the maximum regret. To the best of our knowledge, all the regret-based methods proposed in the literature focus on providing an optimal stochastic policy. We introduce for the first time a method to calculate an optimal deterministic policy using optimization approaches. Deterministic policies are easily interpretable for users because, for a given state, they provide a unique choice. To better motivate the use of an exact procedure for finding a deterministic policy, we show some (theoretical and experimental) cases where the intuitive idea of using the deterministic policy obtained by “determinizing” the optimal stochastic policy leads to a policy far from the exact deterministic policy.
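A toy illustration (ours, not the paper's): with imprecise rewards given by two hypotheses over a single state with two actions, the minimax-regret policy is strictly mixed and achieves lower worst-case regret than either deterministic policy, which is why finding the best deterministic policy calls for its own exact procedure rather than rounding the stochastic one:

```python
# Single state, two actions, two reward hypotheses (imprecise rewards).
import numpy as np

R = np.array([[1.0, 0.0],    # hypothesis 1: action 0 worth 1, action 1 worth 0
              [0.0, 2.0]])   # hypothesis 2: action 0 worth 0, action 1 worth 2

def max_regret(p):
    """Worst-case regret of playing action 0 w.p. p, over both hypotheses."""
    policy = np.array([p, 1.0 - p])
    return np.max(R.max(axis=1) - R @ policy)

grid = np.linspace(0.0, 1.0, 1001)
best = min(grid, key=max_regret)
print(best, max_regret(best))            # ~0.333: strictly mixed, regret ~0.667
print(max_regret(1.0), max_regret(0.0))  # pure action 0: 2.0; pure action 1: 1.0
```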


2013
Vol 756-759
pp. 504-508
Author(s):
De Min Li, Jian Zou, Kai Kai Yue, Hong Yun Guan, Jia Cun Wang

Evacuating a firefighter from a complex fire scene is a challenging problem. In this paper, we discuss a firefighter’s evacuation decision-making model in an ad hoc robot network on a fire scene. Because the fire scene is dynamic, the information sensed by the ad hoc robot network also varies dynamically. We therefore adopt a dynamic decision method, the Markov decision process, to model the firefighter’s decision-making process for evacuation from the fire scene. In this decision-making process, the critical problems are how to define the action space and how to evaluate the transition law of the Markov decision process. We address these problems based on the triangular sensor layout of the ad hoc robot network, and conclude by describing a decision-making model for a firefighter’s evacuation.
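A minimal sketch (regions, hazard values, and probabilities are all assumed, not from the paper) of how an action space over neighbouring sensor regions and a noisy transition law could be combined into an evacuation MDP:

```python
# Hedged sketch: an evacuation MDP on a small graph of sensor-covered
# regions. Actions are moves toward neighbouring regions; the transition
# law is noisy because the scene changes while the firefighter moves.
import numpy as np

# Regions 0..4; region 4 is the exit. Edges stand in for the (assumed)
# triangular sensor layout of the ad hoc robot network.
neighbours = {0: [1, 2], 1: [0, 3], 2: [0, 3], 3: [1, 2, 4], 4: [4]}
hazard = np.array([0.2, 0.6, 0.1, 0.3, 0.0])   # assumed fire-risk estimates
GAMMA = 0.95

def transition(s, target):
    """Transition law: a move succeeds w.p. 0.8, else the firefighter stays."""
    return {target: 0.8, s: 0.2} if target != s else {s: 1.0}

def reward(s, target):
    """Reaching the exit pays off; moving into hazardous regions costs."""
    return 100.0 if target == 4 else -1.0 - 10.0 * hazard[target]

def q_value(s, t, V):
    return reward(s, t) + GAMMA * sum(p * V[ns]
                                      for ns, p in transition(s, t).items())

# Value iteration.
V = np.zeros(5)
for _ in range(200):
    V = np.array([max(q_value(s, t, V) for t in neighbours[s])
                  for s in range(5)])

policy = {s: max(neighbours[s], key=lambda t: q_value(s, t, V))
          for s in range(5)}
print(policy)   # expected route 0 -> 2 -> 3 -> 4, avoiding the hazard at 1
```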


1987
Vol 24 (03)
pp. 644-656
Author(s):
Frederick J. Beutler, Keith W. Ross

Uniformization permits the replacement of a semi-Markov decision process (SMDP) by a Markov chain exhibiting the same average rewards for simple (non-randomized) policies. It is shown that various anomalies may occur, especially for stationary (randomized) policies; uniformization introduces virtual jumps with concomitant action changes not present in the original process. Since these lead to discrepancies in the average rewards for stationary policies, uniformization can be accepted as valid only for simple policies. We generalize uniformization to yield consistent results for stationary policies also. These results are applied to constrained optimization of SMDPs, in which stationary (randomized) policies appear naturally. The structure of optimal constrained SMDP policies can then be elucidated by studying the corresponding controlled Markov chains. Moreover, constrained SMDP optimal policy computations can be more easily implemented in discrete time, the generalized uniformization being employed to relate discrete- and continuous-time optimal constrained policies.
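For reference, the basic uniformization construction the abstract builds on, for a fixed (policy-induced) generator: with rate Λ at least the largest exit rate, the chain P = I + Q/Λ preserves stationary behaviour, and its diagonal entries are exactly the virtual self-jumps the abstract warns about. A small numerical sketch (the generator is made up):

```python
# Standard uniformization of a continuous-time Markov chain.
import numpy as np

Q = np.array([[-2.0,  2.0,  0.0],     # generator of a 3-state CTMC (assumed)
              [ 1.0, -3.0,  2.0],
              [ 0.0,  4.0, -4.0]])

Lam = np.max(np.abs(np.diag(Q)))      # uniformization rate >= all exit rates
P = np.eye(3) + Q / Lam               # uniformized transition matrix

# Sanity checks: P is stochastic and shares Q's stationary distribution.
assert np.allclose(P.sum(axis=1), 1.0)
evals, evecs = np.linalg.eig(P.T)
pi = np.real(evecs[:, np.argmax(np.real(evals))])
pi /= pi.sum()
assert np.allclose(pi @ Q, 0.0, atol=1e-8)
print(P)   # diagonal entries are the virtual self-jump probabilities
```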


2005
Vol 5 (2)
pp. 23-30
Author(s):
J.P. Torterotot, M. Rebelo, C. Werey, J. Craveiro

The European project CARE-W (Computer Aided Rehabilitation of Water Networks), supported by the European Commission, has created and tested a prototype decision support system for the rehabilitation of water pipes. Within the project, current operational decision-making processes were analysed in 14 water utilities. The objectives were to identify the actors involved and their interactions, as well as the structure (formal and informal) of the decision processes: institutional and regulatory contexts, steps of decision-making, information flows, the distribution of responsibilities and influence, and the participation of social and institutional stakeholders. Summary results are presented. The cases studied differ in several respects. An “average” situation could be described as showing a moderate level of confrontation, rather formalised procedures, and highly centralised decision-making, apart from the interrelations with road-works programming. The greatest diversity among the utilities concerns the level of information within the decision process: the data considered, the flows of information, and the “sophistication” of the criteria taken into account.


2017
Vol 54 (4)
pp. 1071-1088
Author(s):
Xin Guo, Alexey Piunovskiy, Yi Zhang

We consider the discounted continuous-time Markov decision process (CTMDP), where the negative part of each cost rate is bounded by a drift function, say w, whereas the positive part is allowed to be arbitrarily unbounded. Our focus is on the existence of a stationary optimal policy for the discounted CTMDP problems out of the more general class. Both constrained and unconstrained problems are considered. Our investigations are based on the continuous-time version of the Veinott transformation. This technique has not been widely employed in the previous literature on CTMDPs, but it clarifies the roles of the imposed conditions in a rather transparent way.
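A hedged rendering of the kind of drift-type condition the abstract describes (the notation is assumed for illustration, not quoted from the paper):

```latex
% Notation assumed: c(x,a) is the cost rate, c^-(x,a) = max(-c(x,a), 0)
% its negative part, and w >= 1 a drift function. The boundedness
% requirement on the negative part reads
\[
  c^{-}(x,a) \le M\, w(x) \qquad \text{for all } (x,a),
\]
% while the positive part c^+(x,a) may be arbitrarily unbounded.
% A typical accompanying drift inequality on the transition rates q:
\[
  \int_{X} w(y)\, q(\mathrm{d}y \mid x, a) \le \rho\, w(x) + b,
\]
% and the alpha-discounted criterion to be optimized:
\[
  V^{\pi}(x) = \mathbb{E}^{\pi}_{x}\!\left[\int_{0}^{\infty}
    e^{-\alpha t}\, c(x_t, a_t)\, \mathrm{d}t\right].
\]
```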

