An Extension of Finite-state Markov Decision Process and an Application of Grammatical Inference

AbstractBertrand et al. introduced a model of parameterised systems, where each agent is represented by a finite state system, and studied the following control problem: for any number of agents, does there exist a controller able to bring all agents to a target state? They showed that the problem is decidable and EXPTIME-complete in the adversarial setting, and posed as an open problem the stochastic setting, where the agent is represented by a Markov decision process. In this paper, we show that the stochastic control problem is decidable. Our solution makes significant uses of well quasi orders, of the max-flow min-cut theorem, and of the theory of regular cost functions.

Download Full-text

A Markov Decision Process Approach for Cost-Benefit Analysis of Infrastructure Resilience Upgrades

SSRN Electronic Journal ◽

10.2139/ssrn.3657479 ◽

2020 ◽

Author(s):

Qianru Zhu ◽

Benjamin D. Leibowicz

Keyword(s):

Markov Decision Process ◽

Decision Process ◽

Cost Benefit Analysis ◽

Cost Benefit ◽

Process Approach ◽

Benefit Analysis ◽

Markov Decision ◽

Infrastructure Resilience

Download Full-text

A Markov Decision Process Workflow for Automating Interior Design

KSCE Journal of Civil Engineering ◽

10.1007/s12205-021-1272-6 ◽

2021 ◽

Author(s):

Ebrahim Karan ◽

Sadegh Asgari ◽

Abbas Rashidi

Keyword(s):

Markov Decision Process ◽

Interior Design ◽

Decision Process ◽

Markov Decision

Download Full-text

A constraint partially observable semi-Markov decision process for the attack–defence relationships in various critical infrastructures

Cyber-Physical Systems ◽

10.1080/23335777.2021.1879935 ◽

2021 ◽

pp. 1-26

Author(s):

Nadia Niknami ◽

Jie Wu

Keyword(s):

Markov Decision Process ◽

Decision Process ◽

Critical Infrastructures ◽

Markov Decision ◽

Partially Observable

Download Full-text

Development of a Shipment Policy for Collection Centers

Mathematics ◽

10.3390/math9121385 ◽

2021 ◽

Vol 9 (12) ◽

pp. 1385

Author(s):

Irais Mora-Ochomogo ◽

Marco Serrato ◽

Jaime Mora-Vargas ◽

Raha Akhavan-Tabatabaei

Keyword(s):

Climate Change ◽

Natural Disasters ◽

Markov Decision Process ◽

Decision Process ◽

Necessary Conditions ◽

Decision Makers ◽

Humanitarian Organizations ◽

The World ◽

Markov Decision ◽

Unsatisfied Demand

Natural disasters represent a latent threat for every country in the world. Due to climate change and other factors, statistics show that they continue to be on the rise. This situation presents a challenge for the communities and the humanitarian organizations to be better prepared and react faster to natural disasters. In some countries, in-kind donations represent a high percentage of the supply for the operations, which presents additional challenges. This research proposes a Markov Decision Process (MDP) model to resemble operations in collection centers, where in-kind donations are received, sorted, packed, and sent to the affected areas. The decision addressed is when to send a shipment considering the uncertainty of the donations’ supply and the demand, as well as the logistics costs and the penalty of unsatisfied demand. As a result of the MDP a Monotone Optimal Non-Decreasing Policy (MONDP) is proposed, which provides valuable insights for decision-makers within this field. Moreover, the necessary conditions to prove the existence of such MONDP are presented.

Download Full-text

An Optimal Life Cycle Reprofiling Strategy of Train Wheels Based on Markov Decision Process of Wheel Degradation

IEEE Transactions on Intelligent Transportation Systems ◽

10.1109/tits.2021.3093019 ◽

2021 ◽

pp. 1-11

Author(s):

Yuanchen Zeng ◽

Dongli Song ◽

Weihua Zhang ◽

Bin Zhou ◽

Mingyuan Xie ◽

...

Keyword(s):

Life Cycle ◽

Markov Decision Process ◽

Decision Process ◽

Markov Decision

Download Full-text

Computational complexity reduction algorithms for Markov decision process based vertical handoff in mobile networks

International Journal of Communication Systems ◽

10.1002/dac.4938 ◽

2021 ◽

Author(s):

Rida Gillani ◽

Ali Nasir

Keyword(s):

Computational Complexity ◽

Markov Decision Process ◽

Mobile Networks ◽

Decision Process ◽

Complexity Reduction ◽

Vertical Handoff ◽

Markov Decision

Download Full-text

Optimal Control of Boolean Control Networks with Discounted Cost: An Efficient Approach based on Deterministic Markov Decision Process

2020 IEEE 16th International Conference on Control & Automation (ICCA) ◽

10.1109/icca51439.2020.9264464 ◽

2020 ◽

Author(s):

Shuhua Gao ◽

Cheng Xiang ◽

Tong Heng Lee

Keyword(s):

Optimal Control ◽

Markov Decision Process ◽

Decision Process ◽

Discounted Cost ◽

Control Networks ◽

Efficient Approach ◽

Markov Decision

Download Full-text

Dynamic Task Migration Combining Energy Efficiency and Load Balancing Optimization in Three-Tier UAV-Enabled Mobile Edge Computing System

Electronics ◽

10.3390/electronics10020190 ◽

2021 ◽

Vol 10 (2) ◽

pp. 190

Author(s):

Wu Ouyang ◽

Zhigang Chen ◽

Jia Wu ◽

Genghua Yu ◽

Heng Zhang

Keyword(s):

Energy Consumption ◽

Load Balancing ◽

Markov Decision Process ◽

Decision Process ◽

Base Station ◽

Edge Computing ◽

Mobile Edge Computing ◽

Task Migration ◽

Migration Strategy ◽

Markov Decision

As transportation becomes more convenient and efficient, users move faster and faster. When a user leaves the service range of the original edge server, the original edge server needs to migrate the tasks offloaded by the user to other edge servers. An effective task migration strategy needs to fully consider the location of users, the load status of edge servers, and energy consumption, which make designing an effective task migration strategy a challenge. In this paper, we innovatively proposed a mobile edge computing (MEC) system architecture consisting of multiple smart mobile devices (SMDs), multiple unmanned aerial vehicle (UAV), and a base station (BS). Moreover, we establish the model of the Markov decision process with unknown rewards (MDPUR) based on the traditional Markov decision process (MDP), which comprehensively considers the three aspects of the migration distance, the residual energy status of the UAVs, and the load status of the UAVs. Based on the MDPUR model, we propose a advantage-based value iteration (ABVI) algorithm to obtain the effective task migration strategy, which can help the UAV group to achieve load balancing and reduce the total energy consumption of the UAV group under the premise of ensuring user service quality. Finally, the results of simulation experiments show that the ABVI algorithm is effective. In particular, the ABVI algorithm has better performance than the traditional value iterative algorithm. And in a dynamic environment, the ABVI algorithm is also very robust.

Download Full-text