Large scale system management based on Markov Decision Process and Big Data Concept

Author(s):  
Sagit Valeev ◽  
Natalya Kondratyeva
2010 ◽  
Vol 44-47 ◽  
pp. 3611-3615 ◽  
Author(s):  
Zhi Cong Zhang ◽  
Kai Shun Hu ◽  
Hui Yu Huang ◽  
Shuai Li ◽  
Shao Yong Zhao

Reinforcement learning (RL) is a state- or action-value based machine learning method which approximately solves large-scale Markov Decision Processes (MDPs) or Semi-Markov Decision Processes (SMDPs). A multi-step RL algorithm called Sarsa(λ,k) is proposed, which is a compromise between Sarsa and Sarsa(λ): it is equivalent to Sarsa if k is 1 and equivalent to Sarsa(λ) if k is infinite. Sarsa(λ,k) adjusts its performance by the choice of k. Two forms of Sarsa(λ,k), forward-view Sarsa(λ,k) and backward-view Sarsa(λ,k), are constructed and proved equivalent in off-line updating.
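The abstract does not reproduce the update rule, but a forward-view sketch consistent with its description is the k-truncated λ-return: a weighted mixture of the first k n-step Sarsa returns, so that k = 1 recovers one-step Sarsa and k → ∞ recovers the full Sarsa(λ) return. The function below is illustrative only; the trajectory layout and argument names are assumptions, not the paper's notation.

```python
def truncated_lambda_return(rewards, q_next, lam, k, gamma=0.99):
    """Forward-view k-truncated lambda-return (sketch).

    rewards[i] : reward received i steps after time t (needs >= k entries)
    q_next[i]  : bootstrap value Q(s_{t+i+1}, a_{t+i+1}) (needs >= k entries)
    """
    def n_step(n):
        # G^(n) = sum_{i<n} gamma^i * r_{t+i}  +  gamma^n * Q(s_{t+n}, a_{t+n})
        return sum(gamma**i * rewards[i] for i in range(n)) + gamma**n * q_next[n - 1]

    # (1 - lam) * sum_{n=1}^{k-1} lam^(n-1) * G^(n)  +  lam^(k-1) * G^(k)
    mix = (1 - lam) * sum(lam**(n - 1) * n_step(n) for n in range(1, k))
    return mix + lam**(k - 1) * n_step(k)
```

With k = 1 the mixture term vanishes and the target reduces to the one-step Sarsa return; as k grows, the weights approach the geometric weighting of the full λ-return.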


Author(s):  
A.V. Edelev ◽  
D.N. Karamov ◽  
I.A. Sidorov ◽  
D.V. Binh ◽  
N.H. Nam ◽  
...  

The paper addresses the large-scale penetration of renewable energy into the power system of Vietnam. The proposed approach formulates the optimization of operational decisions across different power generation technologies as a Markov decision process, combining a stochastic base model with a deterministic lookahead model. The base model applies stochastic search to optimize the operation of power sources, while the lookahead model captures the hourly variations of renewable energy over a year. The approach helps to find the optimal generation configuration under different market conditions.
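As a rough illustration of the base-model/lookahead-model split described above, the sketch below evaluates a deterministic hourly lookahead dispatch inside a Monte Carlo base model and tunes a reserve-margin parameter of the policy by crude stochastic search. The demand shape, forecast-error model, costs, and the single-technology dispatch are invented placeholders, not data or structure from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def lookahead_dispatch(demand_fcst, renew_fcst, reserve):
    # Deterministic lookahead model: schedule dispatchable (thermal)
    # generation against hourly forecasts, keeping a reserve margin.
    return np.maximum(demand_fcst * (1 + reserve) - renew_fcst, 0.0)

def base_model(reserve, hours=168, fuel_cost=40.0, shortfall_penalty=400.0,
               n_scenarios=200):
    # Stochastic base model: Monte Carlo evaluation of the lookahead policy
    # over noisy realizations of the renewable forecast.
    costs = []
    for _ in range(n_scenarios):
        demand = 1000 + 200 * np.sin(2 * np.pi * np.arange(hours) / 24)
        forecast = 300 + 150 * rng.random(hours)           # forecast output
        actual = forecast * rng.normal(1.0, 0.15, hours)   # realized output
        thermal = lookahead_dispatch(demand, forecast, reserve)
        shortfall = np.maximum(demand - actual - thermal, 0.0)
        costs.append(fuel_cost * thermal.sum()
                     + shortfall_penalty * shortfall.sum())
    return float(np.mean(costs))

# Stochastic search over the policy parameter (here a simple grid search)
best_reserve = min(np.linspace(0.0, 0.5, 11), key=base_model)
```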


2013 ◽  
Vol 30 (05) ◽  
pp. 1350014 ◽  
Author(s):  
Zhicong Zhang ◽  
Weiping Wang ◽  
Shouyan Zhong ◽  
Kaishun Hu

Reinforcement learning (RL) is a state- or action-value based machine learning method which solves large-scale multi-stage decision problems such as Markov Decision Process (MDP) and Semi-Markov Decision Process (SMDP) problems. We minimize the makespan of flow shop scheduling problems with an RL algorithm. We convert flow shop scheduling problems into SMDPs by constructing elaborate state features, actions, and a reward function such that minimizing the accumulated reward is equivalent to minimizing the schedule objective function. We apply the on-line TD(λ) algorithm with linear gradient-descent function approximation to solve the SMDPs. To examine the performance of the proposed RL algorithm, computational experiments are conducted on benchmark problems in comparison with other scheduling methods. The experimental results support the efficiency of the proposed algorithm and illustrate that the RL approach is a promising computational approach for flow shop scheduling problems worthy of further investigation.
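The abstract names on-line TD(λ) with linear gradient-descent function approximation; the sketch below is a generic Sarsa(λ)-style implementation of that combination with accumulating eligibility traces. The env interface and the feature map phi are assumed placeholders; the paper's state features, actions, and reward for the flow shop SMDP are not reproduced here, and a fixed discount stands in for the sojourn-time-dependent SMDP discount.

```python
import numpy as np

def linear_td_lambda(env, phi, n_features, episodes=200,
                     alpha=0.01, gamma=1.0, lam=0.8, eps=0.1):
    """On-line TD(lambda) with linear function approximation.
    env.reset() -> state; env.actions(s) -> list; env.step(a) -> (s', r, done).
    phi(s, a) -> feature vector of length n_features. All hypothetical."""
    rng = np.random.default_rng(0)
    w = np.zeros(n_features)
    q = lambda s, a: w @ phi(s, a)

    def eps_greedy(s, actions):
        if rng.random() < eps:
            return actions[rng.integers(len(actions))]
        return max(actions, key=lambda a: q(s, a))

    for _ in range(episodes):
        z = np.zeros(n_features)                  # eligibility trace
        s = env.reset()
        a = eps_greedy(s, env.actions(s))
        done = False
        while not done:
            s2, r, done = env.step(a)
            delta = r - q(s, a)                   # TD error
            if not done:
                a2 = eps_greedy(s2, env.actions(s2))
                delta += gamma * q(s2, a2)
            z = gamma * lam * z + phi(s, a)       # accumulating trace
            w += alpha * delta * z                # gradient-descent step
            if not done:
                s, a = s2, a2
    return w
```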


Mathematics ◽  
2021 ◽  
Vol 9 (12) ◽  
pp. 1385 ◽
Author(s):  
Irais Mora-Ochomogo ◽  
Marco Serrato ◽  
Jaime Mora-Vargas ◽  
Raha Akhavan-Tabatabaei

Natural disasters represent a latent threat for every country in the world. Due to climate change and other factors, statistics show that they continue to be on the rise. This situation challenges communities and humanitarian organizations to be better prepared and to react faster to natural disasters. In some countries, in-kind donations represent a high percentage of the supply for the operations, which presents additional challenges. This research proposes a Markov Decision Process (MDP) model to represent operations in collection centers, where in-kind donations are received, sorted, packed, and sent to the affected areas. The decision addressed is when to send a shipment, considering the uncertainty of the donations' supply and the demand, as well as the logistics costs and the penalty for unsatisfied demand. As a result of the MDP, a Monotone Optimal Non-Decreasing Policy (MONDP) is proposed, which provides valuable insights for decision-makers within this field. Moreover, the necessary conditions to prove the existence of such a MONDP are presented.
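To make the monotone-threshold structure concrete, here is a minimal value-iteration sketch of the shipment decision: the state is accumulated donation inventory, the action is hold or ship, and the optimal policy comes out as a run of holds followed by ships past a threshold. The costs, donation distribution, and the omission of the demand side with its shortage penalty are all simplifying assumptions, not the paper's model.

```python
import numpy as np

def collection_center_mdp(max_inv=50, ship_cost=100.0, hold_cost=1.0,
                          gamma=0.95):
    """Value iteration for a stripped-down shipment MDP: state = accumulated
    donation inventory, action 0 = hold, 1 = ship everything. The paper's
    demand side and unsatisfied-demand penalty are omitted for brevity."""
    p_donation = np.full(6, 1 / 6)                 # 0..5 donated units/epoch
    V = np.zeros(max_inv + 1)
    policy = np.zeros(max_inv + 1, dtype=int)
    for _ in range(1000):
        def future(inv):                           # E[V(next)] after arrivals
            return sum(p * V[min(inv + d, max_inv)]
                       for d, p in enumerate(p_donation))
        V_new = np.empty_like(V)
        for s in range(max_inv + 1):
            hold = hold_cost * s + gamma * future(s)
            ship = ship_cost + gamma * future(0)
            V_new[s] = min(hold, ship)
            policy[s] = int(ship < hold)
        if np.max(np.abs(V_new - V)) < 1e-8:
            break
        V = V_new
    return V, policy   # policy is monotone non-decreasing in inventory
```

Because the holding cost increases with the inventory level while the shipping cost is constant, the ship action becomes optimal past a single crossing point; this is the monotone structure the paper establishes under more general conditions.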

