Simulation of partially observed Markov decision process and dynamic quality improvement

1997, Vol 32 (4), pp. 691-700
Author(s): Nancy Gautreau, Soumaya Yacout, Réjean Hall
Author(s): Kazuteru Miyazaki, Shigenobu Kobayashi

Exploitation-oriented learning (XoL) is a novel approach to goal-directed learning from interaction. Whereas reinforcement learning focuses on learning and ensures optimality in Markov decision process (MDP) environments, XoL aims to learn a rational policy that obtains rewards continuously and quickly. PS-r*, a form of XoL, learns a useful rational policy that is not inferior to a random walk in partially observable Markov decision process (POMDP) environments with a single type of reward. PS-r*, however, requires O(MN²) memory, where N is the number of sensory input types and M is the number of action types. We propose PS-r#, which learns a useful rational policy in the POMDP using only O(MN) memory. The effectiveness of PS-r# is confirmed through numerical examples.
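To make the stated memory bounds concrete, the following is a minimal illustrative sketch (not the PS-r* or PS-r# algorithms themselves): it only counts the table entries implied by an O(MN²) structure, e.g. one entry per (action, sensory input, sensory input) triple, versus an O(MN) structure with one entry per (action, sensory input) pair. The function names and the interpretation of the table layouts are assumptions for illustration.

```python
def entries_ps_r_star(n_inputs: int, n_actions: int) -> int:
    """Hypothetical O(M N^2) table size attributed to PS-r*."""
    return n_actions * n_inputs ** 2


def entries_ps_r_sharp(n_inputs: int, n_actions: int) -> int:
    """Hypothetical O(M N) table size attributed to PS-r#."""
    return n_actions * n_inputs


# With N = 1000 sensory input types and M = 10 action types, the
# quadratic table is a factor of N = 1000 larger than the linear one.
N, M = 1000, 10
print(entries_ps_r_star(N, M))   # 10_000_000 entries
print(entries_ps_r_sharp(N, M))  # 10_000 entries
```

The gap grows linearly in N, which is why reducing the bound from O(MN²) to O(MN) matters as the sensory input space gets large.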
