Some Essential Programming Algorithms

1997 ◽  
pp. 471-532
Author(s):  
David R. Brooks
Author(s):  
Ming-Sheng Ying ◽  
Yuan Feng ◽  
Sheng-Gang Ying

AbstractMarkov decision process (MDP) offers a general framework for modelling sequential decision making where outcomes are random. In particular, it serves as a mathematical framework for reinforcement learning. This paper introduces an extension of MDP, namely quantum MDP (qMDP), that can serve as a mathematical model of decision making about quantum systems. We develop dynamic programming algorithms for policy evaluation and finding optimal policies for qMDPs in the case of finite-horizon. The results obtained in this paper provide some useful mathematical tools for reinforcement learning techniques applied to the quantum world.


Author(s):  
Mohammad Mahdi Javanmard ◽  
Zafar Ahmad ◽  
Jaroslaw Zola ◽  
Louis-Noel Pouchet ◽  
Rezaul Chowdhury ◽  
...  

Author(s):  
José L. Montaña ◽  
César L. Alonso ◽  
Cruz Enrique Borges ◽  
Javier de la Dehesa

Sign in / Sign up

Export Citation Format

Share Document