Expected average cost: Dynamic Programming and LP

2021 ◽  
pp. 165-179
Author(s):  
Eitan Altman
1980 ◽  
Vol 17 (04) ◽  
pp. 996-1003
Author(s):  
D. R. Robinson

It is known that when costs are unbounded satisfaction of the appropriate dynamic programming ‘optimality' equation by a policy is not sufficient to guarantee its average optimality. A ‘lowest-order potential' condition is introduced which, along with the dynamic programming equation, is sufficient to establish the optimality of the policy. Also, it is shown that under fairly general conditions, if the lowest-order potential condition is not satisfied there exists a non-memoryless policy with smaller average cost than the policy satisfying the dynamic programming equation.


Author(s):  
Antonis C. Kakas ◽  
David Cohn ◽  
Sanjoy Dasgupta ◽  
Andrew G. Barto ◽  
Gail A. Carpenter ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document