Negative binomial sums of random variables and discounted reward processes
Keyword(s):
Given a sequence of random variables (rewards), the Haviv–Puterman differential equation relates the expected infinite-horizon λ-discounted reward and the expected total reward up to a random time that is determined by an independent negative binomial random variable with parameters 2 and λ. This paper provides an interpretation of this proven, but previously unexplained, result. Furthermore, the interpretation is formalized into a new proof, which then yields new results for the general case where the rewards are accumulated up to a time determined by an independent negative binomial random variable with parameters k and λ.
Keyword(s):
2020 ◽
Vol 22
(4)
◽
pp. 415-421
2005 ◽
Vol 2005
(5)
◽
pp. 717-728
◽
2014 ◽
Vol 26
(2)
◽
pp. 931-947
◽
2019 ◽
Vol 2019
(1)
◽
2004 ◽
Vol 18
(4)
◽
pp. 473-484
◽