A New Reinforcement Learning Algorithm With Fixed Exploration for Semi-Markov Control in Preventive Maintenance
Keyword(s):
Artificial intelligence techniques can play a significant role in solving problems encountered in the domain of Total Productive Maintenance (TPM). This paper considers a new reinforcement learning algorithm called iSMART, which can solve semi-Markov decision processes underlying control problems related to TPM. The algorithm uses a constant exploration rate, unlike its precursor R-SMART, which required exploration decay. Numerical experiments conducted here show encouraging behavior with the new algorithm.
Keyword(s):
2015 ◽
Vol 28
(4)
◽
pp. 1733-1744
◽
2017 ◽
pp. 768-777
Keyword(s):
Keyword(s):