Design and Comparison of Reinforcement-Learning-Based Time-Varying PID Controllers with Gain-Scheduled Actions
Keyword(s):
This paper presents innovative reinforcement learning methods for automatically tuning the parameters of a proportional integral derivative controller. Conventionally, the high dimension of the Q-table is a primary drawback when implementing a reinforcement learning algorithm. To overcome the obstacle, the idea underlying the n-armed bandit problem is used in this paper. Moreover, gain-scheduled actions are presented to tune the algorithms to improve the overall system behavior; therefore, the proposed controllers fulfill the multiple performance requirements. An experiment was conducted for the piezo-actuated stage to illustrate the effectiveness of the proposed control designs relative to competing algorithms.
2020 ◽
Vol 34
(04)
◽
pp. 6518-6525
2011 ◽
Vol 5
(18)
◽
pp. 2142-2155
◽
2018 ◽
Vol 140
(7)
◽
2012 ◽
Vol 45
(3)
◽
pp. 199-204
◽
Keyword(s):
2008 ◽
Vol 41
(2)
◽
pp. 14564-14569
◽
Keyword(s):