An Improved Reinforcement Learning Based Heuristic Dynamic Programming Algorithm for Model-Free Optimal Control

This article focuses on the implementation of the globalized dual-heuristic dynamic programming (GDHP) algorithm in the discrete tracking control system of a robotic manipulator with three degrees of freedom. GDHP belongs to the family of approximate dynamic programming algorithms, which are based on Bellman's idea of dynamic programming. These algorithms generally consist of actor and critic structures implemented as artificial neural networks. In addition to the GDHP algorithm, the control system includes a PD controller, a supervisory term, and an additional control signal. The structure of the supervisory term is derived from a stability analysis based on the Lyapunov stability theorem. The control system works online, and the neural network weights are adapted at every iteration step. A series of computer simulations was carried out in Matlab/Simulink to confirm the performance of the control system.
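For orientation, the Bellman recursion that approximate dynamic programming methods of this kind build on can be written as below. This is a generic textbook form under assumed notation (state x_k, control u_k, local cost L_C, discount factor γ), not the formulation used in the article. In GDHP-type schemes the critic approximates both the value function and its derivative with respect to the state, written here as λ.

```latex
% Assumed generic notation, not taken from the article:
% x_k state, u_k control, L_C local cost, \gamma \in (0, 1] discount factor.
V^{*}(x_k) = \min_{u_k} \bigl[ L_C(x_k, u_k) + \gamma V^{*}(x_{k+1}) \bigr],
\qquad
u_k^{*} = \arg\min_{u_k} \bigl[ L_C(x_k, u_k) + \gamma V^{*}(x_{k+1}) \bigr],
\qquad
\lambda(x_k) = \frac{\partial V^{*}(x_k)}{\partial x_k}.
```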

A new approximate dynamic programming algorithm based on an actor–critic framework for optimal control of alkali–surfactant–polymer flooding

Engineering Optimization; DOI: 10.1080/0305215x.2019.1570180; 2019; Vol 51 (12); pp. 2147-2168

Author(s): Shurong Li, Lu Han, Yulei Ge, Yuhuan Shi

Keyword(s): Optimal Control, Dynamic Programming, Approximate Dynamic Programming, Dynamic Programming Algorithm, Polymer Flooding, Programming Algorithm

A quasi-newton differential dynamic programming algorithm for discrete-time optimal control

Automatica; DOI: 10.1016/0005-1098(87)90031-8; 1987; Vol 23 (6); pp. 749-752; Cited By ~ 15

Author(s): S. Sen, S.J. Yakowitz

Keyword(s): Optimal Control, Dynamic Programming, Discrete Time, Dynamic Programming Algorithm, Time Optimal Control, Programming Algorithm, Differential Dynamic Programming, Time Optimal, Quasi Newton

An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs

Information Sciences; DOI: 10.1016/j.ins.2012.07.006; 2013; Vol 220; pp. 331-342; Cited By ~ 94

Author(s): Derong Liu, Ding Wang, Xiong Yang

Keyword(s): Optimal Control, Dynamic Programming, Nonlinear Systems, Discrete Time, Dynamic Programming Algorithm, Adaptive Dynamic Programming, Programming Algorithm, Adaptive Dynamic, Constrained Inputs

Discrete Globalised Dual Heuristic Dynamic Programming in Control of the Two-Wheeled Mobile Robot

Mathematical Problems in Engineering; DOI: 10.1155/2014/628798; 2014; Vol 2014; pp. 1-16; Cited By ~ 5

Author(s): Marcin Szuster, Zenon Hendzel

Keyword(s): Neural Networks, Dynamic Programming, Control System, Mobile Robot, Tracking Control, Dynamic Programming Algorithm, Wheeled Mobile Robot, Programming Algorithm, The Neural Networks, Heuristic Dynamic Programming

Network-based control systems have emerged over the past few years as a technology for the control of nonlinear systems. This paper focuses on the implementation of an approximate dynamic programming algorithm in the network-based tracking control system of the two-wheeled mobile robot Pioneer 2-DX. The proposed discrete tracking control system consists of the globalised dual heuristic dynamic programming algorithm, a PD controller, a supervisory term, and an additional control signal. The structure of the supervisory term is derived from a stability analysis based on the Lyapunov stability theorem. The globalised dual heuristic dynamic programming algorithm consists of two structures, the actor and the critic, both realised as neural networks. The actor generates the suboptimal control law, while the critic evaluates the realised control strategy by approximating the value function from Bellman's equation. The presented discrete tracking control system works online: the neural network weights are adapted at every iteration step, and no preliminary learning procedure for the neural networks is required. The performance of the proposed control system was verified by a series of computer simulations and by experiments on the wheeled mobile robot Pioneer 2-DX.
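The interplay of actor, critic, and feedback term described in this abstract can be illustrated with a deliberately small sketch. It is not the authors' GDHP controller: the plant is a hypothetical scalar integrator rather than a mobile robot, a proportional gain `kp` stands in for the PD controller, the supervisory term and the additional control signal are omitted, and the critic is a single-weight quadratic approximator of the value function only (closer to plain heuristic dynamic programming than to GDHP). All names and parameter values are assumptions chosen for illustration.

```python
def run_actor_critic(steps=200, h=0.1, kp=2.0, r_u=0.1, gamma=0.95,
                     lr_critic=0.5, lr_actor=0.1):
    """Online actor-critic tracking of a constant set-point (toy example).

    Assumed plant: x[k+1] = x[k] + h * u[k].
    Critic: V(e) ~ w_c * e**2.  Actor: u_A = w_a * e.
    Total control: u = u_A - kp * e (proportional stand-in for the PD term).
    """
    x, x_ref = 0.0, 1.0
    w_c, w_a = 0.0, 0.0          # no preliminary learning: weights start at zero
    for _ in range(steps):
        e = x - x_ref
        u = w_a * e - kp * e     # actor signal plus stabilising feedback
        x_next = x + h * u
        e_next = x_next - x_ref

        # Local cost and temporal-difference error of the Bellman recursion.
        cost = e ** 2 + r_u * u ** 2
        td = cost + gamma * w_c * e_next ** 2 - w_c * e ** 2

        # Critic update: semi-gradient step on the quadratic feature e**2.
        w_c += lr_critic * td * e ** 2

        # Actor update: gradient of cost + gamma * V(e_next) w.r.t. u,
        # using the known plant sensitivity d(e_next)/du = h.
        grad_u = 2.0 * r_u * u + gamma * 2.0 * w_c * e_next * h
        w_a -= lr_actor * grad_u * e

        x = x_next
    return x - x_ref, w_c, w_a


if __name__ == "__main__":
    final_error, w_c, w_a = run_actor_critic()
    print(f"final error={final_error:.3e}, w_c={w_c:.3f}, w_a={w_a:.3f}")
```

Both weights are adapted at every step of the control loop, which mirrors the online adaptation the abstract describes. The feedback gain keeps the loop stable while the weights are still uninformed, which is the role the abstract assigns to the PD controller and the supervisory term.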

Data-driven iterative adaptive dynamic programming algorithm for approximate optimal control of unknown nonlinear systems

2014 International Joint Conference on Neural Networks (IJCNN); DOI: 10.1109/ijcnn.2014.6889467; 2014; Cited By ~ 5

Author(s): Hongliang Li, Derong Liu, Ding Wang, Chao Li

Keyword(s): Optimal Control, Dynamic Programming, Nonlinear Systems, Dynamic Programming Algorithm, Adaptive Dynamic Programming, Data Driven, Programming Algorithm, Adaptive Dynamic

A Hybrid Differential Dynamic Programming Algorithm for Constrained Optimal Control Problems. Part 1: Theory

Journal of Optimization Theory and Applications; DOI: 10.1007/s10957-012-0039-0; 2012; Vol 154 (2); pp. 382-417; Cited By ~ 33

Author(s): Gregory Lantoine, Ryan P. Russell

Keyword(s): Optimal Control, Dynamic Programming, Optimal Control Problems, Dynamic Programming Algorithm, Programming Algorithm, Control Problems, Differential Dynamic Programming, Constrained Optimal Control

An Improved Heuristic-Dynamic Programming Algorithm for Rectangular Cutting Problem

Parallel Architectures, Algorithms and Programming - Communications in Computer and Information Science; DOI: 10.1007/978-981-15-2767-8_21; 2020; pp. 221-233

Author(s): Aihua Yin, Chong Chen, Dongping Hu, Jianghai Huang, Fan Yang

Keyword(s): Dynamic Programming, Dynamic Programming Algorithm, Programming Algorithm, Heuristic Dynamic Programming, Cutting Problem

Stability analysis of heuristic dynamic programming algorithm for nonlinear systems

Neurocomputing; DOI: 10.1016/j.neucom.2014.08.046; 2015; Vol 149; pp. 1461-1468; Cited By ~ 6

Author(s): Tao Feng, Huaguang Zhang, Yanhong Luo, Jilie Zhang

Keyword(s): Dynamic Programming, Stability Analysis, Nonlinear Systems, Dynamic Programming Algorithm, Programming Algorithm, Heuristic Dynamic Programming
