Globalized Dual Heuristic Dynamic Programming in Control of Robotic Manipulator

The article focuses on the implementation of the globalized dual-heuristic dynamic programming algorithm in the discrete tracking control system of a robotic manipulator with three degrees of freedom. The globalized dual-heuristic dynamic programming algorithm belongs to the family of approximate dynamic programming algorithms, which is based on Bellman’s dynamic programming idea. These algorithms generally consist of actor and critic structures realized in the form of artificial neural networks. In addition to the algorithm, the control system includes a PD controller, a supervisory term and an additional control signal. The structure of the supervisory term derives from a stability analysis carried out using the Lyapunov stability theorem. The control system works on-line, and the neural networks’ weights are adapted in every iteration step. A series of computer simulations was carried out in Matlab/Simulink to confirm the performance of the control system.
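As a rough illustration of the PD part of such a discrete tracking control system, the sketch below tracks a sinusoidal reference with a single joint. It is not the authors' controller: the unit-inertia joint model, the gains `kp` and `kd`, the step size `h` and the reference trajectory are all assumptions chosen for the example, and the actor, critic and supervisory term are left out.

```python
import math

def simulate_pd_tracking(kp=100.0, kd=20.0, h=0.01, t_end=10.0):
    """Discrete PD tracking of q_d(t) = sin(t) for one joint.

    Assumed (hypothetical) plant: unit-inertia joint, q'' = u.
    Returns a list of (time, tracking error) pairs.
    """
    q, dq = 0.0, 0.0          # joint position and velocity
    errors = []
    n = int(round(t_end / h))
    for k in range(n):
        t = k * h
        q_d, dq_d = math.sin(t), math.cos(t)   # desired position and velocity
        e, de = q_d - q, dq_d - dq             # tracking errors
        u = kp * e + kd * de                   # PD control signal
        # Semi-implicit Euler step: velocity first, then position.
        dq += h * u
        q += h * dq
        errors.append((t, e))
    return errors

errors = simulate_pd_tracking()
late_max = max(abs(e) for t, e in errors if t >= 5.0)
print(f"max |e| after t = 5 s: {late_max:.4f}")
```

With these assumed gains the error stays small but does not vanish, because a pure PD law has no feedforward term; in the control system described above, the neural networks and the additional signals are what compensate for the manipulator's nonlinearities.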

Discrete Globalised Dual Heuristic Dynamic Programming in Control of the Two-Wheeled Mobile Robot

Mathematical Problems in Engineering ◽

10.1155/2014/628798 ◽

2014 ◽

Vol 2014 ◽

pp. 1-16 ◽

Cited By ~ 5

Author(s):

Marcin Szuster ◽

Zenon Hendzel

Keyword(s):

Neural Networks ◽

Dynamic Programming ◽

Control System ◽

Mobile Robot ◽

Tracking Control ◽

Dynamic Programming Algorithm ◽

Wheeled Mobile Robot ◽

Programming Algorithm ◽

The Neural Networks ◽

Heuristic Dynamic Programming

Network-based control systems have emerged over the past few years as a technology for the control of nonlinear systems. This paper focuses on the implementation of an approximate dynamic programming algorithm in the network-based tracking control system of the two-wheeled mobile robot Pioneer 2-DX. The proposed discrete tracking control system consists of the globalised dual heuristic dynamic programming algorithm, a PD controller, a supervisory term, and an additional control signal. The structure of the supervisory term derives from a stability analysis realised using the Lyapunov stability theorem. The globalised dual heuristic dynamic programming algorithm consists of two structures, the actor and the critic, both realised in the form of neural networks. The actor generates the suboptimal control law, while the critic evaluates the realised control strategy by approximating the value function from Bellman’s equation. The presented discrete tracking control system works online: the neural networks’ weights are adapted in every iteration step, and no preliminary learning procedure for the neural networks is required. The performance of the proposed control system was verified by a series of computer simulations and by experiments realised using the wheeled mobile robot Pioneer 2-DX.
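To make the critic's role concrete, the following minimal sketch adapts a one-weight critic online so that it satisfies Bellman's equation V(x_k) = L(x_k) + γ V(x_{k+1}). It is a simplification, not the paper's algorithm: the scalar closed-loop system x_{k+1} = a·x_k, the cost L(x) = x², the feature x², and the values of `a`, `gamma` and `alpha` are assumptions, and only the value function is approximated, with no actor network.

```python
def train_critic(a=0.8, gamma=0.9, alpha=0.5, sweeps=200):
    """Online temporal-difference adaptation of a one-weight critic.

    Assumed (hypothetical) setup: closed-loop system x_next = a * x,
    stage cost x**2, critic V_hat(x) = w * x**2.
    """
    w = 0.0                                 # no preliminary learning
    states = [-1.0, -0.5, 0.5, 1.0]         # states visited in each sweep
    for _ in range(sweeps):
        for x in states:
            x_next = a * x
            cost = x * x
            # Bellman residual: L(x) + gamma * V_hat(x_next) - V_hat(x)
            td_error = cost + gamma * w * x_next ** 2 - w * x ** 2
            # Semi-gradient step; dV_hat/dw = x**2
            w += alpha * td_error * x * x
    return w

w = train_critic()
w_exact = 1.0 / (1.0 - 0.9 * 0.8 ** 2)      # closed form for this toy problem
print(w, w_exact)
```

For this toy problem the exact value function is V(x) = x² / (1 − γa²), so the learned weight can be checked against the closed form. A neural-network critic replaces the single weight with a parametrised approximator, but the residual being driven to zero is the same.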

Optimization in Trajectory Planning of Multi-Jointed Fingers in Dextrous Hand Designs

ASME 1991 Computers in Engineering Conference: Volume 2 — Finite Elements/Computational Geometry; Computers in Education; Robotics and Controls ◽

10.1115/cie1991-0160 ◽

1991 ◽

Author(s):

A. Meghdari ◽

H. Sayyaadi

Keyword(s):

Dynamic Programming ◽

Motion Control ◽

Trajectory Planning ◽

Degrees Of Freedom ◽

Optimization Technique ◽

Dynamic Programming Algorithm ◽

Programming Algorithm ◽

Kinematics And Dynamics ◽

Feasible Solutions ◽

Dextrous Hand

An optimization technique based on the well-known dynamic programming algorithm is applied to the motion control trajectories and path planning of multi-jointed fingers in dextrous hand designs. A three-fingered hand in which each finger has four degrees of freedom is considered for analysis. After the kinematics and dynamics equations of such a hand are generated, optimum values of the joint torques and velocities are computed such that the fingertips of the hand move along their prescribed trajectories and reach the object being grasped in the least time and/or with the least energy. Finally, optimal as well as feasible solutions for the multi-jointed fingers are identified and the results are presented.
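The idea of computing an optimal, feasible joint trajectory by dynamic programming can be sketched for a single joint on a discretized angle grid. This is an illustrative toy, not the authors' formulation: the grid, the squared step size as a stand-in for energy, and the `max_step` velocity limit are assumptions, and the real problem involves the full finger kinematics and dynamics.

```python
import math

def plan_joint_trajectory(n_grid, n_steps, start, goal, max_step):
    """Backward dynamic programming over a discretized joint angle.

    cost_to_go[k][i] is the minimum remaining cost when the joint is at
    grid index i at stage k. The stage cost (j - i)**2 is an assumed
    proxy for the energy of moving from index i to index j in one step.
    """
    inf = math.inf
    cost_to_go = [[inf] * n_grid for _ in range(n_steps + 1)]
    best_next = [[None] * n_grid for _ in range(n_steps)]
    cost_to_go[n_steps][goal] = 0.0       # the trajectory must end at the goal

    for k in range(n_steps - 1, -1, -1):
        for i in range(n_grid):
            lo = max(0, i - max_step)     # feasibility: bounded step size
            hi = min(n_grid - 1, i + max_step)
            for j in range(lo, hi + 1):
                cost = (j - i) ** 2 + cost_to_go[k + 1][j]
                if cost < cost_to_go[k][i]:
                    cost_to_go[k][i] = cost
                    best_next[k][i] = j

    if math.isinf(cost_to_go[0][start]):
        return None, inf                  # no feasible trajectory exists
    path = [start]
    for k in range(n_steps):
        path.append(best_next[k][path[-1]])
    return path, cost_to_go[0][start]

path, cost = plan_joint_trajectory(n_grid=11, n_steps=5, start=0, goal=10, max_step=3)
print(path, cost)
```

With a quadratic stage cost the optimum spreads the motion evenly over the available steps, while the step limit separates feasible from infeasible trajectories: if the goal cannot be reached within `n_steps`, the function reports that no solution exists.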

Nonlinear Control System Design Based on Newly Developed Dynamic Programming Algorithm

Design Methods of Control Systems ◽

10.1016/b978-0-08-041902-2.50064-7 ◽

1992 ◽

pp. 353-358

Author(s):

T. Hanaoka

Keyword(s):

Dynamic Programming ◽

Control System ◽

Nonlinear Control ◽

System Design ◽

Dynamic Programming Algorithm ◽

Nonlinear Control System ◽

Programming Algorithm ◽

Control System Design

Nonlinear Control System Design Based on Newly Developed Dynamic Programming Algorithm

IFAC Proceedings Volumes ◽

10.1016/s1474-6670(17)54195-6 ◽

1991 ◽

Vol 24 (8) ◽

pp. 353-358

Author(s):

T. Hanaoka

Keyword(s):

Dynamic Programming ◽

Control System ◽

Nonlinear Control ◽

System Design ◽

Dynamic Programming Algorithm ◽

Nonlinear Control System ◽

Programming Algorithm ◽

Control System Design

An Improved Heuristic-Dynamic Programming Algorithm for Rectangular Cutting Problem

Parallel Architectures, Algorithms and Programming - Communications in Computer and Information Science ◽

10.1007/978-981-15-2767-8_21 ◽

2020 ◽

pp. 221-233

Author(s):

Aihua Yin ◽

Chong Chen ◽

Dongping Hu ◽

Jianghai Huang ◽

Fan Yang

Keyword(s):

Dynamic Programming ◽

Dynamic Programming Algorithm ◽

Programming Algorithm ◽

Heuristic Dynamic Programming ◽

Cutting Problem

An Improved Reinforcement Learning Based Heuristic Dynamic Programming Algorithm for Model-Free Optimal Control

Artificial Neural Networks and Machine Learning – ICANN 2020 - Lecture Notes in Computer Science ◽

10.1007/978-3-030-61616-8_23 ◽

2020 ◽

pp. 282-294

Author(s):

Jia Li ◽

Zhaolin Yuan ◽

Xiaojuan Ban

Keyword(s):

Optimal Control ◽

Dynamic Programming ◽

Reinforcement Learning ◽

Dynamic Programming Algorithm ◽

Programming Algorithm ◽

Model Free ◽

Heuristic Dynamic Programming

Stability analysis of heuristic dynamic programming algorithm for nonlinear systems

Neurocomputing ◽

10.1016/j.neucom.2014.08.046 ◽

2015 ◽

Vol 149 ◽

pp. 1461-1468 ◽

Cited By ~ 6

Author(s):

Tao Feng ◽

Huaguang Zhang ◽

Yanhong Luo ◽

Jilie Zhang

Keyword(s):

Dynamic Programming ◽

Stability Analysis ◽

Nonlinear Systems ◽

Dynamic Programming Algorithm ◽

Programming Algorithm ◽

Heuristic Dynamic Programming

Neuro-Dynamic Programming in Control of the Ball and Beam System

Solid State Phenomena ◽

10.4028/www.scientific.net/ssp.210.206 ◽

2013 ◽

Vol 210 ◽

pp. 206-214

Author(s):

Andrzej Burghardt ◽

Marcin Szuster

Keyword(s):

Neural Network ◽

Dynamic Programming ◽

Control System ◽

Control Algorithm ◽

Main Part ◽

Dynamic Programming Algorithm ◽

Discrete Control ◽

Programming Algorithm ◽

Beam System ◽

Ball And Beam System

This paper presents a new approach to the control problem of the ball and beam system, with a Neuro-Dynamic Programming algorithm implemented as the main part of the control system. The controlled system belongs to the group of underactuated systems, which are nonlinear dynamical objects with fewer control signals than degrees of freedom. This makes it difficult to formulate a stable control algorithm that guarantees stabilization of the ball in the desired position on the beam. The materials of the ball and the beam noticeably affect how difficult the ball is to stabilize: the metallic ball used here has lower rolling friction and greater inertia than balls made of other, for example non-metallic, materials. The main part of the proposed discrete control system is the Neuro-Dynamic Programming algorithm in a Dual-Heuristic Dynamic Programming configuration, realized in the form of two neural networks: the actor and the critic. Neuro-Dynamic Programming algorithms use the Reinforcement Learning idea to adapt the weights of artificial neural networks. Additional elements of the control system are the PD controller and the supervisory term, which ensures stability of the closed-loop system. The control algorithm works on-line and does not require a preliminary learning phase for the neural network weights. The performance of the control algorithm was verified on the physical system controlled by the dSpace digital signal processing board.

Optimizing motion trajectories of dextrous fingers by dynamic programming technique

Robotica ◽

10.1017/s0263574700010626 ◽

1992 ◽

Vol 10 (5) ◽

pp. 419-426 ◽

Cited By ~ 3

Author(s):

Ali Meghdari ◽

Hassan Sayyaadi

Keyword(s):

Dynamic Programming ◽

Motion Control ◽

Degrees Of Freedom ◽

Optimization Technique ◽

Dynamic Programming Algorithm ◽

Programming Algorithm ◽

Programming Technique ◽

Motion Trajectories ◽

Feasible Solutions ◽

Dextrous Hand

An optimization technique based on the well-known dynamic programming algorithm is applied to the motion control trajectories and path planning of multi-jointed fingers in dextrous hand designs. A three-fingered hand in which each finger has four degrees of freedom is considered for analysis. After the kinematics and dynamics equations of such a hand are generated, optimum values of the joint torques and velocities are computed such that the fingertips of the hand move along their prescribed trajectories and reach the object being grasped in the least time and/or with the least energy. Finally, optimal as well as feasible solutions for the multi-jointed fingers are identified and the results are presented.
