Globalized Dual Heuristic Dynamic Programming in Control of Robotic Manipulator

2016 ◽  
Vol 817 ◽  
pp. 150-161 ◽  
Author(s):  
Marcin Szuster ◽  
Piotr Gierlak

The article focuses on the implementation of the globalized dual-heuristic dynamic programming algorithm in the discrete tracking control system of the three degrees of freedom robotic manipulator. The globalized dual-heuristic dynamic programming algorithm is included in the approximate dynamic programming algorithms family, that bases on the Bellman’s dynamic programming idea. These algorithms generally consist of the actor and the critic structures realized in a form of artificial neural networks. Moreover, the control system includes the PD controller, the supervisory term and an additional control signal. The structure of the supervisory term derives from the stability analysis, which was realized using the Lyapunov stability theorem. The control system works on-line and the neural networks’ weight adaptation process is realized in every iteration step. A series of computer simulations was realized in Matlab/Simulink software to confirm performance of the control system.

2014 ◽  
Vol 2014 ◽  
pp. 1-16 ◽  
Author(s):  
Marcin Szuster ◽  
Zenon Hendzel

Network-based control systems have been emerging technologies in the control of nonlinear systems over the past few years. This paper focuses on the implementation of the approximate dynamic programming algorithm in the network-based tracking control system of the two-wheeled mobile robot, Pioneer 2-DX. The proposed discrete tracking control system consists of the globalised dual heuristic dynamic programming algorithm, the PD controller, the supervisory term, and an additional control signal. The structure of the supervisory term derives from the stability analysis realised using the Lyapunov stability theorem. The globalised dual heuristic dynamic programming algorithm consists of two structures: the actor and the critic, realised in a form of neural networks. The actor generates the suboptimal control law, while the critic evaluates the realised control strategy by approximation of value function from the Bellman’s equation. The presented discrete tracking control system works online, the neural networks’ weights adaptation process is realised in every iteration step, and the neural networks preliminary learning procedure is not required. The performance of the proposed control system was verified by a series of computer simulations and experiments realised using the wheeled mobile robot Pioneer 2-DX.


Author(s):  
A. Meghdari ◽  
H. Sayyaadi

Abstract An optimization technique based on the well known Dynamic Programming Algorithm is applied to the motion control trajectories and path planning of multi-jointed fingers in dextrous hand designs. A three fingered hand with each finger containing four degrees of freedom is considered for analysis. After generating the kinematics and dynamics equations of such a hand, optimum values of the joints torques and velocities are computed such that the finger-tips of the hand are moved through their prescribed trajectories with the least time or/and energy to reach the object being grasped. Finally, optimal as well as feasible solutions for the multi-jointed fingers are identified and the results are presented.


2013 ◽  
Vol 210 ◽  
pp. 206-214
Author(s):  
Andrzej Burghardt ◽  
Marcin Szuster

This paper presents a new approach to the control problem of the ball and beam system, with a Neuro-Dynamic Programming algorithm implemented as the main part of the control system. The controlled system is included in the group of underactuated systems, which are nonlinear dynamical objects with the number of control signals smaller than the number of degrees of freedom. This results in problems in the formulation of a stable control algorithm, that guarantees stabilization of the ball in the desired position on the beam. The type of ball and beam material has a noticeable influence on the difficulties in stabilization of the ball, because of a smaller rolling friction and big inertia of the used metallic ball in comparison to other, for example made of non-metallic materials. The main part of the proposed discrete control system is the Neuro-Dynamic Programming algorithm in a Dual-Heuristic Dynamic Programming configuration, realized in a form of two neural networks: the actor and the critic. Neuro-Dynamic Programming algorithms use the Reinforcement Learning idea for adaptation of artificial neural network weights. Additional elements of the control system are the PD controller and the supervisory term, that ensures stability of the closed system loop. The control algorithm works on-line and does not require a preliminary learning phase of the neural network weights. Performance of the control algorithm was verified using the physical system controlled by the dSpace digital signal processing board.


Robotica ◽  
1992 ◽  
Vol 10 (5) ◽  
pp. 419-426 ◽  
Author(s):  
Ali Meghdari ◽  
Hassan Sayyaadi

SUMMARYAn optimization technique based on the well known Dynamic Programming Algorithm is applied to the motion control trajectories and path planning of multi-jointed fingers in dextrous hand designs. A three-fingered hand with each finger containing four degrees of freedom is considered for analysis. After generating the kinematics and dynamics equations of such a hand, optimum values of the joints torques and velocities are computed such that the finger-tips of the hand are moved through their prescribed trajectories with the least time or/and energy to reach the object being grasped. Finally, optimal as well as feasible solutions for the multi-jointed fingers are identified and the results are presented.


Sign in / Sign up

Export Citation Format

Share Document