An Extension of the Gradient Algorithm

Author(s): Alexander J. Zaslavski

2021 · Vol 11 (2) · pp. 546
Author(s): Jiajia Xie, Rui Zhou, Yuan Liu, Jun Luo, Shaorong Xie, ...

The high performance and efficiency of multiple unmanned surface vehicles (multi-USV) promote further civilian and military applications of coordinated USVs. As the basis of cooperative work among multiple USVs, decentralized formation control of USV swarms has received considerable attention. Formation control of multiple USVs is a geometric problem for multi-robot systems, and the main challenge is how to generate and maintain the formation. The rapid development of reinforcement learning offers a new way to address these problems. In this paper, we introduce a decentralized structure for the multi-USV system and employ reinforcement learning for formation control in a leader-follower topology, proposing an asynchronous decentralized formation control scheme based on reinforcement learning for multiple USVs. First, a simplified USV model is established, and a formation shape model is built to provide formation parameters and to describe the physical relationships between USVs. Second, the advantage deep deterministic policy gradient (ADDPG) algorithm is proposed. Third, formation generation and formation maintenance policies based on ADDPG are proposed to form and maintain the given geometric structure of the USV team during movement. Moreover, three new reward functions are designed and used to promote policy learning. Finally, various experiments are conducted to validate the proposed formation control scheme; simulation results and comparison experiments demonstrate its efficiency and stability.
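
The abstract names the ADDPG algorithm and three reward functions without spelling out their form. As a minimal sketch of the leader-follower idea, the Python snippet below computes a hypothetical formation-maintenance reward from the follower's distance to its assigned slot in the leader's body frame; the slot offset, weights, and tolerance (`offset`, `w_dist`, `w_bonus`, `d_tol`) are illustrative assumptions, not the paper's design.

```python
import numpy as np

def formation_reward(follower_pos, leader_pos, offset, heading,
                     d_tol=0.5, w_dist=1.0, w_bonus=5.0):
    """Hypothetical leader-follower formation reward.

    The follower is rewarded for reaching a target point defined by a
    fixed offset from the leader, rotated into the leader's heading
    frame. This is a sketch, not the paper's actual reward design.
    """
    c, s = np.cos(heading), np.sin(heading)
    rot = np.array([[c, -s], [s, c]])           # leader body frame -> world frame
    target = leader_pos + rot @ offset          # desired slot in the formation
    err = np.linalg.norm(follower_pos - target) # distance to the slot
    reward = -w_dist * err                      # dense shaping term
    if err < d_tol:                             # sparse bonus once in position
        reward += w_bonus
    return reward

# Example: follower holding a slot 10 m astern and 5 m to port of the leader
r = formation_reward(np.array([90.0, 4.0]),    # follower position (m)
                     np.array([100.0, 0.0]),   # leader position (m)
                     np.array([-10.0, 5.0]),   # desired offset in leader frame
                     heading=0.0)
print(f"reward = {r:.2f}")
```

A reward of this shape gives a dense signal for closing on the assigned slot plus a sparse bonus for holding it, which is one common way to encourage both formation generation and maintenance.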


2021 · Vol 36
Author(s): Arushi Jain, Khimya Khetarpal, Doina Precup

Designing hierarchical reinforcement learning algorithms that exhibit safe behaviour is not only vital for practical applications but also facilitates a better understanding of an agent's decisions. We tackle this problem in the options framework (Sutton, Precup & Singh, 1999), a particular way to specify temporally abstract actions that allow an agent to use sub-policies with start and end conditions. We consider behaviour safe if it avoids regions of the state space with high uncertainty in the outcomes of actions. We propose an optimization objective that learns safe options by encouraging the agent to visit states with higher behavioural consistency. The proposed objective results in a trade-off between maximizing the standard expected return and minimizing the effect of model uncertainty on the return. We propose a policy gradient algorithm to optimize the constrained objective function. We examine the quantitative and qualitative behaviour of the proposed approach in a tabular grid world, a continuous-state puddle world, and three games from the Arcade Learning Environment: Ms. Pacman, Amidar, and Q*Bert. Our approach reduces the variance of the return, boosts performance in environments with intrinsic variability in the reward structure, and compares favourably both with primitive actions and with risk-neutral options.
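
The objective described here trades expected return against the effect of uncertainty on the return. The sketch below illustrates the generic variance-penalized policy gradient idea on a toy two-armed bandit, using the score-function identities grad E[G] = E[G grad log pi] and grad E[G^2] = E[G^2 grad log pi], so that grad Var[G] = grad E[G^2] - 2 E[G] grad E[G]. The bandit, the penalty weight `psi`, and the update rule are assumptions for illustration, not the paper's option-level construction.

```python
import numpy as np

rng = np.random.default_rng(0)

def pull(arm: int) -> float:
    # Arm 0: safe, always 0.8.  Arm 1: risky, 2.0 or 0.0 with equal
    # probability (mean 1.0, variance 1.0).
    return 0.8 if arm == 0 else (2.0 if rng.random() < 0.5 else 0.0)

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

theta = np.zeros(2)            # policy logits
psi, lr, batch = 2.0, 0.1, 512

for _ in range(300):
    pi = softmax(theta)
    arms = rng.choice(2, size=batch, p=pi)
    G = np.array([pull(a) for a in arms])
    # grad log pi(a) for a softmax policy is one_hot(a) - pi
    glp = np.eye(2)[arms] - pi
    grad_mean = (G[:, None] * glp).mean(axis=0)        # grad E[G]
    grad_sq = ((G ** 2)[:, None] * glp).mean(axis=0)   # grad E[G^2]
    grad_var = grad_sq - 2.0 * G.mean() * grad_mean    # grad Var[G]
    theta += lr * (grad_mean - psi * grad_var)         # ascend E[G] - psi*Var[G]

print("final policy:", softmax(theta))  # mass concentrates on the safe arm
```

With psi = 2.0 the risky arm's higher mean (1.0 versus 0.8) no longer compensates for its variance, so the learned policy concentrates on the safe arm, mirroring the trade-off between expected return and uncertainty described in the abstract.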

