Gradient Methods
Recently Published Documents

TOTAL DOCUMENTS: 1166 (FIVE YEARS: 300)
H-INDEX: 62 (FIVE YEARS: 7)

2022 · Vol 2022 (1)
Author(s): Zabidin Salleh, Adel Almarashi, Ahmad Alhawarat

Abstract: The conjugate gradient method can be applied in many fields, such as neural networks, image restoration, machine learning, and deep learning, among others. The Polak–Ribière–Polyak (PRP) and Hestenes–Stiefel conjugate gradient methods are considered among the most efficient methods for solving nonlinear optimization problems. However, neither method is guaranteed to satisfy the descent property or the global convergence property for general nonlinear functions. In this paper, we present two new modifications of the PRP method with restart conditions. The proposed conjugate gradient methods satisfy the global convergence property and the descent property for general nonlinear functions. The numerical results show that the new modifications are more efficient than recent CG methods in terms of the number of iterations, function evaluations, gradient evaluations, and CPU time.
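
As a concrete illustration of the PRP update and of the kind of restart condition the abstract refers to, here is a minimal NumPy sketch of a PRP conjugate gradient loop that falls back to steepest descent whenever the generated direction fails a descent check. The Armijo line search and the particular restart test are illustrative choices, not the authors' proposed modifications.

```python
# Minimal PRP conjugate gradient sketch with a simple restart condition
# (illustrative only; not the modified methods proposed in the paper).
import numpy as np

def prp_cg(f, grad, x0, max_iter=1000, tol=1e-6):
    x = np.asarray(x0, dtype=float).copy()
    g = grad(x)
    d = -g                                    # start with steepest descent
    for _ in range(max_iter):
        if np.linalg.norm(g) < tol:
            break
        # backtracking (Armijo) line search -- an illustrative step-size rule
        alpha, c, rho = 1.0, 1e-4, 0.5
        while f(x + alpha * d) > f(x) + c * alpha * g.dot(d):
            alpha *= rho
        x_new = x + alpha * d
        g_new = grad(x_new)
        # PRP coefficient: beta_k = g_{k+1}^T (g_{k+1} - g_k) / ||g_k||^2
        beta = g_new.dot(g_new - g) / g.dot(g)
        d_new = -g_new + beta * d
        # restart: fall back to steepest descent if d_new is not a descent direction
        if g_new.dot(d_new) >= 0:
            d_new = -g_new
        x, g, d = x_new, g_new, d_new
    return x

# usage: minimize the Rosenbrock function from a standard starting point
f = lambda x: (1 - x[0])**2 + 100 * (x[1] - x[0]**2)**2
grad = lambda x: np.array([-2 * (1 - x[0]) - 400 * x[0] * (x[1] - x[0]**2),
                           200 * (x[1] - x[0]**2)])
print(prp_cg(f, grad, np.array([-1.2, 1.0])))
```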


2022 · Vol 73 · pp. 117-171
Author(s): Adrien Bolland, Ioannis Boukas, Mathias Berger, Damien Ernst

We consider the joint design and control of discrete-time stochastic dynamical systems over a finite time horizon. We formulate the problem as a multi-step optimization problem under uncertainty, seeking to identify a system design and a control policy that jointly maximize the expected sum of rewards collected over the time horizon considered. The transition function, the reward function, and the policy are all parametrized, assumed known, and differentiable with respect to their parameters. We then introduce a deep reinforcement learning algorithm combining policy gradient methods with model-based optimization techniques to solve this problem. In essence, our algorithm iteratively approximates the gradient of the expected return via Monte Carlo sampling and automatic differentiation and takes projected gradient ascent steps in the space of environment and policy parameters. This algorithm is referred to as Direct Environment and Policy Search (DEPS). We assess the performance of our algorithm in three environments concerned with the design and control of a mass-spring-damper system, a small-scale off-grid power system, and a drone, respectively. In addition, our algorithm is benchmarked against a state-of-the-art deep reinforcement learning algorithm used to tackle joint design and control problems. We show that DEPS performs at least as well as, or better than, this benchmark in all three environments, consistently yielding solutions with higher returns in fewer iterations. Finally, solutions produced by our algorithm are also compared with solutions produced by an algorithm that does not jointly optimize environment and policy parameters, highlighting the fact that higher returns can be achieved when joint optimization is performed.
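
The core mechanics described above, differentiating a Monte Carlo estimate of the return through a known parametrized model and taking projected gradient ascent steps jointly over design and policy parameters, can be sketched as follows. This is a hypothetical toy setup in PyTorch with a made-up unit-mass mass-spring-damper model, a linear state-feedback policy, a quadratic cost, and box constraints on the design; it is not the DEPS implementation.

```python
# Toy sketch of joint design-and-control optimization: Monte Carlo return estimate,
# automatic differentiation, and projected gradient ascent on design + policy parameters.
# All modeling choices below are illustrative assumptions.
import torch

T, dt, n_rollouts = 50, 0.05, 32
design = torch.tensor([1.0, 0.5], requires_grad=True)    # [spring stiffness k, damping c]
policy = torch.zeros(2, requires_grad=True)               # linear state-feedback gains
opt = torch.optim.Adam([design, policy], lr=1e-2)

def expected_return():
    # Monte Carlo estimate over random initial states (the source of stochasticity here)
    pos = torch.randn(n_rollouts)
    vel = torch.zeros(n_rollouts)
    ret = 0.0
    k, c = design[0], design[1]
    for _ in range(T):
        u = policy[0] * pos + policy[1] * vel              # control action
        acc = -k * pos - c * vel + u                        # unit-mass dynamics
        vel = vel + dt * acc
        pos = pos + dt * vel
        ret = ret - (pos**2 + 0.1 * u**2).mean()            # negative quadratic cost as reward
    return ret

for step in range(200):
    opt.zero_grad()
    loss = -expected_return()                               # ascent on return = descent on -return
    loss.backward()
    opt.step()
    with torch.no_grad():                                    # projection onto a feasible design box
        design.clamp_(min=0.1, max=10.0)

print(design.detach(), policy.detach())
```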


2022 · pp. 61-82
Author(s): J J McKeown, D Meegan, D Sprevak
Keyword(s):

2022 · pp. 83-104
Author(s): J J McKeown, D Meegan, D Sprevak
Keyword(s):

Author(s): Eghbal Hosseini, Line Reinhardt, Danda B. Rawat

2021 · Vol 1 (2) · pp. 33-39
Author(s): Mónika Farsang, Luca Szegletes

Learning the optimal behavior is the ultimate goal in reinforcement learning. This can be achieved by many different approaches, the most successful of them being policy gradient methods. However, these methods can suffer from undesirably large policy updates, leading to poor performance. In recent years there has been a clear trend toward designing more reliable algorithms. This paper examines different restriction strategies applied to the widely used Proximal Policy Optimization (PPO-Clip) technique. We also investigate whether the analyzed methods are able to adapt not only to low-dimensional tasks but also to complex, high-dimensional problems in control and robotic domains. The analysis of the learned behavior shows that these methods can lead to better performance than the original PPO-Clip algorithm; moreover, they are also able to achieve complex behaviors and policies in high-dimensional environments.
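
For reference, here is a minimal PyTorch sketch of the standard PPO-Clip surrogate objective that such restriction strategies modify; the clipping range eps=0.2 and the dummy batch are illustrative assumptions, and any alternative restriction would replace the clipping step.

```python
# Minimal sketch of the PPO-Clip surrogate loss (illustrative, not the paper's variants).
import torch

def ppo_clip_loss(log_probs_new, log_probs_old, advantages, eps=0.2):
    ratio = torch.exp(log_probs_new - log_probs_old)        # pi_new(a|s) / pi_old(a|s)
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1.0 - eps, 1.0 + eps) * advantages
    # pessimistic bound: take the minimum, then negate because optimizers minimize
    return -torch.min(unclipped, clipped).mean()

# usage with a dummy batch
lp_new = torch.randn(64, requires_grad=True)
lp_old = torch.randn(64)
adv = torch.randn(64)
loss = ppo_clip_loss(lp_new, lp_old, adv)
loss.backward()
```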


2021
Author(s): Shicong Cen, Chen Cheng, Yuxin Chen, Yuting Wei, Yuejie Chi

Preconditioning and Regularization Enable Faster Reinforcement Learning
Natural policy gradient (NPG) methods, in conjunction with entropy regularization to encourage exploration, are among the most popular policy optimization algorithms in contemporary reinforcement learning. Despite the empirical success, the theoretical underpinnings for NPG methods remain severely limited. In “Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization”, Cen, Cheng, Chen, Wei, and Chi develop nonasymptotic convergence guarantees for entropy-regularized NPG methods under softmax parameterization, focusing on tabular discounted Markov decision processes. Assuming access to exact policy evaluation, the authors demonstrate that the algorithm converges linearly at a rate that is independent of the dimension of the state-action space. Moreover, the algorithm is provably stable vis-à-vis inexactness of policy evaluation. Accommodating a wide range of learning rates, this convergence result highlights the role of preconditioning and regularization in enabling fast convergence.
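
To make the setting concrete, below is a minimal NumPy sketch of entropy-regularized NPG under softmax parameterization on a made-up random tabular MDP, assuming exact evaluation of the soft Q-function. The multiplicative update form and the step-size range used here follow the standard softmax-parameterization derivation; treat them as an illustration of the setting rather than a verbatim restatement of the paper.

```python
# Toy sketch: entropy-regularized NPG (softmax parameterization) on a random tabular MDP,
# with exact soft-Q evaluation. MDP, tau, and eta below are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
S, A, gamma, tau = 5, 3, 0.9, 0.1
eta = (1 - gamma) / (2 * tau)                 # a step size in the assumed range eta <= (1-gamma)/tau
P = rng.dirichlet(np.ones(S), size=(S, A))    # P[s, a] is the next-state distribution
R = rng.random((S, A))                        # rewards in [0, 1)

def soft_q(pi, iters=500):
    # exact evaluation of the entropy-regularized Q-function by fixed-point iteration
    Q = np.zeros((S, A))
    for _ in range(iters):
        V = (pi * (Q - tau * np.log(pi))).sum(axis=1)      # soft state value
        Q = R + gamma * P @ V
    return Q

pi = np.full((S, A), 1.0 / A)                 # uniform initial policy
for t in range(50):
    Q = soft_q(pi)
    # multiplicative NPG update: pi^(1 - eta*tau/(1-gamma)) * exp(eta * Q / (1-gamma)), renormalized
    logits = (1 - eta * tau / (1 - gamma)) * np.log(pi) + eta * Q / (1 - gamma)
    pi = np.exp(logits - logits.max(axis=1, keepdims=True))
    pi /= pi.sum(axis=1, keepdims=True)

print(np.round(pi, 3))                        # converges to the regularized optimal policy
```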

