INITIAL IMPROVEMENT OF THE HYBRID ACCELERATED GRADIENT DESCENT PROCESS

2018 ◽ Vol 98 (2) ◽ pp. 331-338
Author(s): Stefan Panić, Milena J. Petrović, Miroslava Mihajlov Carević

We improve the convergence properties of the iterative scheme for solving unconstrained optimisation problems introduced in Petrovic et al. [‘Hybridization of accelerated gradient descent method’, Numer. Algorithms (2017), doi:10.1007/s11075-017-0460-4] by optimising the value of the initial step length parameter in the backtracking line search procedure. We prove the validity of the algorithm and illustrate its advantages by numerical experiments and comparisons.
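To make the line-search setting concrete, the following is a minimal Python sketch of Armijo backtracking driven by an initial trial step t0, together with a plain gradient iteration that uses it; the function names, the constants sigma and beta, and the omission of the hybrid acceleration factor are illustrative assumptions, not the authors' exact scheme.

import numpy as np

def backtracking_step(f, grad_f, x, direction, t0=1.0, sigma=1e-4, beta=0.5):
    # Armijo backtracking started from the initial trial step t0; the cited
    # improvement concerns how t0 is chosen, which is left as a parameter here.
    fx = f(x)
    slope = np.dot(grad_f(x), direction)   # directional derivative along the search direction
    t = t0
    while f(x + t * direction) > fx + sigma * t * slope:
        t *= beta                          # shrink the trial step until the Armijo condition holds
    return t

def gradient_descent(f, grad_f, x0, iters=100, t0=1.0):
    # Plain iteration x_{k+1} = x_k - t_k * grad f(x_k) with backtracked t_k.
    x = np.asarray(x0, dtype=float)
    for _ in range(iters):
        g = grad_f(x)
        t = backtracking_step(f, grad_f, x, -g, t0=t0)
        x = x - t * g
    return x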

Filomat ◽ 2009 ◽ Vol 23 (3) ◽ pp. 23-36
Author(s): Predrag Stanimirovic, Marko Miladinovic, Snezana Djordjevic

We introduce an algorithm for unconstrained optimization based on reducing the modified Newton method with line search to a gradient descent method. The main idea in the construction of the algorithm is the approximation of the Hessian by a diagonal matrix. The step length calculation is based on a Taylor expansion at two successive iterative points and on the backtracking line search procedure.
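As a rough illustration of the diagonal Hessian idea described above, the sketch below fits a scalar gamma so that gamma*I matches a second-order Taylor expansion taken at two successive iterates; the helper name and the positive-curvature fallback are assumptions of the sketch, not the paper's exact formulas.

import numpy as np

def scalar_hessian_estimate(f, x_prev, x_new, g_prev, fallback=1.0):
    # Fit gamma in  f(x_new) ~ f(x_prev) + g_prev.(x_new - x_prev) + 0.5*gamma*||x_new - x_prev||^2,
    # i.e. approximate the Hessian by the diagonal matrix gamma * I.
    s = x_new - x_prev
    gamma = 2.0 * (f(x_new) - f(x_prev) - np.dot(g_prev, s)) / np.dot(s, s)
    return gamma if gamma > 0 else fallback   # keep the approximation positive definite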


2015 ◽ Vol 2015 ◽ pp. 1-8
Author(s): Predrag S. Stanimirović, Gradimir V. Milovanović, Milena J. Petrović, Nataša Z. Kontrec

A reduction of the original double step size iteration to a single step length scheme is derived under a proposed condition that relates the two step lengths in the accelerated double step size gradient descent scheme. The proposed transformation is tested numerically. The results confirm substantial progress over the classically defined single step size accelerated gradient descent method with respect to all analyzed characteristics: number of iterations, CPU time, and number of function evaluations. Linear convergence of the derived method is proved.
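Purely as an illustration of the kind of reduction described (the way the two step lengths are related below is an assumption for the sketch, not the condition derived in the paper), tying the two step lengths together collapses the double step size update into one with a single effective step length.

import numpy as np

def double_step_update(x, g, alpha, beta, gamma):
    # Double step size form: x_{k+1} = x_k - (alpha/gamma) * g - beta * g.
    return x - (alpha / gamma) * g - beta * g

def single_step_update(x, g, psi):
    # Once beta is expressed through alpha and gamma, the same point is reached
    # with the single step length psi = alpha/gamma + beta.
    return x - psi * g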


2019 ◽ Vol 9 (21) ◽ pp. 4568
Author(s): Hyeyoung Park, Kwanyong Lee

The gradient descent method is an essential algorithm for training neural networks. Among the many variants of gradient descent developed to accelerate learning, natural gradient learning is based on the theory of information geometry on the stochastic neuromanifold and is known to have ideal convergence properties. Despite its theoretical advantages, the pure natural gradient has limitations that prevent its practical use. To obtain the explicit value of the natural gradient, one must know the true probability distribution of the input variables and compute the inverse of a square matrix whose dimension equals the number of parameters. Although an adaptive estimation of the natural gradient has been proposed as a solution, it was originally developed for online learning mode, which is computationally inefficient for learning from large data sets. In this paper, we propose a novel adaptive natural gradient estimation for mini-batch learning mode, which is commonly adopted for big data analysis. For two representative stochastic neural network models, we present explicit parameter update rules and a learning algorithm. Through experiments on three benchmark problems, we confirm that the proposed method has better convergence properties than the conventional methods.
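A generic mini-batch sketch of the approach described above: the empirical Fisher matrix is averaged adaptively across mini-batches and then inverted to precondition the gradient. The decay and damping constants, and the use of the empirical rather than the true Fisher matrix, are assumptions of the sketch and not the update rules derived in the paper.

import numpy as np

def natural_gradient_step(theta, per_example_grads, F, lr=0.01, decay=0.95, damping=1e-4):
    # per_example_grads: per-example gradients for one mini-batch, shape (batch_size, n_params).
    g = per_example_grads.mean(axis=0)                                          # mini-batch gradient
    F_batch = per_example_grads.T @ per_example_grads / len(per_example_grads)  # empirical Fisher on this batch
    F = decay * F + (1.0 - decay) * F_batch                                     # adaptive running Fisher estimate
    nat_dir = np.linalg.solve(F + damping * np.eye(theta.size), g)              # damped solve for F^{-1} g
    return theta - lr * nat_dir, F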


2021 ◽ Vol 2021 ◽ pp. 1-7
Author(s): Shengwei Yao, Yuping Wu, Jielan Yang, Jieqiong Xu

We propose a three-term gradient descent method that is well suited to the optimization problems addressed in this article. The search direction of the method is generated in a specific subspace, using a quadratic approximation model. To reduce the amount of computation and make the best use of existing information, the subspace is spanned by the gradients at the current and previous iteration points and by the previous search direction. Using this subspace-based optimization technique, global convergence is established under the Wolfe line search. Numerical experiments show that the new method is effective and robust.
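A minimal sketch of generating a direction from such a three-dimensional subspace by minimizing a quadratic model over it; the identity default for the model matrix B and the least-squares solve are assumptions of the sketch, not the paper's construction.

import numpy as np

def subspace_direction(g_k, g_prev, d_prev, B=None):
    # Minimize m(d) = g_k^T d + 0.5 * d^T B d over d in span{ -g_k, g_prev, d_prev }.
    n = len(g_k)
    if B is None:
        B = np.eye(n)                               # simplest choice of quadratic model
    V = np.column_stack([-g_k, g_prev, d_prev])     # basis of the three-term subspace
    A = V.T @ B @ V
    rhs = -V.T @ g_k
    z = np.linalg.lstsq(A, rhs, rcond=None)[0]      # coefficients of the subspace combination
    return V @ z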


2017 ◽ Vol 79 (3) ◽ pp. 769-786
Author(s): Milena Petrović, Vladimir Rakočević, Nataša Kontrec, Stefan Panić, Dejan Ilić

Complexity ◽ 2020 ◽ Vol 2020 ◽ pp. 1-13
Author(s): Cuixia Xu, Junlong Zhu, Youlin Shang, Qingtao Wu

In a distributed online optimization problem with a convex constraint set over an undirected multiagent network, the local objective functions are convex and vary over time. Most of the existing methods for this problem are based on the gradient descent method, but their convergence slows as the number of iterations grows. To accelerate convergence, we present a distributed online conjugate gradient algorithm in which, unlike in a gradient method, the search directions are a set of mutually conjugate vectors and the step sizes are obtained through an exact line search. We analyze the convergence of the algorithm theoretically and obtain a regret bound of O(√T), where T is the number of iterations. Finally, numerical experiments on a sensor network demonstrate the performance of the proposed algorithm.
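As a generic sketch of how such a method can be organized (consensus averaging over the network followed by a Fletcher-Reeves-type conjugate direction at each agent, with a fixed step in place of the exact line search), the following is an assumption-laden illustration rather than the authors' algorithm.

import numpy as np

def distributed_cg_round(x, d, g_prev, grads, W, step=0.1):
    # x, d, g_prev, grads: arrays of shape (n_agents, dim); W: doubly stochastic mixing matrix.
    x_cons = W @ x                                                        # consensus (mixing) step
    beta = (grads * grads).sum(axis=1) / np.maximum((g_prev * g_prev).sum(axis=1), 1e-12)
    d_new = -grads + beta[:, None] * d                                    # conjugate direction per agent
    x_new = x_cons + step * d_new                                         # move along the new direction
    return x_new, d_new, grads                                            # grads become next round's g_prev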

