STUDYING THE EFFECT OF ADAPTIVE MOMENTUM IN IMPROVING THE ACCURACY OF GRADIENT DESCENT BACK PROPAGATION ALGORITHM ON CLASSIFICATION PROBLEMS

2012, Vol 09, pp. 432-439
Author(s): MUHAMMAD ZUBAIR REHMAN, NAZRI MOHD. NAWI

Despite being widely used in practical problems around the world, the gradient descent back-propagation algorithm suffers from slow convergence and convergence to local minima. Previous researchers have suggested modifications to improve its convergence, such as careful selection of the initial weights and biases, the learning rate, the momentum, the network topology, the activation function, and the value of the 'gain' in the activation function. This research proposes an algorithm for improving the performance of back-propagation, 'Gradient Descent with Adaptive Momentum (GDAM)', which adapts the momentum while keeping the gain value fixed during all network trials. The performance of GDAM is compared with 'Gradient Descent with fixed Momentum (GDM)' and 'Gradient Descent Method with Adaptive Gain (GDM-AG)'. The learning rate is fixed at 0.4, the maximum number of epochs is set to 3000, and the sigmoid activation function is used in the experiments. The results show that GDAM outperforms the previous methods, achieving an accuracy ratio of 1.0 on classification problems such as Wine Quality, Mushroom, and Thyroid disease.
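
As a rough illustration of the idea (the abstract does not give GDAM's exact adaptation rule, so the error-driven heuristic and the parameter values below are assumptions), a momentum-adapted gradient-descent step might look like this in Python:

```python
import numpy as np

def sigmoid(x, gain=1.0):
    """Sigmoid activation; the gain is held fixed across trials, as in GDAM."""
    return 1.0 / (1.0 + np.exp(-gain * x))

def gdam_step(w, grad, velocity, momentum, lr=0.4):
    """One momentum update: v <- momentum*v - lr*grad, w <- w + v."""
    velocity = momentum * velocity - lr * grad
    return w + velocity, velocity

def adapt_momentum(momentum, err, prev_err, up=1.05, down=0.7, lo=0.1, hi=0.99):
    """Hypothetical adaptation: grow the momentum while the error keeps falling,
    shrink it when the error rises (not the authors' published rule)."""
    momentum = momentum * (up if err < prev_err else down)
    return float(np.clip(momentum, lo, hi))
```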

2012, Vol 09, pp. 448-455
Author(s): NORHAMREEZA ABDUL HAMID, NAZRI MOHD NAWI, ROZAIDA GHAZALI, MOHD NAJIB MOHD SALLEH

This paper presents a new method that helps the back-propagation algorithm avoid the local minima problem and the slow convergence caused by neuron saturation in the hidden layer. In the proposed algorithm, each training pattern has its own activation-function gains for the neurons in the hidden layer, which are adjusted together with adaptive momentum and learning rate values during the learning process. The efficiency of the proposed algorithm is compared with the conventional back-propagation gradient descent and the current back-propagation gradient descent with adaptive gain through simulations on three benchmark problems, namely iris, glass, and thyroid.
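
A minimal sketch of the per-pattern gain idea follows, assuming sigmoid hidden units whose gain is trained by gradient descent; the paper's actual update rules for the gain, momentum, and learning rate are not given in the abstract, so the formulas below are placeholders:

```python
import numpy as np

def hidden_layer(x, W, gain):
    """Sigmoid hidden layer with an explicit gain: f(net) = 1/(1 + exp(-gain*net))."""
    net = x @ W
    out = 1.0 / (1.0 + np.exp(-gain * net))
    return out, net

def gain_gradient(err_signal, net, out):
    """dE/d(gain) for a sigmoid unit: the backpropagated error times
    df/d(gain) = net * out * (1 - out)."""
    return err_signal * net * out * (1.0 - out)

def update_gain(gain, grad_gain, lr_gain=0.01):
    """Placeholder gradient step on the gain kept for each training pattern."""
    return gain - lr_gain * grad_gain
```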


Author(s): Mohammed Sarhan Al_Duais, Fatma Susilawati Mohamad

The main problems of the batch back-propagation (BBP) algorithm are slow training, the need to adjust several parameters manually, and saturation during training. The learning rate and momentum factor are significant parameters for increasing the efficiency of BBP. In this study, we created a new dynamic function for each of the learning rate and the momentum factor, and we present the DBBPLM algorithm, which trains with these dynamic functions. A sigmoid function is used as the activation function. The XOR problem and the balance, breast cancer, and iris datasets were used as benchmarks for testing the effect of the dynamic DBBPLM algorithm. All experiments were performed in MATLAB 2012a, and training was stopped when the error reached 10^-5. The experimental results show that the DBBPLM algorithm provides superior training performance, with faster training and higher accuracy than the BBP algorithm and existing works.
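
The abstract states that the learning rate and momentum factor are each computed by a dynamic function during training but does not reproduce those functions; the error-dependent forms below are therefore illustrative placeholders, not the DBBPLM formulas:

```python
import numpy as np

def dynamic_lr(batch_error, base=0.5):
    """Hypothetical dynamic learning rate: larger while the batch error is large."""
    return base * (1.0 - np.exp(-batch_error))

def dynamic_momentum(batch_error, base=0.9):
    """Hypothetical dynamic momentum: approaches `base` as the error shrinks."""
    return base * np.exp(-batch_error)

def batch_bp_step(w, velocity, batch_grad, batch_error):
    """One batch update using the dynamically computed coefficients."""
    lr, mu = dynamic_lr(batch_error), dynamic_momentum(batch_error)
    velocity = mu * velocity - lr * batch_grad
    return w + velocity, velocity
```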


1999, Vol 09 (04), pp. 273-284
Author(s): ROELOF K. BROUWER

This paper illustrates the use of a powerful language, called J, that is ideal for simulating neural networks. The use of J is demonstrated by applying it to a gradient descent method for training a multilayer perceptron. It is also shown how the back-propagation algorithm can be easily generalized to multilayer networks without any increase in complexity, and that the algorithm can be expressed entirely in an array notation that is directly executable through J. J is a general-purpose language, which gives its user a flexibility not available in neural network simulators or in software packages such as MATLAB. Yet, because of its numerous operators, J allows very succinct code, leading to a tremendous decrease in development time.
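
The paper's point is that the whole training step can be written as a handful of array operations; as an illustration only (in NumPy rather than J, and not the paper's code), one gradient-descent step for a two-layer sigmoid perceptron in pure matrix form might look like this:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def mlp_step(X, T, W1, W2, lr=0.1):
    """One batch gradient-descent update for inputs X and targets T,
    expressed entirely in array notation."""
    H = sigmoid(X @ W1)                  # hidden activations
    Y = sigmoid(H @ W2)                  # network outputs
    dY = (Y - T) * Y * (1.0 - Y)         # output-layer deltas (squared error)
    dH = (dY @ W2.T) * H * (1.0 - H)     # hidden-layer deltas
    return W1 - lr * X.T @ dH, W2 - lr * H.T @ dY
```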


2016, Vol 114, pp. 79-87
Author(s): Alaa Ali Hameed, Bekir Karlik, Mohammad Shukri Salman

2018
Author(s): Kazunori D Yamada

In the deep learning era, stochastic gradient descent is the most common method used for optimizing neural network parameters. Among the various mathematical optimization methods, the gradient descent method is the most naive. Adjusting the learning rate is necessary for quick convergence, and with plain gradient descent this is normally done manually. Many optimizers have been developed to control the learning rate and increase convergence speed. Generally, these optimizers adjust the learning rate automatically in response to learning status, and they have been gradually improved by incorporating the effective aspects of earlier methods. In this study, we developed a new optimizer, YamAdam. Our optimizer is based on Adam, which utilizes the first and second moments of previous gradients. In addition to the moment estimation system, we incorporated an advantageous part of AdaDelta, namely its unit correction system, into YamAdam. According to benchmark tests on some common datasets, our optimizer showed similar or faster convergence compared to existing methods. YamAdam is thus an alternative optimizer option for deep learning.
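
The abstract describes YamAdam as Adam's first- and second-moment estimation combined with AdaDelta's unit correction, without giving the exact update rule; the step below (Adam-style moments scaled by an AdaDelta-style running RMS of past updates in place of a fixed learning rate) is one plausible reading, not the published formula:

```python
import numpy as np

def yamadam_like_step(w, grad, state, beta1=0.9, beta2=0.999, eps=1e-8):
    """Illustrative update only: Adam-style moments with an AdaDelta-style
    unit correction; `state` is (m, v, u), all initialized to zeros like w."""
    m, v, u = state
    m = beta1 * m + (1.0 - beta1) * grad            # first moment (Adam)
    v = beta2 * v + (1.0 - beta2) * grad**2         # second moment (Adam)
    step = np.sqrt(u + eps) / np.sqrt(v + eps) * m  # AdaDelta-style unit correction
    u = beta2 * u + (1.0 - beta2) * step**2         # running RMS of past updates
    return w - step, (m, v, u)
```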

