Adaptive Learning Rate Adjustment with Short-Term Pre-Training in Data-Parallel Deep Learning

Author(s):  
Kazuki Yamada ◽  
Haruki Mori ◽  
Tetsuya Youkawa ◽  
Yuki Miyauchi ◽  
Shintaro Izumi ◽  
...  
2021 ◽  
Vol 11 (20) ◽  
pp. 9468
Author(s):  
Yunyun Sun ◽  
Yutong Liu ◽  
Haocheng Zhou ◽  
Huijuan Hu

Deep learning proves its promising results in various domains. The automatic identification of plant diseases with deep convolutional neural networks attracts a lot of attention at present. This article extends stochastic gradient descent momentum optimizer and presents a discount momentum (DM) deep learning optimizer for plant diseases identification. To examine the recognition and generalization capability of the DM optimizer, we discuss the hyper-parameter tuning and convolutional neural networks models across the plantvillage dataset. We further conduct comparison experiments on popular non-adaptive learning rate methods. The proposed approach achieves an average validation accuracy of no less than 97% for plant diseases prediction on several state-of-the-art deep learning models and holds a low sensitivity to hyper-parameter settings. Experimental results demonstrate that the DM method can bring a higher identification performance, while still maintaining a competitive performance over other non-adaptive learning rate methods in terms of both training speed and generalization.


2018 ◽  
Vol 6 (1) ◽  
pp. 75-91 ◽  
Author(s):  
Hilmy Assodiky ◽  
Iwan Syarif ◽  
Tessy Badriyah

Arrhythmia is a heartbeat abnormality that can be harmless or harmful. It depends on what kind of arrhythmia that the patient suffers. People with arrhythmia usually feel the same physical symptoms but every arrhythmia requires different treatments. For arrhythmia detection, the cardiologist uses electrocardiogram that represents the cardiac electrical activity. And it is a kind of sequential data with high complexity. So the high performance classification method to help the arrhythmia detection is needed. In this paper, Long Short-Term Memory (LSTM) method was used to classify the arrhythmia. The performance was boosted by using AdaDelta as the adaptive learning rate method. As a comparison, it was compared to LSTM without adaptive learning rate. And the best result that showed high accuracy was obtained by using LSTM with AdaDelta. The correct classification rate was 98% for train data and 97% for test data.


Author(s):  
Mahmoud Smaida ◽  
Serhii Yaroshchak ◽  
Ahmed Y. Ben Sasi

One of the most important hyper-parameters for model training and generalization is the learning rate. Recently, many research studies have shown that optimizing the learning rate schedule is very useful for training deep neural networks to get accurate and efficient results. In this paper, different learning rate schedules using some comprehensive optimization techniques have been compared in order to measure the accuracy of a convolutional neural network CNN model to classify four ophthalmic conditions. In this work, a deep learning CNN based on Keras and TensorFlow has been deployed using Python on a database that contains 1692 images, which consists of four types of ophthalmic cases: Glaucoma, Myopia, Diabetic retinopathy, and Normal eyes. The CNN model has been trained on Google Colab. GPU with different learning rate schedules and adaptive learning algorithms. Constant learning rate, time-based decay, step-based decay, exponential decay, and adaptive learning rate optimization techniques for deep learning have been addressed. Adam adaptive learning rate method. has outperformed the other optimization techniques and achieved the best model accuracy of 92.58% for training set and 80.49% for validation datasets, respectively.


2019 ◽  
Vol 128 ◽  
pp. 197-203 ◽  
Author(s):  
Jinxiu Liang ◽  
Yong Xu ◽  
Chenglong Bao ◽  
Yuhui Quan ◽  
Hui Ji

2011 ◽  
Vol 121-126 ◽  
pp. 705-709
Author(s):  
Xiu Hua Zhang ◽  
Fu Jun Ren ◽  
Yong Cheng Jiang

BP network is the most widely used of the neural net work model, but there are many problems of slow convergence speed and easily getting into the local minimum in the conventional BP algorithm. For this, an improved algorithm is proposed. Momentum term is added, steepness factors are introduced and adaptive learning rate adjustment factor is added. In the Matlab platform simulations are carried out by each improvement methods on the same BP neural network. The results show that: Convergence of improved BP network is decreased from 1000 to 49 and the error is decreased from 10-2 to 10-6. The convergence speed has been significantly improved and the error has been decreased. Using the synthesis improvement method effect is obvious and it provides a good theoretical basis for the practical application.


Sign in / Sign up

Export Citation Format

Share Document