Learning Rate Optimization in Convolutional Neural Networks for Medical Images Classification

© 2019, Springer Nature Switzerland AG. Image classification is a difficult machine learning task, where Convolutional Neural Networks (CNNs) have been applied for over 20 years in order to solve the problem. In recent years, instead of the traditional way of only connecting the current layer with its next layer, shortcut connections have been proposed to connect the current layer with its forward layers apart from its next layer, which has been proved to be able to facilitate the training process of deep CNNs. However, there are various ways to build the shortcut connections, it is hard to manually design the best shortcut connections when solving a particular problem, especially given the design of the network architecture is already very challenging. In this paper, a hybrid evolutionary computation (EC) method is proposed to automatically evolve both the architecture of deep CNNs and the shortcut connections. Three major contributions of this work are: Firstly, a new encoding strategy is proposed to encode a CNN, where the architecture and the shortcut connections are encoded separately; Secondly, a hybrid two-level EC method, which combines particle swarm optimisation and genetic algorithms, is developed to search for the optimal CNNs; Lastly, an adjustable learning rate is introduced for the fitness evaluations, which provides a better learning rate for the training process given a fixed number of epochs. The proposed algorithm is evaluated on three widely used benchmark datasets of image classification and compared with 12 peer Non-EC based competitors and one EC based competitor. The experimental results demonstrate that the proposed method outperforms all of the peer competitors in terms of classification accuracy.

Download Full-text

Investigation of Convolutional Neural Networks in the Tasks of Medical Images Analysis and Classification of Breast Tumors

10.1109/cisp-bmei53629.2021.9624326 ◽

2021 ◽

Author(s):

Yuriy Zaychenko ◽

Helen Zaichenko ◽

Galib Hamidov

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Medical Images ◽

Breast Tumors

Download Full-text

BE-FNet: 3D Bounding Box Estimation Feature Pyramid Network for Accurate and Efficient Maxillary Sinus Segmentation

Mathematical Problems in Engineering ◽

10.1155/2020/5689301 ◽

2020 ◽

Vol 2020 ◽

pp. 1-16

Author(s):

Zhuofu Deng ◽

Binbin Wang ◽

Zhiliang Zhu

Keyword(s):

Neural Networks ◽

Maxillary Sinus ◽

Convolutional Neural Networks ◽

Feature Fusion ◽

Medical Images ◽

Medical Image Segmentation ◽

3D Segmentation ◽

Deep Convolutional Neural Networks ◽

Bounding Box ◽

Proposed Model

Maxillary sinus segmentation plays an important role in the choice of therapeutic strategies for nasal disease and treatment monitoring. Difficulties in traditional approaches deal with extremely heterogeneous intensity caused by lesions, abnormal anatomy structures, and blurring boundaries of cavity. 2D and 3D deep convolutional neural networks have grown popular in medical image segmentation due to utilization of large labeled datasets to learn discriminative features. However, for 3D segmentation in medical images, 2D networks are not competent in extracting more significant spacial features, and 3D ones suffer from unbearable burden of computation, which results in great challenges to maxillary sinus segmentation. In this paper, we propose a deep neural network with an end-to-end manner to generalize a fully automatic 3D segmentation. At first, our proposed model serves a symmetrical encoder-decoder architecture for multitask of bounding box estimation and in-region 3D segmentation, which cannot reduce excessive computation requirements but eliminate false positives remarkably, promoting 3D segmentation applied in 3D convolutional neural networks. In addition, an overestimation strategy is presented to avoid overfitting phenomena in conventional multitask networks. Meanwhile, we introduce residual dense blocks to increase the depth of the proposed network and attention excitation mechanism to improve the performance of bounding box estimation, both of which bring little influence to computation cost. Especially, the structure of multilevel feature fusion in the pyramid network strengthens the ability of identification to global and local discriminative features in foreground and background achieving more advanced segmentation results. At last, to address problems of blurring boundary and class imbalance in medical images, a hybrid loss function is designed for multiple tasks. To illustrate the strength of our proposed model, we evaluated it against the state-of-the-art methods. Our model performed better significantly with an average Dice 0.947±0.031, VOE 10.23±5.29, and ASD 2.86±2.11, respectively, which denotes a promising technique with strong robust in practice.

Download Full-text

Predicting Semantic Descriptions from Medical Images with Convolutional Neural Networks

Lecture Notes in Computer Science - Information Processing in Medical Imaging ◽

10.1007/978-3-319-19992-4_34 ◽

2015 ◽

pp. 437-448 ◽

Cited By ~ 26

Author(s):

Thomas Schlegl ◽

Sebastian M. Waldstein ◽

Wolf-Dieter Vogl ◽

Ursula Schmidt-Erfurth ◽

Georg Langs

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Medical Images

Download Full-text

Plant Diseases Identification through a Discount Momentum Optimizer in Deep Learning

Applied Sciences ◽

10.3390/app11209468 ◽

2021 ◽

Vol 11 (20) ◽

pp. 9468

Author(s):

Yunyun Sun ◽

Yutong Liu ◽

Haocheng Zhou ◽

Huijuan Hu

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Adaptive Learning ◽

Learning Rate ◽

Plant Diseases ◽

Stochastic Gradient Descent ◽

Automatic Identification ◽

Deep Convolutional Neural Networks ◽

Adaptive Learning Rate

Deep learning proves its promising results in various domains. The automatic identification of plant diseases with deep convolutional neural networks attracts a lot of attention at present. This article extends stochastic gradient descent momentum optimizer and presents a discount momentum (DM) deep learning optimizer for plant diseases identification. To examine the recognition and generalization capability of the DM optimizer, we discuss the hyper-parameter tuning and convolutional neural networks models across the plantvillage dataset. We further conduct comparison experiments on popular non-adaptive learning rate methods. The proposed approach achieves an average validation accuracy of no less than 97% for plant diseases prediction on several state-of-the-art deep learning models and holds a low sensitivity to hyper-parameter settings. Experimental results demonstrate that the DM method can bring a higher identification performance, while still maintaining a competitive performance over other non-adaptive learning rate methods in terms of both training speed and generalization.

Download Full-text

Improving Semantic Segmentation of 3D Medical Images on 3D Convolutional Neural Networks

2019 XLV Latin American Computing Conference (CLEI) ◽

10.1109/clei47609.2019.235102 ◽

2019 ◽

Author(s):

Alejandra Marquez Herrera ◽

Alex J. Cuadros-Vargas ◽

Helio Pedrini

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Medical Images ◽

Semantic Segmentation

Download Full-text

IMPROVING FACE RECOGNITION MODELS USING METRIC LEARNING, LEARNING RATE SCHEDULERS, AND AUGMENTATIONS

Journal of Automation and Information sciences ◽

10.34229/1028-0979-2021-6-9 ◽

2021 ◽

Vol 6 ◽

pp. 93-101

Author(s):

Andrey Litvynchuk ◽

◽

Lesia Baranovska ◽

◽

Keyword(s):

Neural Networks ◽

Computer Vision ◽

Face Recognition ◽

Comparative Analysis ◽

Convolutional Neural Networks ◽

Facial Recognition ◽

Metric Learning ◽

Learning Rate ◽

Training Dataset ◽

Wide Range

Face recognition is one of the main tasks of computer vision, which is relevant due to its practical significance and great interest of wide range of scientists. It has many applications, which has led to a huge amount of research in this area. And although research in the field has been going on since the beginning of the computer vision, good results could be achieved only with the help of convolutional neural networks. In this work, a comparative analysis of facial recognition methods before convolutional neural networks was performed. A metric learning approach, augmentations and learning rate schedulers are considered. There were performed bunch of experiments and comparative analysis of the considered methods of improvement of convolutional neural networks. As a result a universal algorithm for training the face recognition model was obtained. In this work, we used SE-ResNet50 as the only neural network for experiments. Metric learning is a method by which it is possible to achieve good accuracy in face recognition. Overfitting is a big problem of neural networks, in particular because they have too many parameters and usually not enough data to guarantee the generalization of the model. Additional data labeling can be time-consuming and expensive, so there is such an approach as augmentation. Augmentations artificially increase the training dataset, so as expected, this method improved the results relative to the original experiment in all experiments. Different degrees and more aggressive forms of augmentation in this work led to better results. As expected, the best learning rate scheduler was cosine scheduler with warm-ups and restarts. This schedule has few parameters, so it is also easy to use. In general, using different approaches, we were able to obtain an accuracy of 93,5 %, which is 22 % better than the baseline experiment. In the following studies, it is planned to consider improving not only the model of facial recognition, but also detection. The accuracy of face detection directly depends on the quality of face recognition.

Download Full-text