Application Research of Deep Convolutional Neural Network in Computer Vision

<p>As an important research achievement in the field of brain like computing, deep convolution neural network has been widely used in many fields such as computer vision, natural language processing, information retrieval, speech recognition, semantic understanding and so on. It has set off a wave of neural network research in industry and academia and promoted the development of artificial intelligence. At present, the deep convolution neural network mainly simulates the complex hierarchical cognitive laws of the human brain by increasing the number of layers of the network, using a larger training data set, and improving the network structure or training learning algorithm of the existing neural network, so as to narrow the gap with the visual system of the human brain and enable the machine to acquire the capability of "abstract concepts". Deep convolution neural network has achieved great success in many computer vision tasks such as image classification, target detection, face recognition, pedestrian recognition, etc. Firstly, this paper reviews the development history of convolutional neural networks. Then, the working principle of the deep convolution neural network is analyzed in detail. Then, this paper mainly introduces the representative achievements of convolution neural network from the following two aspects, and shows the improvement effect of various technical methods on image classification accuracy through examples. From the aspect of adding network layers, the structures of classical convolutional neural networks such as AlexNet, ZF-Net, VGG, GoogLeNet and ResNet are discussed and analyzed. From the aspect of increasing the size of data set, the difficulties of manually adding labeled samples and the effect of using data amplification technology on improving the performance of neural network are introduced. This paper focuses on the latest research progress of convolution neural network in image classification and face recognition. Finally, the problems and challenges to be solved in future brain-like intelligence research based on deep convolution neural network are proposed.</p>

Download Full-text

Image Classification Algorithm Based on Big Data and Multilabel Learning of Improved Convolutional Neural Network

Wireless Communications and Mobile Computing ◽

10.1155/2021/3138398 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Haibin Chang ◽

Ying Cui

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Image Classification ◽

Convolutional Neural Networks ◽

Classification Algorithm ◽

Large Set ◽

Data Set ◽

Multilabel Learning ◽

Average Maximum

More and more image materials are used in various industries these days. Therefore, how to collect useful images from a large set has become an urgent priority. Convolutional neural networks (CNN) have achieved good results in certain image classification tasks, but there are still problems such as poor classification ability, low accuracy, and slow convergence speed. This article mainly introduces the image classification algorithm (ICA) research based on the multilabel learning of the improved convolutional neural network and some improvement ideas for the research of the ICA based on the multilabel learning of the convolutional neural network. This paper proposes an ICA research method based on multilabel learning of improved convolutional neural networks, including the image classification process, convolutional network algorithm, and multilabel learning algorithm. The conclusions show that the average maximum classification accuracy of the improved CNN in this paper is 90.63%, and the performance is better, which is beneficial to improving the efficiency of image classification. The improved CNN network structure has reached the highest accuracy rate of 91.47% on the CIFAR-10 data set, which is much higher than the traditional CNN algorithm.

Download Full-text

IMPROVING FACE RECOGNITION MODELS USING CONVOLUTIONAL NEURAL NETWORKS, METRIC LEARNING AND OPTIMIZATION METHOD

Journal of Automation and Information sciences ◽

10.34229/1028-0979-2021-5-11 ◽

2021 ◽

Vol 5 ◽

pp. 140-158

Author(s):

Andrey Litvynchuk ◽

◽

Lesya Baranovska ◽

Keyword(s):

Neural Network ◽

Neural Networks ◽

Computer Vision ◽

Face Recognition ◽

Comparative Analysis ◽

Convolutional Neural Networks ◽

Network Architecture ◽

Metric Learning ◽

Stochastic Gradient Descent ◽

Google Search

Face recognition is one of the main tasks of computer vision. It has many applications, which has led to a huge amount of research in this area. And although research in the field has been going on since the beginning of the computer vision, good results could be achieved only with the help of convolutional neural networks. In this work, a comparative analysis of facial recognition methods before convolutional neural networks was performed. A set of neural network architectures, methods of metric learning and optimization are considered. There were performed bunch of experiments and comparative analysis of the considered methods of improvement of convolutional neural networks. As a result a universal algorithm for training the face recognition model was obtained. To compare different approaches of face recognition, we chose a dataset called VGGFace2. It consists of 3,31 million images of 9131 people. It was created using images from the Google search engine. Initially, pre-trained neural networks were used to select photographs with humans. The images were then checked mannualy. For the validation sample, we set aside 50 images of 500 people, for a total of 25,000 images. Almost all experiments were performed iteratively. For example, we choose the best optimizer and then we use it to search for best arctitecture. As expected, neural networks with more parameters and more sophisticated architecture showed better results in this task. Among the considered models the best was Se-ResNet50. Metric learning is a method by which it is possible to achieve good accuracy in face recognition. Without this method it would be impossible to solve the problem. To optimize neural networks, we considered both adaptive and simple optimizers. It turned out that the stochastic gradient descent with moment is the best for this problem, and adaptive methods showed a rather poor result. In general, using different approaches, we were able to obtain an accuracy of 92 %, which is 25,5 % better than the baseline experiment. We see next ways for the further development of the research subject: improving neural network architecture, collecting more data and applying better regularization techniques.

Download Full-text

Deep Convolution Neural Network with 2-Stage Transfer Learning for Medical Image Classification

The Brain & Neural Networks ◽

10.3902/jnns.24.3 ◽

2017 ◽

Vol 24 (1) ◽

pp. 3-12 ◽

Cited By ~ 1

Author(s):

Hayaru Shouno ◽

Aiga Suzuki ◽

Satoshi Suzuki ◽

Shoji Kido

Keyword(s):

Neural Network ◽

Image Classification ◽

Transfer Learning ◽

Medical Image ◽

Convolution Neural Network ◽

Deep Convolution Neural Network ◽

Medical Image Classification

Download Full-text

Image classification using Deep learning

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i2.7.10892 ◽

2018 ◽

Vol 7 (2.7) ◽

pp. 614 ◽

Cited By ~ 5

Author(s):

M Manoj krishna ◽

M Neelima ◽

M Harshali ◽

M Venu Gopala Rao

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Image Processing ◽

Computer Vision ◽

Deep Learning ◽

Image Classification ◽

Convolutional Neural Networks ◽

Classical Problem

The image classification is a classical problem of image processing, computer vision and machine learning fields. In this paper we study the image classification using deep learning. We use AlexNet architecture with convolutional neural networks for this purpose. Four test images are selected from the ImageNet database for the classification purpose. We cropped the images for various portion areas and conducted experiments. The results show the effectiveness of deep learning based image classification using AlexNet.

Download Full-text

Deep Convolution Neural Network for Big Data Medical Image Classification

IEEE Access ◽

10.1109/access.2020.2998808 ◽

2020 ◽

Vol 8 ◽

pp. 105659-105670 ◽

Cited By ~ 1

Author(s):

Rehan Ashraf ◽

Muhammad Asif Habib ◽

Muhammad Akram ◽

Muhammad Ahsan Latif ◽

Muhammad Sheraz Arshad Malik ◽

...

Keyword(s):

Neural Network ◽

Big Data ◽

Image Classification ◽

Medical Image ◽

Convolution Neural Network ◽

Deep Convolution Neural Network ◽

Medical Image Classification

Download Full-text

Automatic Gully Detection: Neural Networks and Computer Vision

Remote Sensing ◽

10.3390/rs12111743 ◽

2020 ◽

Vol 12 (11) ◽

pp. 1743

Author(s):

Artur M. Gafurov ◽

Oleg P. Yermolayev

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Networks ◽

Large Scale ◽

Satellite Images ◽

Gully Erosion ◽

Training Data ◽

Russian Plain ◽

Data Set ◽

High Resolution Satellite Images

Transition from manual (visual) interpretation to fully automated gully detection is an important task for quantitative assessment of modern gully erosion, especially when it comes to large mapping areas. Existing approaches to semi-automated gully detection are based on either object-oriented selection based on multispectral images or gully selection based on a probabilistic model obtained using digital elevation models (DEMs). These approaches cannot be used for the assessment of gully erosion on the territory of the European part of Russia most affected by gully erosion due to the lack of national large-scale DEM and limited resolution of open source multispectral satellite images. An approach based on the use of convolutional neural networks for automated gully detection on the RGB-synthesis of ultra-high resolution satellite images publicly available for the test region of the east of the Russian Plain with intensive basin erosion has been proposed and developed. The Keras library and U-Net architecture of convolutional neural networks were used for training. Preliminary results of application of the trained gully erosion convolutional neural network (GECNN) allow asserting that the algorithm performs well in detecting active gullies, well differentiates gullies from other linear forms of slope erosion — rills and balkas, but so far has errors in detecting complex gully systems. Also, GECNN does not identify a gully in 10% of cases and in another 10% of cases it identifies not a gully. To solve these problems, it is necessary to additionally train the neural network on the enlarged training data set.

Download Full-text

Deep convolution neural network with stacks of multi-scale convolutional layer block using triplet of faces for face recognition in the wild

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC) ◽

10.1109/smc.2016.7844934 ◽

2016 ◽

Cited By ~ 2

Author(s):

Bong-Nam Kang ◽

Yonghyun Kim ◽

Daijin Kim

Keyword(s):

Neural Network ◽

Face Recognition ◽

Convolution Neural Network ◽

Multi Scale ◽

In The Wild ◽

Deep Convolution Neural Network

Download Full-text

Image Classification: A Survey

Journal of Informatics Electrical and Electronics Engineering (JIEEE) ◽

10.54060/jieee/001.02.002 ◽

2020 ◽

Vol 1 (2) ◽

pp. 1-9

Author(s):

Ankita Singh ◽

◽

Pawan Singh

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Learning ◽

Image Classification ◽

Convolutional Neural Networks ◽

Language Processing ◽

Classification Accuracy ◽

Deep Neural Network ◽

Learning Ability ◽

Final Decision

The Classification of images is a paramount topic in artificial vision systems which have drawn a notable amount of interest over the past years. This field aims to classify an image, which is an input, based on its visual content. Currently, most people relied on hand-crafted features to describe an image in a particular way. Then, using classifiers that are learnable, such as random forest, and decision tree was applied to the extract features to come to a final decision. The problem arises when large numbers of photos are concerned. It becomes a too difficult problem to find features from them. This is one of the reasons that the deep neural network model has been introduced. Owing to the existence of Deep learning, it can become feasible to represent the hierarchical nature of features using a various number of layers and corresponding weight with them. The existing image classification methods have been gradually applied in real-world problems, but then there are various problems in its application processes, such as unsatisfactory effect and extremely low classification accuracy or then and weak adaptive ability. Models using deep learning concepts have robust learning ability, which combines the feature extraction and the process of classification into a whole which then completes an image classification task, which can improve the image classification accuracy effectively. Convolutional Neural Networks are a powerful deep neural network technique. These networks preserve the spatial structure of a problem and were built for object recognition tasks such as classifying an image into respective classes. Neural networks are much known because people are getting a state-of-the-art outcome on complex computer vision and natural language processing tasks. Convolutional neural networks have been extensively used.

Download Full-text

A convolutional neural network for predicting transcriptional regulators of genes in Arabidopsis transcriptome data reveals classification based on positive regulatory interactions

10.1101/618926 ◽

2019 ◽

Cited By ~ 3

Author(s):

Dan MacLean

Keyword(s):

Neural Network ◽

Gene Expression ◽

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Network ◽

Convolutional Neural Networks ◽

Expression Profiles ◽

Biological Data ◽

Expression Data ◽

Data Set

AbstractGene Regulatory networks that control gene expression are widely studied yet the interactions that make them up are difficult to predict from high throughput data. Deep Learning methods such as convolutional neural networks can perform surprisingly good classifications on a variety of data types and the matrix-like gene expression profiles would seem to be ideal input data for deep learning approaches. In this short study I compiled training sets of expression data using the Arabidopsis AtGenExpress global stress expression data set and known transcription factor-target interactions from the Arabidopsis PLACE database. I built and optimised convolutional neural networks with a best model providing 95 % accuracy of classification on a held-out validation set. Investigation of the activations within this model revealed that classification was based on positive correlation of expression profiles in short sections. This result shows that a convolutional neural network can be used to make classifications and reveal the basis of those calssifications for gene expression data sets, indicating that a convolutional neural network is a useful and interpretable tool for exploratory classification of biological data. The final model is available for download and as a web application.

Download Full-text

A Study of The Convolutional Neural Networks Applications

UKH Journal of Science and Engineering ◽

10.25079/ukhjse.v3n2y2019.pp31-40 ◽

2019 ◽

Vol 3 (2) ◽

pp. 31-40 ◽

Cited By ~ 2

Author(s):

Ahmed Shamsaldin ◽

Polla Fattah ◽

Tarik Rashid ◽

Nawzad Al-Salihi

Keyword(s):

Neural Networks ◽

Computer Vision ◽

Deep Learning ◽

Natural Language Processing ◽

Face Recognition ◽

Convolutional Neural Networks ◽

Language Processing ◽

Text Classification ◽

Scene Labeling ◽

Real World Problems

At present, deep learning is widely used in a broad range of arenas. A convolutional neural networks (CNN) is becoming the star of deep learning as it gives the best and most precise results when cracking real-world problems. In this work, a brief description of the applications of CNNs in two areas will be presented: First, in computer vision, generally, that is, scene labeling, face recognition, action recognition, and image classification; Second, in natural language processing, that is, the fields of speech recognition and text classification.

Download Full-text