A comparison of conventional and deep learning methods of image classification

The aim of this research is to compare traditional and deep learning methods on image classification tasks. The experiment analyzes five neural network models: two multi-layer perceptron (MLP) architectures, one with two hidden layers and one with three; and three convolutional architectures, namely a model with three VGG blocks, AlexNet, and GoogLeNet. The models were applied to image classification on two datasets, CIFAR-10 and MNIST, and were evaluated for classification performance, training speed, and the effect of dataset complexity on the training outcome.
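To make the contrast between the MLP and convolutional models concrete, the following minimal sketch counts trainable parameters for a fully connected network and for a single convolution layer on CIFAR-10 and MNIST inputs. The hidden-layer widths (512, 256) and the 3x3, 32-filter convolution are hypothetical values chosen for illustration; the abstract does not state the configurations actually used.

```python
def dense_params(n_in, n_out):
    """Weights plus biases of one fully connected layer."""
    return n_in * n_out + n_out


def conv_params(in_channels, out_channels, kernel):
    """Weights plus biases of one square 2-D convolution layer."""
    return kernel * kernel * in_channels * out_channels + out_channels


def mlp_params(layer_sizes):
    """Total parameters of an MLP given [input, hidden..., output] sizes."""
    return sum(dense_params(a, b) for a, b in zip(layer_sizes, layer_sizes[1:]))


# CIFAR-10 images are 32x32 RGB; MNIST images are 28x28 grayscale; both have 10 classes.
CIFAR10_INPUT = 32 * 32 * 3
MNIST_INPUT = 28 * 28

# Hidden sizes (512, 256) are hypothetical, not the paper's configuration.
mlp_cifar = mlp_params([CIFAR10_INPUT, 512, 256, 10])
mlp_mnist = mlp_params([MNIST_INPUT, 512, 256, 10])

# A hypothetical first convolution layer: 3 input channels, 32 filters of size 3x3.
first_conv = conv_params(in_channels=3, out_channels=32, kernel=3)

print(mlp_cifar, mlp_mnist, first_conv)
```

Under these assumed sizes, the first dense layer dominates the MLP's parameter count and grows with input resolution, whereas the convolution layer's parameter count does not depend on image size.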

Convolutional Neural Networks for Water Body Extraction from Landsat Imagery

International Journal of Computational Intelligence and Applications ◽ 10.1142/s1469026817500018 ◽ 2017 ◽ Vol 16 (01) ◽ pp. 1750001 ◽ Cited By ~ 23

Author(s): Long Yu ◽ Zhiyin Wang ◽ Shengwei Tian ◽ Feiyue Ye ◽ Jianli Ding ◽ ...

Keyword(s): Neural Networks ◽ Deep Learning ◽ Convolutional Neural Networks ◽ Water Body ◽ Spatial Information ◽ Landsat Imagery ◽ Classification Performance ◽ Support Vector ◽ Learning Methods ◽ Wide Range

Traditional machine learning methods for water body extraction require complex spectral analysis and feature selection, which rely on a wealth of prior knowledge. They are time-consuming and struggle to meet requirements for accuracy, level of automation, and breadth of application. We present a novel deep learning framework for water body extraction from Landsat imagery that considers both spectral and spatial information. The framework is a hybrid of a convolutional neural network (CNN) and a logistic regression (LR) classifier. CNNs, one class of deep learning methods, have achieved strong results on a variety of visual tasks. A CNN can hierarchically extract deep features directly from raw images and distill the spectral–spatial regularities of the input data, thus improving classification performance. Experimental results on three Landsat imagery datasets show that the proposed model achieves better performance than a support vector machine (SVM) and an artificial neural network (ANN).

Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review

Neural Computation ◽ 10.1162/neco_a_00990 ◽ 2017 ◽ Vol 29 (9) ◽ pp. 2352-2449 ◽ Cited By ~ 562

Author(s): Waseem Rawat ◽ Zenghui Wang

Keyword(s): Neural Networks ◽ Deep Learning ◽ Image Classification ◽ Convolutional Neural Networks ◽ Rapid Progression ◽ Deep Convolutional Neural Networks ◽ Computing Power ◽ Current Trends ◽ Visual Tasks ◽ Classification Tasks

Convolutional neural networks (CNNs) have been applied to visual tasks since the late 1980s. However, despite a few scattered applications, they were dormant until the mid-2000s when developments in computing power and the advent of large amounts of labeled data, supplemented by improved algorithms, contributed to their advancement and brought them to the forefront of a neural network renaissance that has seen rapid progression since 2012. In this review, which focuses on the application of CNNs to image classification tasks, we cover their development, from their predecessors up to recent state-of-the-art deep learning systems. Along the way, we analyze (1) their early successes, (2) their role in the deep learning renaissance, (3) selected symbolic works that have contributed to their recent popularity, and (4) several improvement attempts by reviewing contributions and challenges of over 300 publications. We also introduce some of their current trends and remaining challenges.

Is the aspect ratio of cells important in deep learning? A robust comparison of deep learning methods for multi-scale cytopathology cell image classification: From convolutional neural networks to visual transformers

Computers in Biology and Medicine ◽ 10.1016/j.compbiomed.2021.105026 ◽ 2021 ◽ pp. 105026

Author(s): Wanli Liu ◽ Chen Li ◽ Md Mamunur Rahaman ◽ Tao Jiang ◽ Hongzan Sun ◽ ...

Keyword(s): Neural Networks ◽ Deep Learning ◽ Aspect Ratio ◽ Image Classification ◽ Convolutional Neural Networks ◽ Learning Methods ◽ Multi Scale ◽ Cell Image

Computational Complexity Reduction of Neural Networks of Brain Tumor Image Segmentation by Introducing Fermi–Dirac Correction Functions

Entropy ◽ 10.3390/e23020223 ◽ 2021 ◽ Vol 23 (2) ◽ pp. 223

Author(s): Yen-Ling Tai ◽ Shin-Jhe Huang ◽ Chien-Chang Chen ◽ Henry Horng-Shing Lu

Keyword(s): Neural Networks ◽ Deep Learning ◽ Computational Complexity ◽ High Performance ◽ Low Cost ◽ Structural Complexity ◽ Correction Function ◽ Computational Time ◽ Learning Methods ◽ Band Theory

Nowadays, deep learning methods with high structural complexity and flexibility inevitably depend on the computational capability of the hardware. A platform with high-performance GPUs and large amounts of memory can support neural networks with large numbers of layers and kernels. However, naively pursuing high-cost hardware would probably hold back the technical development of deep learning methods. In this article, we therefore establish a new preprocessing method to reduce the computational complexity of neural networks. Inspired by the band theory of solids in physics, we map the image space isomorphically onto a non-interacting physical system and treat image voxels as particle-like clusters. We then reconstruct the Fermi–Dirac distribution as a correction function that normalizes the voxel intensity and filters out insignificant cluster components. In this setting, the filtered clusters can delineate the morphological heterogeneity of the image voxels. We used the BraTS 2019 datasets and the dimensional fusion U-net for algorithmic validation, and the proposed Fermi–Dirac correction function exhibited performance comparable to the other preprocessing methods employed. Compared with the conventional z-score normalization function and the Gamma correction function, the proposed algorithm saves at least 38% of computational time on a low-cost hardware architecture. Even though global histogram equalization has the lowest computational time among the correction functions employed, the proposed Fermi–Dirac correction function exhibits better capabilities for image augmentation and segmentation.
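The three intensity-correction functions named in this abstract can be sketched side by side. This is only an illustration under stated assumptions: the abstract does not give the authors' exact Fermi–Dirac construction, so `fermi_dirac` below simply applies the textbook occupation function 1 / (exp((x - mu) / T) + 1) to each intensity, with `mu` and `temperature` as hypothetical tuning parameters.

```python
import math


def z_score(values):
    """Conventional z-score normalization: zero mean, unit (population) variance.

    Assumes the values are not all identical (the standard deviation must be non-zero).
    """
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    std = math.sqrt(var)
    return [(v - mean) / std for v in values]


def gamma_correction(values, gamma, max_value=255.0):
    """Gamma correction on intensities first scaled to [0, 1]."""
    return [(v / max_value) ** gamma for v in values]


def fermi_dirac(values, mu, temperature):
    """Textbook Fermi-Dirac occupation applied per intensity (assumed form).

    mu plays the role of the chemical potential and temperature sets the width
    of the transition; both are hypothetical parameters, not the paper's.
    """
    return [1.0 / (math.exp((v - mu) / temperature) + 1.0) for v in values]


intensities = [0.0, 50.0, 100.0, 150.0, 200.0]
print(z_score(intensities))
print(gamma_correction(intensities, gamma=2.2))
print(fermi_dirac(intensities, mu=100.0, temperature=25.0))
```

In this assumed form the output is bounded in (0, 1), equals 0.5 at `mu`, and saturates away from it, which is one way a single function could both normalize intensities and suppress components far from the threshold.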

Validating Deep Neural Networks for Online Decoding of Motor Imagery Movements from EEG Signals

Sensors ◽ 10.3390/s19010210 ◽ 2019 ◽ Vol 19 (1) ◽ pp. 210 ◽ Cited By ~ 32

Author(s): Zied Tayeb ◽ Juri Fedjaev ◽ Nejla Ghaboosi ◽ Christoph Richter ◽ Lukas Everding ◽ ...

Keyword(s): Neural Network ◽ Machine Learning ◽ Deep Learning ◽ Convolutional Neural Network ◽ Motor Imagery ◽ Classification Performance ◽ Feature Engineering ◽ Learning Models ◽ Eeg Signals ◽ Learning Methods

Non-invasive, electroencephalography (EEG)-based brain-computer interfaces (BCIs) for motor imagery movements translate the subject’s motor intention into control signals by classifying the EEG patterns caused by different imagination tasks, e.g., hand movements. This type of BCI has been widely studied and used as an alternative mode of communication and environmental control for disabled patients, such as those suffering from a brainstem stroke or a spinal cord injury (SCI). Notwithstanding the success of traditional machine learning methods in classifying EEG signals, these methods still rely on hand-crafted features. Extracting such features is difficult because of the high non-stationarity of EEG signals, which is a major cause of the stagnating progress in classification performance. Remarkable advances in deep learning methods allow end-to-end learning without any feature engineering, which could benefit BCI motor imagery applications. We developed three deep learning models for decoding motor imagery movements directly from raw EEG signals without any manual feature engineering: (1) a long short-term memory (LSTM) model; (2) a spectrogram-based convolutional neural network (CNN) model; and (3) a recurrent convolutional neural network (RCNN). Results were evaluated on our own publicly available EEG data collected from 20 subjects and on an existing dataset known as the 2b EEG dataset from “BCI Competition IV”. Overall, better classification performance was achieved with deep learning models than with state-of-the-art machine learning techniques, which could chart a route ahead for developing new robust techniques for EEG signal decoding. We underpin this point by demonstrating the successful real-time control of a robotic arm using our CNN-based BCI.

Human Skin Detection in Color Images Using Deep Learning

International Journal of Computer Vision and Image Processing ◽ 10.4018/ijcvip.2015070101 ◽ 2015 ◽ Vol 5 (2) ◽ pp. 1-13 ◽ Cited By ~ 1

Author(s): Mohammadreza Hajiarbabi ◽ Arvin Agah

Keyword(s): Neural Networks ◽ Deep Learning ◽ Human Skin ◽ Skin Color ◽ Color Image ◽ Gaussian Model ◽ Color Images ◽ Skin Detection ◽ Learning Methods ◽ Rule Based

Human skin detection is an important and challenging problem in computer vision. Skin detection can be used as the first phase in face detection when using color images. Differences in illumination and the range of skin colors make skin detection a challenging task. Gaussian models, rule-based methods, and artificial neural networks have all been used for human skin color detection. Deep learning methods are newer learning techniques that have shown improved classification power compared to neural networks. In this paper the authors use deep learning methods to enhance the capabilities of skin detection algorithms. Several experiments have been performed using autoencoders and different color spaces. The proposed technique is evaluated against other available methods in this domain using two color image databases. The results show that skin detection using deep learning outperforms other methods such as rule-based methods, the Gaussian model, and feed-forward neural networks.

Facial skin image classification system using Convolutional Neural Networks deep learning algorithm

2018 9th International Conference on Awareness Science and Technology (iCAST) ◽ 10.1109/icawst.2018.8517246 ◽ 2018

Author(s): Chiun-Li Chin ◽ Ming-Chieh Chin ◽ Ting-Yu Tsai ◽ Wei-En Chen

Keyword(s): Neural Networks ◽ Deep Learning ◽ Image Classification ◽ Convolutional Neural Networks ◽ Classification System ◽ Learning Algorithm ◽ Facial Skin ◽ Deep Learning Algorithm

Image classification using Deep learning

International Journal of Engineering & Technology ◽ 10.14419/ijet.v7i2.7.10892 ◽ 2018 ◽ Vol 7 (2.7) ◽ pp. 614 ◽ Cited By ~ 5

Author(s): M Manoj krishna ◽ M Neelima ◽ M Harshali ◽ M Venu Gopala Rao

Keyword(s): Machine Learning ◽ Neural Networks ◽ Image Processing ◽ Computer Vision ◽ Deep Learning ◽ Image Classification ◽ Convolutional Neural Networks ◽ Classical Problem

Image classification is a classical problem in image processing, computer vision, and machine learning. In this paper we study image classification using deep learning. We use the AlexNet convolutional neural network architecture for this purpose. Four test images are selected from the ImageNet database for classification. We cropped the images to various portions and conducted experiments. The results show the effectiveness of deep learning based image classification using AlexNet.

Integrating Audio Signal Processing and Deep Learning Algorithms for Gait Pattern Classification in Brazilian Gaited Horses

Frontiers in Animal Science ◽ 10.3389/fanim.2021.681557 ◽ 2021 ◽ Vol 2

Author(s): Anderson Antonio Carvalho Alves ◽ Lucas Tassoni Andrietta ◽ Rafael Zinni Lopes ◽ Fernando Oliveira Bussiman ◽ Fabyano Fonseca e Silva ◽ ...

Keyword(s): Neural Networks ◽ Signal Processing ◽ Deep Learning ◽ Audio Signal ◽ Gait Pattern ◽ Classification Performance ◽ Audio Signal Processing ◽ Gait Patterns ◽ Audio Features ◽ Gaited Horses

This study assessed the usefulness of audio signal processing in the gaited horse industry. A total of 196 short audio files (4 s each) were collected from video recordings of Brazilian gaited horses. These files were converted into waveform signals (196 samples by 80,000 columns) and divided into training (N = 164) and validation (N = 32) datasets. Twelve single-valued audio features were initially extracted to summarize the training data according to the gait patterns (Marcha Batida, MB, and Marcha Picada, MP). After preliminary analyses, high-dimensional arrays of the Mel Frequency Cepstral Coefficients (MFCC), Onset Strength (OS), and Tempogram (TEMP) were extracted and used as input to the classification algorithms. A principal component analysis (PCA) was performed on the set of 12 single-valued features and on each audio-feature dataset (AFD: MFCC, OS, and TEMP) for prior data visualization. Machine learning (random forest, RF; support vector machine, SVM) and deep learning (multilayer perceptron neural networks, MLP; convolutional neural networks, CNN) algorithms were used to classify the gait types. A five-fold cross-validation scheme with 10 repetitions was employed to assess the models' predictive performance. The classification performance across models and AFD was also validated with independent observations. The models and AFD were compared based on classification accuracy (ACC), specificity (SPEC), sensitivity (SEN), and area under the curve (AUC). In the logistic regression analysis, five of the 12 extracted audio features differed significantly (p < 0.05) between the gait types. ACC averages ranged from 0.806 to 0.932 for MFCC, from 0.758 to 0.948 for OS, and from 0.936 to 0.968 for TEMP. Overall, the TEMP dataset provided the best classification accuracies for all models. The most suitable method for audio-based horse gait pattern classification was CNN. Both cross and independent validation schemes confirmed that high values of ACC, SPEC, SEN, and AUC are expected for yet-to-be-observed labels, except for MFCC-based models, in which clear overfitting was observed. Using audio-generated data to describe gait phenotypes in Brazilian horses is a promising approach, as the two gait patterns were correctly distinguished. The highest classification performance was achieved by combining CNN and the rhythmic-descriptive AFD.
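The evaluation protocol described in this abstract (five-fold cross-validation repeated 10 times, scored by ACC, SEN, and SPEC) can be sketched with the standard library alone. This is a minimal sketch, assuming the cross-validation is run over the 164 training samples and that splits are shuffled but not stratified; the function names and the seed are illustrative, not the authors' code.

```python
import random


def repeated_kfold(n_samples, k=5, repeats=10, seed=0):
    """Yield (train_idx, test_idx) pairs for `repeats` independently shuffled k-fold splits."""
    rng = random.Random(seed)
    for _ in range(repeats):
        idx = list(range(n_samples))
        rng.shuffle(idx)
        # Deal shuffled indices round-robin into k folds of near-equal size.
        folds = [idx[i::k] for i in range(k)]
        for i, test in enumerate(folds):
            train = [j for f, fold in enumerate(folds) if f != i for j in fold]
            yield train, test


def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, sensitivity (true-positive rate), and specificity (true-negative rate).

    Assumes both classes occur in y_true, so neither denominator is zero.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    return {
        "ACC": (tp + tn) / len(pairs),
        "SEN": tp / (tp + fn),
        "SPEC": tn / (tn + fp),
    }


# 164 is the training-set size quoted in the abstract.
splits = list(repeated_kfold(164))
print(len(splits))  # 5 folds x 10 repetitions
```

Averaging the metric dictionaries over all splits would give per-model figures of the kind the abstract reports; AUC is omitted here because it needs predicted scores rather than hard labels.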

Label Rectification Learning through Kernel Extreme Learning Machine

Wireless Communications and Mobile Computing ◽ 10.1155/2021/6669081 ◽ 2021 ◽ Vol 2021 ◽ pp. 1-6

Author(s): Qiang Cai ◽ Fenghai Li ◽ Yifan Chen ◽ Haisheng Li ◽ Jian Cao ◽ ...

Keyword(s): Neural Network ◽ Convolutional Neural Network ◽ Image Classification ◽ Extreme Learning Machine ◽ Classification Performance ◽ Considerable Progress ◽ Strong Representation ◽ Kernel Extreme Learning Machine ◽ Classification Tasks ◽ Learning Machine

Thanks to the strong representational power of the convolutional neural network (CNN), image classification has made considerable progress. However, the majority of works focus on designing complicated and redundant architectures that extract informative features to improve classification performance. In this study, we concentrate on rectifying the incomplete outputs of the CNN. Concretely, we propose an image classification method based on Label Rectification Learning (LRL) through a kernel extreme learning machine (KELM). It consists of two main steps: (1) preclassification, which extracts incomplete labels through a pretrained CNN, and (2) label rectification, which rectifies the generated incomplete labels with the KELM to obtain the rectified labels. Experiments conducted on publicly available datasets demonstrate the effectiveness of our method. Notably, our method is extensible and can be easily integrated with off-the-shelf networks to improve performance.
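For readers unfamiliar with KELM, the rectification step can be read against the standard closed-form KELM output below. This is the textbook formulation, given here as an assumption; the abstract does not state which kernel or regularization the authors use. In the label-rectification setting, the inputs x_i would presumably be the CNN's incomplete label outputs and T the ground-truth label matrix.

```latex
% Standard kernel extreme learning machine (KELM) output; assumed, not taken from the abstract.
% \Omega_{ij} = K(x_i, x_j) is the N x N kernel matrix over the training inputs,
% T is the N x m target matrix, and C > 0 is a regularization constant.
f(x) =
\begin{bmatrix} K(x, x_1) & \cdots & K(x, x_N) \end{bmatrix}
\left( \frac{I}{C} + \Omega \right)^{-1} T
```

Because the solution is a single regularized linear solve, no iterative training is needed for this step, which is consistent with the abstract's claim that the method can be attached to off-the-shelf pretrained networks.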
