Classification of clothing images based on a parallel convolutional neural network and random vector functional link optimized by the grasshopper optimization algorithm

2021 ◽  
pp. 004051752110592
Author(s):  
Zhiyu Zhou ◽  
Wenxiong Deng ◽  
Yaming Wang ◽  
Zefei Zhu

To improve accuracy in clothing image recognition, this paper proposes a clothing classification method based on a parallel convolutional neural network (PCNN) combined with an optimized random vector functional link (RVFL). The method uses the PCNN model to extract features of clothing images. Then, the structure-intensive and dual-channel convolutional neural network (i.e., the PCNN) is used to solve the problems of traditional convolutional neural networks (e.g., limited data and prone to overfitting). Each convolutional layer is followed by a batch normalization layer, and the leaky rectified linear unit activation function and max-pooling layers are used to improve the performance of the feature extraction. Then, dropout layers and fully connected layers are used to reduce the amount of calculation. The last layer uses the RVFL as optimized by the grasshopper optimization algorithm to replace the SoftMax layer and classify the features, further improving the stability and accuracy of classification. In this study, two aspects of the classification (feature extraction and feature classification) are improved, effectively improving the accuracy. The experimental results show that on the Fashion-Mnist dataset, the accuracy of the algorithm in this study reaches 92.93%. This value is 1.36%, 2.05%, 0.65%, and 3.76% higher than that of the local binary pattern (LBP)-support vector machine (SVM), histogram of oriented gradients (HOG)-SVM, LBP-HOG-SVM, and AlexNet-sparse representation-based classifier algorithms, respectively, effectively demonstrating the classification performance of the algorithm.

2021 ◽  
Vol 19 (8) ◽  
pp. 169-181
Author(s):  
P. Renukadevi ◽  
Dr.A. Rajiv Kannan

Recently the COVID’19 is extensively increasing around the world with many challenges for researchers. Rigorous respiratory disease corona virus 2 show aggression to many parts of COVID’19 affected patients, together with brain and lungs. The changeableness of Corona virus with likely to infect Central Nervous System emphasize the necessity for technological development to identify, handle, and take care of brain damages in COVID’19 patients. An exact short-term predicting the quantity of newly infected and cured cases is vital for resource optimization to stop or reduce the growth of infection. The previous system designed a Linear Decreasing Inertia Weight based Cat Swarm Optimization with Half Binomial Distribution based Convolutional Neural Network (LDIWCSO-HBDCNN) approach for COVID-19 forecasting. However, the ensemble learning is required to improve the prediction outcome via integrating many approaches. This approach allows the production of better predictive performance compared to a single model. For solving this problem, the proposed system designed an Improved Linear Factor based Grasshopper Optimization Algorithm with Ensemble Learning (ILFGOA with EL) for covid-19 forecasting. Initially, the COVID-19 forecasting dataset is taken as an input. With the help of min-max approach, data normalization is done. Then the optimal features are selected by using Improved Linear Factor based Grasshopper Optimization Algorithm (ILFGOA) algorithm to improve the prediction accuracy. Based on the selected features, Ensemble Learning (EL) which includes Hyperparameter based Convolutional Neural Network (HCNN) is utilized to identify infected and demise cases across india for a period of time. The outcome of analysis shows that the introduced method attains better execution against previous system with regard to error rate, accuracy, precision, recall and f-measure.


2021 ◽  
Vol 21 (01) ◽  
pp. 2150005
Author(s):  
ARUN T NAIR ◽  
K. MUTHUVEL

Nowadays, analysis on retinal image exists as one of the challenging area for study. Numerous retinal diseases could be recognized by analyzing the variations taking place in retina. However, the main disadvantage among those studies is that, they do not have higher recognition accuracy. The proposed framework includes four phases namely, (i) Blood Vessel Segmentation (ii) Feature Extraction (iii) Optimal Feature Selection and (iv) Classification. Initially, the input fundus image is subjected to blood vessel segmentation from which two binary thresholded images (one from High Pass Filter (HPF) and other from top-hat reconstruction) are acquired. These two images are differentiated and the areas that are common to both are said to be the major vessels and the left over regions are fused to form vessel sub-image. These vessel sub-images are classified with Gaussian Mixture Model (GMM) classifier and the resultant is summed up with the major vessels to form the segmented blood vessels. The segmented images are subjected to feature extraction process, where the features like proposed Local Binary Pattern (LBP), Gray-Level Co-Occurrence Matrix (GLCM) and Gray Level Run Length Matrix (GLRM) are extracted. As the curse of dimensionality seems to be the greatest issue, it is important to select the appropriate features from the extracted one for classification. In this paper, a new improved optimization algorithm Moth Flame with New Distance Formulation (MF-NDF) is introduced for selecting the optimal features. Finally, the selected optimal features are subjected to Deep Convolutional Neural Network (DCNN) model for classification. Further, in order to make the precise diagnosis, the weights of DCNN are optimally tuned by the same optimization algorithm. The performance of the proposed algorithm will be compared against the conventional algorithms in terms of positive and negative measures.


2021 ◽  
Vol 9 ◽  
Author(s):  
Ashwini K ◽  
P. M. Durai Raj Vincent ◽  
Kathiravan Srinivasan ◽  
Chuan-Yu Chang

Neonatal infants communicate with us through cries. The infant cry signals have distinct patterns depending on the purpose of the cries. Preprocessing, feature extraction, and feature selection need expert attention and take much effort in audio signals in recent days. In deep learning techniques, it automatically extracts and selects the most important features. For this, it requires an enormous amount of data for effective classification. This work mainly discriminates the neonatal cries into pain, hunger, and sleepiness. The neonatal cry auditory signals are transformed into a spectrogram image by utilizing the short-time Fourier transform (STFT) technique. The deep convolutional neural network (DCNN) technique takes the spectrogram images for input. The features are obtained from the convolutional neural network and are passed to the support vector machine (SVM) classifier. Machine learning technique classifies neonatal cries. This work combines the advantages of machine learning and deep learning techniques to get the best results even with a moderate number of data samples. The experimental result shows that CNN-based feature extraction and SVM classifier provides promising results. While comparing the SVM-based kernel techniques, namely radial basis function (RBF), linear and polynomial, it is found that SVM-RBF provides the highest accuracy of kernel-based infant cry classification system provides 88.89% accuracy.


2020 ◽  
Vol 17 (4) ◽  
pp. 572-578
Author(s):  
Mohammad Parseh ◽  
Mohammad Rahmanimanesh ◽  
Parviz Keshavarzi

Persian handwritten digit recognition is one of the important topics of image processing which significantly considered by researchers due to its many applications. The most important challenges in Persian handwritten digit recognition is the existence of various patterns in Persian digit writing that makes the feature extraction step to be more complicated.Since the handcraft feature extraction methods are complicated processes and their performance level are not stable, most of the recent studies have concentrated on proposing a suitable method for automatic feature extraction. In this paper, an automatic method based on machine learning is proposed for high-level feature extraction from Persian digit images by using Convolutional Neural Network (CNN). After that, a non-linear multi-class Support Vector Machine (SVM) classifier is used for data classification instead of fully connected layer in final layer of CNN. The proposed method has been applied to HODA dataset and obtained 99.56% of recognition rate. Experimental results are comparable with previous state-of-the-art methods


2018 ◽  
Vol 10 (3) ◽  
pp. 478-495 ◽  
Author(s):  
Ibrahim Aljarah ◽  
Ala’ M. Al-Zoubi ◽  
Hossam Faris ◽  
Mohammad A. Hassonah ◽  
Seyedali Mirjalili ◽  
...  

Content-Based Image Retrieval (CBIR) is extensively used technique for image retrieval from large image databases. However, users are not satisfied with the conventional image retrieval techniques. In addition, the advent of web development and transmission networks, the number of images available to users continues to increase. Therefore, a permanent and considerable digital image production in many areas takes place. Quick access to the similar images of a given query image from this extensive collection of images pose great challenges and require proficient techniques. From query by image to retrieval of relevant images, CBIR has key phases such as feature extraction, similarity measurement, and retrieval of relevant images. However, extracting the features of the images is one of the important steps. Recently Convolutional Neural Network (CNN) shows good results in the field of computer vision due to the ability of feature extraction from the images. Alex Net is a classical Deep CNN for image feature extraction. We have modified the Alex Net Architecture with a few changes and proposed a novel framework to improve its ability for feature extraction and for similarity measurement. The proposal approach optimizes Alex Net in the aspect of pooling layer. In particular, average pooling is replaced by max-avg pooling and the non-linear activation function Maxout is used after every Convolution layer for better feature extraction. This paper introduces CNN for features extraction from images in CBIR system and also presents Euclidean distance along with the Comprehensive Values for better results. The proposed framework goes beyond image retrieval, including the large-scale database. The performance of the proposed work is evaluated using precision. The proposed work show better results than existing works.


Author(s):  
Zhixian Chen ◽  
Jialin Tang ◽  
Xueyuan Gong ◽  
Qinglang Su

In order to improve the low accuracy of the face recognition methods in the case of e-health, this paper proposed a novel face recognition approach, which is based on convolutional neural network (CNN). In detail, through resolving the convolutional kernel, rectified linear unit (ReLU) activation function, dropout, and batch normalization, this novel approach reduces the number of parameters of the CNN model, improves the non-linearity of the CNN model, and alleviates overfitting of the CNN model. In these ways, the accuracy of face recognition is increased. In the experiments, the proposed approach is compared with principal component analysis (PCA) and support vector machine (SVM) on ORL, Cohn-Kanade, and extended Yale-B face recognition data set, and it proves that this approach is promising.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Peng Qin ◽  
Hongping Hu ◽  
Zhengmin Yang

AbstractGrasshopper optimization algorithm (GOA) proposed in 2017 mimics the behavior of grasshopper swarms in nature for solving optimization problems. In the basic GOA, the influence of the gravity force on the updated position of every grasshopper is not considered, which possibly causes GOA to have the slower convergence speed. Based on this, the improved GOA (IGOA) is obtained by the two updated ways of the position of every grasshopper in this paper. One is that the gravity force is introduced into the updated position of every grasshopper in the basic GOA. And the other is that the velocity is introduced into the updated position of every grasshopper and the new position are obtained from the sum of the current position and the velocity. Then every grasshopper adopts its suitable way of the updated position on the basis of the probability. Finally, IGOA is firstly performed on the 23 classical benchmark functions and then is combined with BP neural network to establish the predicted model IGOA-BPNN by optimizing the parameters of BP neural network for predicting the closing prices of the Shanghai Stock Exchange Index and the air quality index (AQI) of Taiyuan, Shanxi Province. The experimental results show that IGOA is superior to the compared algorithms in term of the average values and the predicted model IGOA-BPNN has the minimal predicted errors. Therefore, the proposed IGOA is an effective and efficient algorithm for optimization.


Sign in / Sign up

Export Citation Format

Share Document