Transfer learning using AlexNet Convolutional Neural Network for Face Recognition

This research is aimed to achieve high-precision accuracy and for face recognition system. Convolution Neural Network is one of the Deep Learning approaches and has demonstrated excellent performance in many fields, including image recognition of a large amount of training data (such as ImageNet). In fact, hardware limitations and insufficient training data-sets are the challenges of getting high performance. Therefore, in this work the Deep Transfer Learning method using AlexNet pre-trained CNN is proposed to improve the performance of the face-recognition system even for a smaller number of images. The transfer learning method is used to fine-tuning on the last layer of AlexNet CNN model for new classification tasks. The data augmentation (DA) technique also proposed to minimize the over-fitting problem during Deep transfer learning training and to improve accuracy. The results proved the improvement in over-fitting and in performance after using the data augmentation technique. All the experiments were tested on UTeMFD, GTFD, and CASIA-Face V5 small data-sets. As a result, the proposed system achieved a high accuracy as 100% on UTeMFD, 96.67% on GTFD, and 95.60% on CASIA-Face V5 in less than 0.05 seconds of recognition time.

Download Full-text

Robust and discriminating face recognition system based on a neural network and correlation techniques

2020 Tenth International Conference on Image Processing Theory, Tools and Applications (IPTA) ◽

10.1109/ipta50016.2020.9286617 ◽

2020 ◽

Author(s):

Ehsan SEDGH GOOYA ◽

Ayman AL FALOU ◽

Wissam KADDAH

Keyword(s):

Neural Network ◽

Face Recognition ◽

Recognition System ◽

Face Recognition System ◽

Correlation Techniques

Download Full-text

Tomato pest classification using deep convolutional neural network with transfer learning, fine tuning and scratch learning

Intelligent Decision Technologies ◽

10.3233/idt-200192 ◽

2021 ◽

pp. 1-10

Author(s):

Gayatri Pattnaik ◽

Vimal K. Shrivastava ◽

K. Parvathi

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Transfer Learning ◽

Data Augmentation ◽

State Of The Art ◽

Deep Convolutional Neural Network ◽

Fine Tuning ◽

Tomato Plants ◽

Random Weights

Pests are major threat to economic growth of a country. Application of pesticide is the easiest way to control the pest infection. However, excessive utilization of pesticide is hazardous to environment. The recent advances in deep learning have paved the way for early detection and improved classification of pest in tomato plants which will benefit the farmers. This paper presents a comprehensive analysis of 11 state-of-the-art deep convolutional neural network (CNN) models with three configurations: transfers learning, fine-tuning and scratch learning. The training in transfer learning and fine tuning initiates from pre-trained weights whereas random weights are used in case of scratch learning. In addition, the concept of data augmentation has been explored to improve the performance. Our dataset consists of 859 tomato pest images from 10 categories. The results demonstrate that the highest classification accuracy of 94.87% has been achieved in the transfer learning approach by DenseNet201 model with data augmentation.

Download Full-text

Oversampling Based on Data Augmentation in Convolutional Neural Network for Silicon Wafer Defect Classification

Knowledge Innovation Through Intelligent Software Methodologies, Tools and Techniques - Frontiers in Artificial Intelligence and Applications ◽

10.3233/faia200547 ◽

2020 ◽

Author(s):

Uzma Batool ◽

Mohd Ibrahim Shapiai ◽

Nordinah Ismail ◽

Hilman Fauzi ◽

Syahrizal Salleh

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Silicon Wafer ◽

Data Augmentation ◽

Imbalanced Data ◽

Training Data ◽

Defect Classification ◽

Learning Method ◽

Test Set

Silicon wafer defect data collected from fabrication facilities is intrinsically imbalanced because of the variable frequencies of defect types. Frequently occurring types will have more influence on the classification predictions if a model gets trained on such skewed data. A fair classifier for such imbalanced data requires a mechanism to deal with type imbalance in order to avoid biased results. This study has proposed a convolutional neural network for wafer map defect classification, employing oversampling as an imbalance addressing technique. To have an equal participation of all classes in the classifier’s training, data augmentation has been employed, generating more samples in minor classes. The proposed deep learning method has been evaluated on a real wafer map defect dataset and its classification results on the test set returned a 97.91% accuracy. The results were compared with another deep learning based auto-encoder model demonstrating the proposed method, a potential approach for silicon wafer defect classification that needs to be investigated further for its robustness.

Download Full-text

PCA and back-propagation neural network based face recognition system

2015 18th International Conference on Computer and Information Technology (ICCIT) ◽

10.1109/iccitechn.2015.7488138 ◽

2015 ◽

Cited By ~ 9

Author(s):

Firoz Mahmud ◽

Shyla Afroge ◽

Md. Al Mamun ◽

Abdul Matin

Keyword(s):

Neural Network ◽

Face Recognition ◽

Back Propagation ◽

Recognition System ◽

Back Propagation Neural Network ◽

Face Recognition System

Download Full-text

A TIED-MIXTURE 2D HMM FACIAL IMAGE RETRIEVAL SYSTEM

Journal of Circuits System and Computers ◽

10.1142/s0218126604001799 ◽

2004 ◽

Vol 13 (05) ◽

pp. 1133-1146

Author(s):

H. OTHMAN ◽

T. ABOULNASR

Keyword(s):

Face Recognition ◽

Recognition System ◽

Training Data ◽

Expected Improvement ◽

Model Case ◽

Face Recognition System ◽

Image Retrieval System ◽

The Face ◽

Institute Of Technology ◽

Observation Space

In this paper, the effect of mixture tying on a second-order 2D Hidden Markov Model (HMM) is studied as applied to the face recognition problem. While tying HMM parameters is a well-known solution in the case of insufficient training data that leads to nonrobust estimation, it is used here to improve the overall performance in the small model case where the resolution in the observation space is the main problem. The fully-tied-mixture 2D HMM-based face recognition system is applied to the facial database of AT&T and the facial database of Georgia Institute of Technology. The performance of the proposed 2D HMM tied-mixture system is studied and the expected improvement is confirmed.

Download Full-text

The Comparison of Laplacianfaces QR Decomposition and Linear Discriminant Analysis QR Decomposition Algorithm for Face Recognition System on Orthogonal Subspace

Jurnal Teknologi ◽

10.11113/jt.v71.3612 ◽

2014 ◽

Vol 71 (1) ◽

Cited By ~ 1

Author(s):

Purbandini Purbandini

Keyword(s):

Face Recognition ◽

Discriminant Analysis ◽

Linear Discriminant Analysis ◽

Selection Process ◽

Recognition System ◽

Qr Decomposition ◽

Training Data ◽

Distance Method ◽

Linear Discriminant ◽

Face Recognition System

Development of an optimal face recognition system will greatly depend on the characteristics of the selection process are as a basis to pattern recognition. In the characteristic selection process, there are 2 aspects that will be of mutual influence such the reduction of the amount of data used in the classification aspects and increasing discrimination ability aspects. Linear Discriminat Analysis method helps presenting the global structure while Laplacianfaces method is one method that is based on appearance (appearance-based method) in face recognition, in which the local manifold structure presented in the adjacency graph mapped from the training data points. Linear Discriminant Analysis QR decomposition has a computationally low cost because it has small dimensions so that the efficiency and scalability are very high when compared with algorithms of other Linear Discriminant Analysis methods. Laplacianfaces QR decomposition was a algorithm to obtain highly speed and accuracy, and tiny space to keep data on the face recognition. This algorithm consists of 2 stages. The first stage maximizes the distance of between-class scatter matrices by using QR decomposition and the second stage to minimize the distance of within-class scatter matrices. Therefore, it is obtained an optimal discriminant in the data. In this research, classification using the Euclidean distance method. In these experiments using face databases of the Olivetti-Att-ORL, Bern and Yale. The minimum error was achieved with the Laplacianfaces QR decomposition and Linear Discriminant Analysis QR decomposition are 5.88% and 9.08% respectively.

Download Full-text

Ambulatory and Laboratory Stress Detection Based on Raw Electrocardiogram Signals Using a Convolutional Neural Network

Sensors ◽

10.3390/s19204408 ◽

2019 ◽

Vol 19 (20) ◽

pp. 4408 ◽

Cited By ~ 2

Author(s):

Hyun-Myung Cho ◽

Heesu Park ◽

Suh-Yeon Dong ◽

Inchan Youn

Keyword(s):

Neural Network ◽

Transfer Learning ◽

Mental Arithmetic ◽

Small Data ◽

Data Sets ◽

Learning Method ◽

Data Set ◽

Electrocardiogram Signals ◽

Proposed Model ◽

Small Data Set

The goals of this study are the suggestion of a better classification method for detecting stressed states based on raw electrocardiogram (ECG) data and a method for training a deep neural network (DNN) with a smaller data set. We suggest an end-to-end architecture to detect stress using raw ECGs. The architecture consists of successive stages that contain convolutional layers. In this study, two kinds of data sets are used to train and validate the model: A driving data set and a mental arithmetic data set, which smaller than the driving data set. We apply a transfer learning method to train a model with a small data set. The proposed model shows better performance, based on receiver operating curves, than conventional methods. Compared with other DNN methods using raw ECGs, the proposed model improves the accuracy from 87.39% to 90.19%. The transfer learning method improves accuracy by 12.01% and 10.06% when 10 s and 60 s of ECG signals, respectively, are used in the model. In conclusion, our model outperforms previous models using raw ECGs from a small data set and, so, we believe that our model can significantly contribute to mobile healthcare for stress management in daily life.

Download Full-text

Hybrid Approach for Face Recognition from a Single Sample per Person by Combining VLC and GOM

Journal of Intelligent Systems ◽

10.1515/jisys-2018-0380 ◽

2019 ◽

Vol 29 (1) ◽

pp. 1523-1534 ◽

Cited By ~ 1

Author(s):

Ahmed Ghorbel ◽

Walid Aydi ◽

Imen Tajouri ◽

Nouri Masmoudi

Keyword(s):

Feature Extraction ◽

Face Recognition ◽

Execution Time ◽

Hybrid Approach ◽

Recognition System ◽

Single Sample ◽

Data Sets ◽

Extraction Techniques ◽

Face Recognition System ◽

Execution Speed

Abstract This paper proposes a new face recognition system based on combining two feature extraction techniques: the Vander Lugt correlator (VLC) and Gabor ordinal measures (GOM). The proposed system relies on the execution speed of VLC and the robustness of GOM. In this system, we applied the Tan and Triggs and retina modeling enhancement techniques, which are well suited for VLC and GOM, respectively. We evaluated our system on the standard FERET probe data sets and on extended YaleB database. The obtained results exhibited better face recognition rates in a shorter execution time compared to the GOM technique.

Download Full-text