Real Time Face Recognition with limited training data: Feature Transfer Learning integrating CNN and Sparse Approximation

Mapping Intimacies ◽

10.1101/2021.03.17.435457 ◽

2021 ◽

Author(s):

Gargi Mishra ◽

Supriya Bajpai

Keyword(s):

Deep Learning ◽

Face Recognition ◽

Real Time ◽

High Performance ◽

Sparse Approximation ◽

Training Data ◽

Test Image ◽

Learning Methods ◽

Training Time ◽

Training Images

It is highly challenging to obtain high performance with limited and unconstrained data in real time face recognition applications. Sparse Approximation is a fast and computationally efficient method for the above application as it requires no training time as compared to deep learning methods. It eliminates the training time by assuming that the test image can be approximated by the sum of individual contributions of the training images from different classes and the class with maximum contribution is closest to the test image. The efficiency of the Sparse Approximation method can be further increased by providing high quality features as input for classification. Hence, we propose to integrate pre-trained CNN architecture to extract the highly discriminative features from the image dataset for Sparse classification. The proposed approach provides better performance even for one training image per class in complex environment as compared to the existing methods. Highlight of the present approach is the results obtained for LFW dataset with one and thirteen training images per class are 84.86% and 96.14% respectively, whereas the existing deep learning methods use a large amount of training data to achieve comparable results.

Download Full-text

Wide Sliding Window and Subsampling Network for Hyperspectral Image Classification

Remote Sensing ◽

10.3390/rs13071290 ◽

2021 ◽

Vol 13 (7) ◽

pp. 1290

Author(s):

Jiangbo Xi ◽

Okan K. Ersoy ◽

Jianwu Fang ◽

Ming Cong ◽

Tianjun Wu ◽

...

Keyword(s):

Deep Learning ◽

High Performance ◽

Large Scale ◽

Hyperspectral Image ◽

Sliding Window ◽

Spectral Features ◽

Learning Methods ◽

Training Time ◽

Sliding Windows ◽

Training Samples

Recently, deep learning methods, for example, convolutional neural networks (CNNs), have achieved high performance in hyperspectral image (HSI) classification. The limited training samples of HSI images make it hard to use deep learning methods with many layers and a large number of convolutional kernels as in large scale imagery tasks, and CNN-based methods usually need long training time. In this paper, we present a wide sliding window and subsampling network (WSWS Net) for HSI classification. It is based on layers of transform kernels with sliding windows and subsampling (WSWS). It can be extended in the wide direction to learn both spatial and spectral features more efficiently. The learned features are subsampled to reduce computational loads and to reduce memorization. Thus, layers of WSWS can learn higher level spatial and spectral features efficiently, and the proposed network can be trained easily by only computing linear weights with least squares. The experimental results show that the WSWS Net achieves excellent performance with different hyperspectral remotes sensing datasets compared with other shallow and deep learning methods. The effects of ratio of training samples, the sizes of image patches, and the visualization of features in WSWS layers are presented.

Download Full-text

Effect of data-augmentation on fine-tuned CNN model performance

IAES International Journal of Artificial Intelligence (IJ-AI) ◽

10.11591/ijai.v10.i1.pp84-92 ◽

2021 ◽

Vol 10 (1) ◽

pp. 84

Author(s):

Ramaprasad Poojary ◽

Roma Raina ◽

Amit Kumar Mondal

Keyword(s):

Neural Network ◽

Computer Vision ◽

Deep Learning ◽

High Performance ◽

Data Augmentation ◽

Model Performance ◽

Training Data ◽

Fine Tuning ◽

Test Accuracy ◽

Training Time

<span id="docs-internal-guid-cdb76bbb-7fff-978d-961c-e21c41807064"><span>During the last few years, deep learning achieved remarkable results in the field of machine learning when used for computer vision tasks. Among many of its architectures, deep neural network-based architecture known as convolutional neural networks are recently used widely for image detection and classification. Although it is a great tool for computer vision tasks, it demands a large amount of training data to yield high performance. In this paper, the data augmentation method is proposed to overcome the challenges faced due to a lack of insufficient training data. To analyze the effect of data augmentation, the proposed method uses two convolutional neural network architectures. To minimize the training time without compromising accuracy, models are built by fine-tuning pre-trained networks VGG16 and ResNet50. To evaluate the performance of the models, loss functions and accuracies are used. Proposed models are constructed using Keras deep learning framework and models are trained on a custom dataset created from Kaggle CAT vs DOG database. Experimental results showed that both the models achieved better test accuracy when data augmentation is employed, and model constructed using ResNet50 outperformed VGG16 based model with a test accuracy of 90% with data augmentation & 82% without data augmentation.</span></span>

Download Full-text

Real Time Gender and Age Prediction using Deep Learning Techniques

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.e2906.049620 ◽

2020 ◽

Vol 9 (6) ◽

pp. 797-801

Keyword(s):

Deep Learning ◽

Face Recognition ◽

Real Time ◽

Vital Role ◽

Learning Techniques ◽

Gender And Age ◽

The Face ◽

And Gender ◽

Gender Detection ◽

Age Prediction

Face recognition plays a vital role in security purpose. In recent years, the researchers have focused on the pose illumination, face recognition, etc,. The traditional methods of face recognition focus on Open CV’s fisher faces which results in analyzing the face expressions and attributes. Deep learning method used in this proposed system is Convolutional Neural Network (CNN). Proposed work includes the following modules: [1] Face Detection [2] Gender Recognition [3] Age Prediction. Thus the results obtained from this work prove that real time age and gender detection using CNN provides better accuracy results compared to other existing approaches.

Download Full-text

Computational Complexity Reduction of Neural Networks of Brain Tumor Image Segmentation by Introducing Fermi–Dirac Correction Functions

Entropy ◽

10.3390/e23020223 ◽

2021 ◽

Vol 23 (2) ◽

pp. 223

Author(s):

Yen-Ling Tai ◽

Shin-Jhe Huang ◽

Chien-Chang Chen ◽

Henry Horng-Shing Lu

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Computational Complexity ◽

High Performance ◽

Low Cost ◽

Structural Complexity ◽

Correction Function ◽

Computational Time ◽

Learning Methods ◽

Band Theory

Nowadays, deep learning methods with high structural complexity and flexibility inevitably lean on the computational capability of the hardware. A platform with high-performance GPUs and large amounts of memory could support neural networks having large numbers of layers and kernels. However, naively pursuing high-cost hardware would probably drag the technical development of deep learning methods. In the article, we thus establish a new preprocessing method to reduce the computational complexity of the neural networks. Inspired by the band theory of solids in physics, we map the image space into a noninteraction physical system isomorphically and then treat image voxels as particle-like clusters. Then, we reconstruct the Fermi–Dirac distribution to be a correction function for the normalization of the voxel intensity and as a filter of insignificant cluster components. The filtered clusters at the circumstance can delineate the morphological heterogeneity of the image voxels. We used the BraTS 2019 datasets and the dimensional fusion U-net for the algorithmic validation, and the proposed Fermi–Dirac correction function exhibited comparable performance to other employed preprocessing methods. By comparing to the conventional z-score normalization function and the Gamma correction function, the proposed algorithm can save at least 38% of computational time cost under a low-cost hardware architecture. Even though the correction function of global histogram equalization has the lowest computational time among the employed correction functions, the proposed Fermi–Dirac correction function exhibits better capabilities of image augmentation and segmentation.

Download Full-text

Real-time multiple object tracking using deep learning methods

Neural Computing and Applications ◽

10.1007/s00521-021-06391-y ◽

2021 ◽

Author(s):

Dimitrios Meimetis ◽

Ioannis Daramouskas ◽

Isidoros Perikos ◽

Ioannis Hatzilygeroudis

Keyword(s):

Deep Learning ◽

Object Tracking ◽

Real Time ◽

Multiple Object Tracking ◽

Learning Methods ◽

Multiple Object

Download Full-text

Robust Approach to Supervised Deep Neural Network Training for Real-Time Object Classification in Cluttered Indoor Environment

Applied Sciences ◽

10.3390/app11157148 ◽

2021 ◽

Vol 11 (15) ◽

pp. 7148

Author(s):

Bedada Endale ◽

Abera Tullu ◽

Hayoung Shi ◽

Beom-Soo Kang

Keyword(s):

Neural Network ◽

Deep Learning ◽

Real Time ◽

Network Architecture ◽

Input Data ◽

Deep Neural Network ◽

Data Augmentation ◽

Object Classification ◽

Training Data ◽

Gradient Descent Algorithm

Unmanned aerial vehicles (UAVs) are being widely utilized for various missions: in both civilian and military sectors. Many of these missions demand UAVs to acquire artificial intelligence about the environments they are navigating in. This perception can be realized by training a computing machine to classify objects in the environment. One of the well known machine training approaches is supervised deep learning, which enables a machine to classify objects. However, supervised deep learning comes with huge sacrifice in terms of time and computational resources. Collecting big input data, pre-training processes, such as labeling training data, and the need for a high performance computer for training are some of the challenges that supervised deep learning poses. To address these setbacks, this study proposes mission specific input data augmentation techniques and the design of light-weight deep neural network architecture that is capable of real-time object classification. Semi-direct visual odometry (SVO) data of augmented images are used to train the network for object classification. Ten classes of 10,000 different images in each class were used as input data where 80% were for training the network and the remaining 20% were used for network validation. For the optimization of the designed deep neural network, a sequential gradient descent algorithm was implemented. This algorithm has the advantage of handling redundancy in the data more efficiently than other algorithms.

Download Full-text

Real-time crowd behavior recognition in surveillance videos based on deep learning methods

Journal of Real-Time Image Processing ◽

10.1007/s11554-021-01116-9 ◽

2021 ◽

Author(s):

Fariba Rezaei ◽

Mehran Yazdi

Keyword(s):

Deep Learning ◽

Real Time ◽

Crowd Behavior ◽

Behavior Recognition ◽

Learning Methods ◽

Surveillance Videos

Download Full-text

Batch Effect Removal via Batch-Free Encoding

10.1101/380816 ◽

2018 ◽

Cited By ~ 1

Author(s):

Uri Shaham

Keyword(s):

Deep Learning ◽

Biological Properties ◽

Batch Effect ◽

Training Data ◽

Rna Seq ◽

Batch Effects ◽

Training Time ◽

Learning Techniques ◽

Downstream Analysis ◽

Biological Patterns

AbstractBiological measurements often contain systematic errors, also known as “batch effects”, which may invalidate downstream analysis when not handled correctly. The problem of removing batch effects is of major importance in the biological community. Despite recent advances in this direction via deep learning techniques, most current methods may not fully preserve the true biological patterns the data contains. In this work we propose a deep learning approach for batch effect removal. The crux of our approach is learning a batch-free encoding of the data, representing its intrinsic biological properties, but not batch effects. In addition, we also encode the systematic factors through a decoding mechanism and require accurate reconstruction of the data. Altogether, this allows us to fully preserve the true biological patterns represented in the data. Experimental results are reported on data obtained from two high throughput technologies, mass cytometry and single-cell RNA-seq. Beyond good performance on training data, we also observe that our system performs well on test data obtained from new patients, which was not available at training time. Our method is easy to handle, a publicly available code can be found at https://github.com/ushaham/BatchEffectRemoval2018.

Download Full-text

Deep Learning of COVID-19 Chest X-Rays: New Models or Fine Tuning?

10.36227/techrxiv.12656948.v1 ◽

2020 ◽

Author(s):

Tuan Pham

Keyword(s):

Deep Learning ◽

High Performance ◽

Data Augmentation ◽

Dominant Role ◽

Care Center ◽

Characteristic Curve ◽

Fine Tuning ◽

Urgent Care ◽

X Rays ◽

Training Time

Chest X-rays have been found to be very promising for assessing COVID-19 patients, especially for resolving emergency-department and urgent-care-center overcapacity. Deep-learning (DL) methods in artificial intelligence (AI) play a dominant role as high-performance classifiers in the detection of the disease using chest X-rays. While many new DL models have been being developed for this purpose, this study aimed to investigate the fine tuning of pretrained convolutional neural networks (CNNs) for the classification of COVID-19 using chest X-rays. Three pretrained CNNs, which are AlexNet, GoogleNet, and SqueezeNet, were selected and fine-tuned without data augmentation to carry out 2-class and 3-class classification tasks using 3 public chest X-ray databases. In comparison with other recently developed DL models, the 3 pretrained CNNs achieved very high classification results in terms of accuracy, sensitivity, specificity, precision, F1 score, and area under the receiver-operating-characteristic curve. AlexNet, GoogleNet, and SqueezeNet require the least training time among pretrained DL models, but with suitable selection of training parameters, excellent classification results can be achieved without data augmentation by these networks. The findings contribute to the urgent need for harnessing the pandemic by facilitating the deployment of AI tools that are fully automated and readily available in the public domain for rapid implementation.

Download Full-text

Comparative Analysis of Human Face Recognition by Traditional Methods and Deep Learning in Real-Time Environment

2020 IEEE 9th International Conference on Communication Systems and Network Technologies (CSNT) ◽

10.1109/csnt48778.2020.9115779 ◽

2020 ◽

Author(s):

Ruchi Jayaswal ◽

Mansih Dixit

Keyword(s):

Deep Learning ◽

Face Recognition ◽

Comparative Analysis ◽

Real Time ◽

Human Face ◽

Traditional Methods ◽

Human Face Recognition

Download Full-text