Recognize Number of fingers from Single hand gesture Image using Image processing and Neural Network

Automatic classification of dynamic hand gesture is challenging due to the large diversity in a different class of gesture, Low resolution, and it is performed by finger. Due to a number of challenges many researchers focus on this area. Recently deep neural network can be used for implicit feature extraction and Soft Max layer is used for classification. In this paper, we propose a method based on a two-dimensional convolutional neural network that performs detection and classification of hand gesture simultaneously from multimodal Red, Green, Blue, Depth (RGBD) and Optical flow Data and passes this feature to Long-Short Term Memory (LSTM) recurrent network for frame-to-frame probability generation with Connectionist Temporal Classification (CTC) network for loss calculation. We have calculated an optical flow from Red, Green, Blue (RGB) data for getting proper motion information present in the video. CTC model is used to efficiently evaluate all possible alignment of hand gesture via dynamic programming and check consistency via frame-to-frame for the visual similarity of hand gesture in the unsegmented input stream. CTC network finds the most probable sequence of a frame for a class of gesture. The frame with the highest probability value is selected from the CTC network by max decoding. This entire CTC network is trained end-to-end with calculating CTC loss for recognition of the gesture. We have used challenging Vision for Intelligent Vehicles and Applications (VIVA) dataset for dynamic hand gesture recognition captured with RGB and Depth data. On this VIVA dataset, our proposed hand gesture recognition technique outperforms competing state-of-the-art algorithms and gets an accuracy of 86%

Download Full-text

Trauma Identification Using Image Processing and FeedForward Neural Network

SSRN Electronic Journal ◽

10.2139/ssrn.3734767 ◽

2020 ◽

Author(s):

Sofia R

Keyword(s):

Neural Network ◽

Image Processing ◽

Feedforward Neural Network

Download Full-text

Convolutional Neural Network for the Semantic Segmentation of Remote Sensing Images

Mobile Networks and Applications ◽

10.1007/s11036-020-01703-3 ◽

2021 ◽

Vol 26 (1) ◽

pp. 200-215

Author(s):

Muhammad Alam ◽

Jian-Feng Wang ◽

Cong Guangpei ◽

LV Yunrong ◽

Yuanfang Chen

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Neural Networks ◽

Image Processing ◽

Deep Learning ◽

Semantic Segmentation ◽

Natural Scene ◽

Remote Sensing Images ◽

Advantages And Disadvantages ◽

Target Segmentation

AbstractIn recent years, the success of deep learning in natural scene image processing boosted its application in the analysis of remote sensing images. In this paper, we applied Convolutional Neural Networks (CNN) on the semantic segmentation of remote sensing images. We improve the Encoder- Decoder CNN structure SegNet with index pooling and U-net to make them suitable for multi-targets semantic segmentation of remote sensing images. The results show that these two models have their own advantages and disadvantages on the segmentation of different objects. In addition, we propose an integrated algorithm that integrates these two models. Experimental results show that the presented integrated algorithm can exploite the advantages of both the models for multi-target segmentation and achieve a better segmentation compared to these two models.

Download Full-text

Real-Time Greenhouse Environmental Conditions Optimization Using Neural Network and Image Processing

2020 20th International Conference on Advances in ICT for Emerging Regions (ICTer) ◽

10.1109/icter51097.2020.9325472 ◽

2020 ◽

Author(s):

Piyumi Wickramaarachchi ◽

Niroshan Balasooriya ◽

Lakmal Welipenne ◽

Sachintha Gunasekara ◽

Anuradha Jayakody

Keyword(s):

Neural Network ◽

Image Processing ◽

Real Time ◽

Environmental Conditions

Download Full-text

Diabetic Retinal Grading Using Attention-Based Bilinear Convolutional Neural Network and Complement Cross Entropy

Entropy ◽

10.3390/e23070816 ◽

2021 ◽

Vol 23 (7) ◽

pp. 816

Author(s):

Pingping Liu ◽

Xiaokang Yang ◽

Baixin Jin ◽

Qiuzhan Zhou

Keyword(s):

Neural Network ◽

Image Processing ◽

Diabetic Retinopathy ◽

Convolutional Neural Network ◽

Image Classification ◽

Network Model ◽

Rapid Development ◽

Image Data ◽

Lesion Detection ◽

Great Success

Diabetic retinopathy (DR) is a common complication of diabetes mellitus (DM), and it is necessary to diagnose DR in the early stages of treatment. With the rapid development of convolutional neural networks in the field of image processing, deep learning methods have achieved great success in the field of medical image processing. Various medical lesion detection systems have been proposed to detect fundus lesions. At present, in the image classification process of diabetic retinopathy, the fine-grained properties of the diseased image are ignored and most of the retinopathy image data sets have serious uneven distribution problems, which limits the ability of the network to predict the classification of lesions to a large extent. We propose a new non-homologous bilinear pooling convolutional neural network model and combine it with the attention mechanism to further improve the network’s ability to extract specific features of the image. The experimental results show that, compared with the most popular fundus image classification models, the network model we proposed can greatly improve the prediction accuracy of the network while maintaining computational efficiency.

Download Full-text

Automated Classification of Alzheimer’s Disease Based on MRI Image Processing using Convolutional Neural Network (CNN) with AlexNet Architecture

Journal of Physics Conference Series ◽

10.1088/1742-6596/1844/1/012020 ◽

2021 ◽

Vol 1844 (1) ◽

pp. 012020

Author(s):

Y N Fu’adah ◽

I Wijayanto ◽

N K C Pratiwi ◽

F F Taliningsih ◽

S Rizal ◽

...

Keyword(s):

Neural Network ◽

Alzheimer’S Disease ◽

Image Processing ◽

Alzheimer's Disease ◽

Convolutional Neural Network ◽

Automated Classification ◽

Mri Image

Download Full-text

Multi-Digit Recognition using Image Processing and Neural Network

2021 International Conference on Emerging Smart Computing and Informatics (ESCI) ◽

10.1109/esci50559.2021.9397021 ◽

2021 ◽

Author(s):

Anway Shirgaonkar ◽

Neeraj Sahasrabudhe ◽

Prathamesh Sandikar ◽

Tapan Sawant ◽

Shahid Sayyad ◽

...

Keyword(s):

Neural Network ◽

Image Processing ◽

Digit Recognition

Download Full-text

Dynamic Hand Gesture Pattern Recognition Using Probabilistic Neural Network

2021 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS) ◽

10.1109/iemtronics52119.2021.9422496 ◽

2021 ◽

Author(s):

Debasish Bal ◽

Asif Mohammed Arfi ◽

Sujoy Dey

Keyword(s):

Neural Network ◽

Pattern Recognition ◽

Probabilistic Neural Network ◽

Hand Gesture

Download Full-text

Analysis of the Nosema Cells Identification for Microscopic Images

Sensors ◽

10.3390/s21093068 ◽

2021 ◽

Vol 21 (9) ◽

pp. 3068

Author(s):

Soumaya Dghim ◽

Carlos M. Travieso-González ◽

Radim Burget

Keyword(s):

Neural Network ◽

Machine Learning ◽

Image Processing ◽

Deep Learning ◽

The Other ◽

Support Vector ◽

Learning Approaches ◽

Microscopic Images ◽

Trained Neural Network ◽

Nosema Disease

The use of image processing tools, machine learning, and deep learning approaches has become very useful and robust in recent years. This paper introduces the detection of the Nosema disease, which is considered to be one of the most economically significant diseases today. This work shows a solution for recognizing and identifying Nosema cells between the other existing objects in the microscopic image. Two main strategies are examined. The first strategy uses image processing tools to extract the most valuable information and features from the dataset of microscopic images. Then, machine learning methods are applied, such as a neural network (ANN) and support vector machine (SVM) for detecting and classifying the Nosema disease cells. The second strategy explores deep learning and transfers learning. Several approaches were examined, including a convolutional neural network (CNN) classifier and several methods of transfer learning (AlexNet, VGG-16 and VGG-19), which were fine-tuned and applied to the object sub-images in order to identify the Nosema images from the other object images. The best accuracy was reached by the VGG-16 pre-trained neural network with 96.25%.

Download Full-text

Surface EMG-Based Instantaneous Hand Gesture Recognition Using Convolutional Neural Network with the Transfer Learning Method

Sensors ◽

10.3390/s21072540 ◽

2021 ◽

Vol 21 (7) ◽

pp. 2540

Author(s):

Zhipeng Yu ◽

Jianghai Zhao ◽

Yucheng Wang ◽

Linglong He ◽

Shaonan Wang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Transfer Learning ◽

Gesture Recognition ◽

Recognition System ◽

Surface Emg ◽

Hand Gesture Recognition ◽

Hand Gesture ◽

Training Time ◽

Generalization Ability

In recent years, surface electromyography (sEMG)-based human–computer interaction has been developed to improve the quality of life for people. Gesture recognition based on the instantaneous values of sEMG has the advantages of accurate prediction and low latency. However, the low generalization ability of the hand gesture recognition method limits its application to new subjects and new hand gestures, and brings a heavy training burden. For this reason, based on a convolutional neural network, a transfer learning (TL) strategy for instantaneous gesture recognition is proposed to improve the generalization performance of the target network. CapgMyo and NinaPro DB1 are used to evaluate the validity of our proposed strategy. Compared with the non-transfer learning (non-TL) strategy, our proposed strategy improves the average accuracy of new subject and new gesture recognition by 18.7% and 8.74%, respectively, when up to three repeated gestures are employed. The TL strategy reduces the training time by a factor of three. Experiments verify the transferability of spatial features and the validity of the proposed strategy in improving the recognition accuracy of new subjects and new gestures, and reducing the training burden. The proposed TL strategy provides an effective way of improving the generalization ability of the gesture recognition system.

Download Full-text