Using 3D Convolutional Neural Networks to Learn Spatiotemporal Features for Automatic Surgical Gesture Recognition in Video

Author(s):  
Isabel Funke ◽  
Sebastian Bodenstedt ◽  
Florian Oehme ◽  
Felix von Bechtolsheim ◽  
Jürgen Weitz ◽  
...  
Author(s):  
Yang Yi ◽  
Feng Ni ◽  
Yuexin Ma ◽  
Xinge Zhu ◽  
Yuankai Qi ◽  
...  

State-of-the-art hand gesture recognition methods have investigated spatiotemporal features based on 3D convolutional neural networks (3D CNNs) or convolutional long short-term memory (ConvLSTM). However, they often suffer from inefficiency due to the high computational complexity of their network structures. In this paper, we focus instead on 1D convolutional neural networks and propose a simple and efficient architectural unit, the Multi-Kernel Temporal Block (MKTB), which models multi-scale temporal responses by explicitly applying temporal kernels of different sizes. We then present the Global Refinement Block (GRB), an attention module that shapes global temporal features based on cross-channel similarity. By incorporating the MKTB and GRB, our architecture can effectively explore spatiotemporal features at a tolerable computational cost. Extensive experiments on public datasets demonstrate that our proposed model achieves state-of-the-art accuracy with higher efficiency. Moreover, the proposed MKTB and GRB are plug-and-play modules; experiments on other tasks, such as video understanding and video-based person re-identification, also demonstrate their efficiency and ability to generalize.
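The core idea of the multi-kernel temporal block can be illustrated with a minimal NumPy sketch: run several temporal kernels of different sizes over the same feature sequence in parallel and fuse the multi-scale responses. The fixed moving-average kernels and the averaging fusion below are simplifying assumptions; in the actual MKTB the kernel weights are learned and the block sits inside a 1D CNN.

```python
import numpy as np

def temporal_conv1d(x, kernel):
    """Convolve each channel of x (channels, time) with a 1D kernel, same output length."""
    return np.stack([np.convolve(c, kernel, mode="same") for c in x])

def multi_kernel_temporal_block(x, kernel_sizes=(3, 5, 7)):
    """Sketch of a multi-kernel temporal block: apply temporal kernels of
    several sizes in parallel and average the multi-scale responses.
    (Moving-average kernels stand in for the learned weights of a real MKTB.)"""
    branches = [temporal_conv1d(x, np.ones(k) / k) for k in kernel_sizes]
    return np.mean(branches, axis=0)

# A toy feature map: 2 channels, 8 time steps.
features = np.arange(16, dtype=float).reshape(2, 8)
out = multi_kernel_temporal_block(features)
print(out.shape)  # the block preserves the (channels, time) shape: (2, 8)
```

Because each branch preserves the sequence length, the fused output can replace the input anywhere in the network, which is what makes such a block "plug-and-play".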


2017 ◽  
Vol 10 (27) ◽  
pp. 1329-1342 ◽  
Author(s):  
Javier O. Pinzon Arenas ◽  
Robinson Jimenez Moreno ◽  
Paula C. Useche Murillo

This paper presents the implementation of a Region-based Convolutional Neural Network for the recognition and localization of hand gestures, in this case two gesture types, open and closed hand, with the aim of recognizing such gestures against dynamic backgrounds. The network is trained and validated, achieving 99.4% validation accuracy in gesture recognition and 25% average accuracy in RoI localization. It is then tested in real time, where its operation is verified through recognition times, its behavior on trained and untrained gestures, and complex backgrounds.
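Localization accuracy for region proposals, such as the RoI figure quoted above, is conventionally scored by intersection-over-union between the predicted and ground-truth boxes. A minimal sketch (the box coordinates are illustrative, not from the paper):

```python
def iou(box_a, box_b):
    """Intersection-over-union of two axis-aligned boxes given as (x1, y1, x2, y2).
    A predicted RoI is typically counted as correct when IoU exceeds a threshold."""
    x1 = max(box_a[0], box_b[0])
    y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2])
    y2 = min(box_a[3], box_b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

# Predicted vs. ground-truth hand box (toy values):
score = iou((10, 10, 50, 50), (20, 20, 60, 60))
print(round(score, 3))  # 0.391
```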


2021 ◽  
Author(s):  
Arpita Vats

<p>This paper introduces a hand gesture recognition system that recognizes characters in real time. The system consists of three modules: real-time hand tracking, gesture training, and gesture recognition using convolutional neural networks. The CamShift algorithm and hand-blob analysis are used for hand tracking, yielding motion descriptors and the hand region. The system is fairly robust to background clutter and uses skin color for hand gesture tracking and recognition. Furthermore, techniques are proposed to improve recognition performance and accuracy, such as selecting the training images and applying an adaptive gesture threshold to remove non-gesture patterns, which helps qualify an input pattern as a gesture. In experiments on a vocabulary of 36 gestures, covering the alphabet and digits, the results show the effectiveness of the approach.</p>
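Skin-color segmentation of the kind that typically feeds a CamShift tracker can be sketched as a simple HSV threshold. The range values below are hypothetical placeholders, not the paper's parameters, and real CamShift pipelines usually use hue-histogram back-projection rather than a fixed threshold:

```python
import numpy as np

def skin_mask(frame_hsv, h_range=(0, 20), s_range=(40, 255), v_range=(60, 255)):
    """Sketch of skin-colour thresholding: keep pixels whose HSV values fall
    inside an (assumed) skin range. frame_hsv has shape (H, W, 3)."""
    h, s, v = frame_hsv[..., 0], frame_hsv[..., 1], frame_hsv[..., 2]
    return ((h >= h_range[0]) & (h <= h_range[1]) &
            (s >= s_range[0]) & (s <= s_range[1]) &
            (v >= v_range[0]) & (v <= v_range[1]))

# Toy 2x2 "frame": one skin-like pixel, three background pixels.
frame = np.array([[[10, 120, 200], [100, 50, 50]],
                  [[10, 10, 200], [150, 200, 30]]], dtype=np.uint8)
print(skin_mask(frame).sum())  # 1 pixel passes the threshold
```

The resulting binary mask is what a blob-analysis step would then use to isolate the hand region before tracking.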


2021 ◽  
Vol 5 (2 (113)) ◽  
pp. 44-54
Author(s):  
Chingiz Kenshimov ◽  
Samat Mukhanov ◽  
Timur Merembayev ◽  
Didar Yedilkhan

For people with disabilities, sign language is the most important means of communication. Therefore, more and more researchers around the world are proposing intelligent hand gesture recognition systems. Such a system is aimed not only at those who wish to understand a sign language, but also at those who wish to speak using gesture recognition software. In this paper, a new benchmark dataset for Kazakh fingerspelling, suitable for training deep neural networks, is introduced. The dataset contains more than 10,122 gesture samples for 42 letters of the alphabet. The alphabet has its own peculiarities, as some characters are shown in motion, which may influence sign recognition. The paper describes research, analysis, comparison, and testing of convolutional neural networks: LeNet, AlexNet, ResNet, and EfficientNet (EfficientNetB7). The EfficientNet architecture is state of the art (SOTA) and is new compared to the other architectures under consideration. On this dataset, we show that the LeNet and EfficientNet networks outperform the other competing algorithms. Moreover, EfficientNet can achieve state-of-the-art performance on other hand gesture datasets as well. The architecture and operating principle of these algorithms reflect the effectiveness of their application to sign language recognition. The CNN models are evaluated using accuracy and the penalty matrix. During training, LeNet and EfficientNet showed the best results: their accuracy and loss curves followed similar, close trends. The results of EfficientNet were explained with the SHapley Additive exPlanations (SHAP) framework, which probed the model to detect complex relationships between features in the images. Building on the SHAP tool may help to further improve the accuracy of the model.
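The accuracy-plus-penalty-matrix evaluation described above reduces to a standard confusion matrix: rows index the true letters, columns the predicted ones, and accuracy is the trace over the total. A minimal sketch with toy labels (the three-class example is illustrative, not the 42-letter dataset):

```python
import numpy as np

def confusion_matrix(y_true, y_pred, n_classes):
    """Confusion ("penalty") matrix: rows are true classes, columns predictions.
    Off-diagonal cells show which letters the classifier confuses."""
    m = np.zeros((n_classes, n_classes), dtype=int)
    for t, p in zip(y_true, y_pred):
        m[t, p] += 1
    return m

y_true = [0, 1, 2, 2, 1]
y_pred = [0, 1, 2, 1, 1]
cm = confusion_matrix(y_true, y_pred, n_classes=3)
accuracy = np.trace(cm) / cm.sum()
print(accuracy)  # 0.8 (4 of 5 samples on the diagonal)
```

For a fingerspelling alphabet where some letters involve motion, the off-diagonal cells are the interesting part: they reveal exactly which motion-dependent signs get mistaken for static ones.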



