Global-Local Enhancement Network for NMF-Aware Sign Language Recognition

Author(s):  
Hezhen Hu ◽  
Wengang Zhou ◽  
Junfu Pu ◽  
Houqiang Li

Sign language recognition (SLR) is a challenging problem, involving complex manual features (i.e., hand gestures) and fine-grained non-manual features (NMFs) (e.g., facial expressions, mouth shapes). Although manual features are dominant, non-manual features also play an important role in the expression of a sign word. Specifically, many sign words convey different meanings due to non-manual features, even though they share the same hand gestures. This ambiguity introduces great challenges in the recognition of sign words. To tackle this issue, we propose a simple yet effective architecture called the Global-Local Enhancement Network (GLE-Net), consisting of two mutually promoted streams that address different crucial aspects of SLR: one stream captures the global contextual relationship, while the other captures discriminative fine-grained cues. Moreover, due to the lack of datasets explicitly focusing on this kind of feature, we introduce the first non-manual-feature-aware isolated Chinese sign language dataset (NMFs-CSL), with a total vocabulary of 1,067 sign words used in daily life. Extensive experiments on the NMFs-CSL and SLR500 datasets demonstrate the effectiveness of our method.
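The abstract describes a two-stream design without giving implementation details; the following is a minimal, hedged sketch of such a global/local two-stream classifier in PyTorch. The class name TwoStreamSLR, the tiny 3D-convolutional backbones, the feature size, and the concatenation-based fusion are all illustrative assumptions, not the authors' GLE-Net.

```python
# A minimal sketch of a global/local two-stream recognizer (PyTorch).
# Backbones, feature sizes, and fusion are assumptions, not GLE-Net itself.
import torch
import torch.nn as nn

class TwoStreamSLR(nn.Module):
    def __init__(self, num_classes=1067, feat_dim=256):
        super().__init__()
        def stream():  # tiny 3D-conv feature extractor over a video clip
            return nn.Sequential(
                nn.Conv3d(3, 64, kernel_size=3, padding=1), nn.ReLU(),
                nn.AdaptiveAvgPool3d(1), nn.Flatten(), nn.Linear(64, feat_dim))
        self.global_stream = stream()   # full-frame clip: global context
        self.local_stream = stream()    # cropped face/hand clip: fine-grained cues
        self.classifier = nn.Linear(2 * feat_dim, num_classes)

    def forward(self, full_clip, local_clip):
        g = self.global_stream(full_clip)             # (B, feat_dim)
        l = self.local_stream(local_clip)             # (B, feat_dim)
        return self.classifier(torch.cat([g, l], 1))  # fused class logits

# Clips of shape (batch, channels, frames, height, width).
logits = TwoStreamSLR()(torch.randn(2, 3, 16, 112, 112),
                        torch.randn(2, 3, 16, 112, 112))
```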

2018 ◽  
Vol 2 (1) ◽  
pp. 1-6
Author(s):  
Abdulla D. Hashim ◽  
Fattah Alizadeh

Deaf people around the world face difficulty communicating with others and therefore use their own language among themselves. It is also difficult for them to make use of technological services such as websites, television, and mobile applications. This project aims to design a prototype system that helps deaf people communicate with other people and with computers without relying on human interpreters. The proposed system targets letter-based Kurdish Sign Language (KuSL), which has not been addressed before. The system is a real-time system that acts immediately after detecting a hand gesture. Three algorithms for detecting KuSL have been implemented and tested: two are well-known methods previously implemented and tested by other researchers, and the third, named the grid-based gesture descriptor, is introduced in this paper for the first time. It turned out to be the best method for recognizing Kurdish hand signs, achieving 67% accuracy in detecting hand gestures. The other two well-known algorithms, the scale-invariant feature transform and speeded-up robust features, reached 42% accuracy.
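The paper names the new grid-based gesture descriptor, but the abstract does not define it; the sketch below shows one plausible reading, assuming a segmented binary hand mask divided into an N x N grid whose per-cell fill ratios form the feature vector, matched against per-letter templates by nearest neighbour. The function names, grid size, and matching rule are hypothetical.

```python
# Hedged sketch of a grid-based gesture descriptor: split the segmented hand
# mask into an N x N grid and use each cell's fill ratio as a feature.
# The paper's exact formulation is not published in the abstract; this is a guess.
import numpy as np

def grid_descriptor(hand_mask: np.ndarray, grid: int = 8) -> np.ndarray:
    """hand_mask: 2-D binary array (1 = hand pixel). Returns grid*grid features."""
    h, w = hand_mask.shape
    cells = []
    for i in range(grid):
        for j in range(grid):
            cell = hand_mask[i * h // grid:(i + 1) * h // grid,
                             j * w // grid:(j + 1) * w // grid]
            cells.append(cell.mean() if cell.size else 0.0)  # fill ratio per cell
    return np.asarray(cells, dtype=np.float32)

def classify(query: np.ndarray, templates: dict) -> str:
    # templates: {letter: descriptor}; nearest-neighbour match wins.
    return min(templates, key=lambda k: np.linalg.norm(templates[k] - query))
```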


Author(s):  
Sukhendra Singh ◽  
G. N. Rathna ◽  
Vivek Singhal

Introduction: Sign language is the only way for speech-impaired people to communicate. However, sign language is not known to most people, which creates a communication barrier. In this paper, we present a solution that captures hand gestures with a Kinect camera and classifies each gesture into its correct symbol. Method: We used a Kinect camera rather than an ordinary web camera because an ordinary camera does not capture the 3D orientation or depth of the scene, whereas the Kinect captures 3D depth images, which makes classification more accurate. Result: A Kinect camera produces different images for the hand gestures ‘2’ and ‘V’, and similarly for ‘1’ and ‘I’, whereas a normal web camera cannot distinguish between them. We used hand gestures for Indian Sign Language, and our dataset contained 46,339 RGB images and 46,339 depth images. 80% of the images were used for training and the remaining 20% for testing. In total, 36 hand gestures were considered: 26 for the alphabets A–Z and 10 for the digits 0–9. Conclusion: Along with a real-time implementation, we compare the performance of various machine learning models and find that a CNN on depth images gives the most accurate performance. All results were obtained on a PYNQ Z2 board.
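As a concrete illustration of the reported setup (an 80/20 split and a CNN over depth images for 36 classes), here is a hedged PyTorch sketch. The layer sizes, crop resolution, and optimizer settings are assumptions, and the random tensors only stand in for the actual Kinect depth data.

```python
# Minimal sketch: small CNN on single-channel depth crops, 36 classes,
# 80/20 train/test split. Hyper-parameters are assumptions, not the paper's.
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset, random_split

num_classes = 36  # 26 letters + 10 digits
model = nn.Sequential(
    nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
    nn.Flatten(), nn.Linear(32 * 16 * 16, num_classes))

# Placeholder tensors standing in for 64x64 depth crops and their labels.
images = torch.randn(1000, 1, 64, 64)
labels = torch.randint(0, num_classes, (1000,))
train_set, test_set = random_split(TensorDataset(images, labels), [800, 200])

optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()
for x, y in DataLoader(train_set, batch_size=32, shuffle=True):
    optimizer.zero_grad()
    loss_fn(model(x), y).backward()
    optimizer.step()
```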


Sensors ◽  
2020 ◽  
Vol 20 (14) ◽  
pp. 4025
Author(s):  
Zhanjun Hao ◽  
Yu Duan ◽  
Xiaochao Dang ◽  
Yang Liu ◽  
Daiyang Zhang

In recent years, with the development of wireless sensing technology and the widespread adoption of WiFi devices, WiFi-based human sensing has become possible, and gesture recognition has become an active topic in human-computer interaction. Sign language, as a kind of gesture, is widely used in daily life. An effective sign language recognition system can help people with aphasia or hearing impairment interact better with computers and facilitate their daily lives. To this end, this paper proposes a contactless fine-grained gesture recognition method using Channel State Information (CSI), named Wi-SL. The method uses a commercial WiFi device to establish a mapping between the subcarrier-level amplitude and phase-difference information in the wireless signal and sign language actions, without requiring the user to wear any device. We combine an efficient denoising method to filter environmental interference with an effective selection of optimal subcarriers to reduce the computational cost of the system. We also use K-means combined with a Bagging algorithm to optimize a Support Vector Machine (SVM) classification (KSB) model to enhance the classification of sign language action data. We implemented the algorithms and evaluated them in three different scenarios. The experimental results show that the average accuracy of Wi-SL gesture recognition reaches 95.8%, realizing device-free, non-invasive, high-precision sign language gesture recognition.
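The abstract names the KSB model (K-means combined with Bagging to optimize an SVM) without detailing how the pieces connect; the scikit-learn sketch below is one hedged interpretation, in which K-means cluster assignments are appended to the CSI feature vectors before training a bagged ensemble of RBF SVMs. The cluster count, ensemble size, and this particular way of combining the steps are assumptions.

```python
# Hedged sketch of a K-means + Bagging + SVM (KSB-style) classifier.
# Appending cluster ids to the CSI features is an assumption, not the paper's recipe.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.ensemble import BaggingClassifier
from sklearn.svm import SVC

def fit_ksb(csi_features: np.ndarray, labels: np.ndarray, n_clusters: int = 8):
    kmeans = KMeans(n_clusters=n_clusters, n_init=10).fit(csi_features)
    # Augment each sample with its cluster id as one extra coarse feature.
    augmented = np.hstack([csi_features, kmeans.labels_[:, None]])
    clf = BaggingClassifier(SVC(kernel="rbf"), n_estimators=10)
    clf.fit(augmented, labels)
    return kmeans, clf

def predict_ksb(kmeans, clf, csi_features: np.ndarray) -> np.ndarray:
    augmented = np.hstack([csi_features, kmeans.predict(csi_features)[:, None]])
    return clf.predict(augmented)
```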


2020 ◽  
Vol 5 (1) ◽  
Author(s):  
Kudirat O Jimoh ◽  
Anuoluwapo O Ajayi ◽  
Ibrahim K Ogundoyin

An Android-based sign language recognition system for selected English vocabularies was developed with the explicit objective of examining the specific characteristics responsible for gesture recognition. A recognition model for the process was designed, implemented, and evaluated on 230 samples of hand gestures. The collected samples were pre-processed and rescaled from 3024 × 4032 pixels to 245 × 350 pixels. The samples were examined for the specific characteristics using Oriented FAST and Rotated BRIEF (ORB), and Principal Component Analysis was used for feature extraction. The model was implemented in Android Studio using a template-matching algorithm as its classifier. The performance of the system was evaluated using precision, recall, and accuracy as metrics. The system obtained an average classification rate of 87%, an average precision of 88%, and an average recall of 91% on the test data of hand gestures. The study has therefore successfully classified hand gestures for selected English vocabularies. The developed system will enhance communication between hearing and hearing-impaired people, and also aid their teaching and learning processes. Future work includes exploring state-of-the-art machine learning techniques such as Generative Adversarial Networks (GANs) on larger datasets to improve accuracy.
Keywords: feature extraction; gesture recognition; sign language; vocabulary; Android device.
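The described pipeline (ORB description, PCA reduction, template matching) can be sketched with OpenCV and scikit-learn as below. The pooling of ORB descriptors into a single vector, the component count, and the nearest-template rule are illustrative assumptions rather than the paper's exact implementation.

```python
# Hedged sketch: ORB features, PCA reduction, nearest-template matching.
# Uses OpenCV and scikit-learn; all details here are assumptions.
import cv2
import numpy as np
from sklearn.decomposition import PCA

def orb_feature(image_bgr: np.ndarray, n_keypoints: int = 100) -> np.ndarray:
    gray = cv2.cvtColor(image_bgr, cv2.COLOR_BGR2GRAY)
    gray = cv2.resize(gray, (245, 350))                  # rescale as in the paper
    orb = cv2.ORB_create(nfeatures=n_keypoints)
    _, descriptors = orb.detectAndCompute(gray, None)
    if descriptors is None:                              # no keypoints detected
        descriptors = np.zeros((1, 32), dtype=np.uint8)
    return descriptors.astype(np.float32).mean(axis=0)   # pooled fixed-length vector

def match_gesture(query_img, template_imgs, template_labels, n_components=16):
    feats = np.stack([orb_feature(t) for t in template_imgs])
    pca = PCA(n_components=min(n_components, len(feats))).fit(feats)
    q = pca.transform(orb_feature(query_img)[None, :])
    t = pca.transform(feats)
    return template_labels[int(np.argmin(np.linalg.norm(t - q, axis=1)))]
```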


2020 ◽  
Vol 32 ◽  
pp. 02003
Author(s):  
Pritesh Ambavane ◽  
Rahul Karjavkar ◽  
Hemant Pathare ◽  
Shubham Relekar ◽  
Bhavana Alte ◽  
...  

Human beings connect with one another through thoughts and ideas, and the most natural way to present an idea is through speech. Some people do not have the power of speech; the only way they can communicate with others is through sign language. Nowadays, technology has reduced this gap through systems that convert the sign language used by these people into speech. Sign language recognition (SLR) and gesture-based control are two major applications of hand gesture recognition technologies. On the other side, the controller converts the sign language into text and speech with the help of text-to-speech and analog-to-digital conversion. People who cannot speak use sign language for communication throughout the world.


Author(s):  
D. Ivanko ◽  
D. Ryumin ◽  
A. Karpov

Abstract. Inability to use speech interfaces greatly limits deaf and hearing-impaired people in human-machine interaction. To solve this problem and to increase the accuracy and reliability of an automatic Russian sign language recognition system, we propose to use lip-reading in addition to hand gesture recognition. Deaf and hearing-impaired people use sign language as their main way of communicating in everyday life. Sign language is a structured form of hand gestures and lip movements involving visual motions and signs, used as a communication system. Since sign language includes not only hand gestures but also lip movements that mimic vocalized pronunciation, it is of interest to investigate how accurately such visual speech can be recognized by a lip-reading system, especially considering that the visual speech of hearing-impaired people is often characterized by hyper-articulation, which should potentially facilitate its recognition. For this purpose, the thesaurus of Russian sign language (TheRusLan), collected at SPIIRAS in 2018–19, was used. The database consists of color optical FullHD video recordings of 13 native Russian sign language signers (11 female and 2 male) from the “Pavlovsk boarding school for the hearing impaired”. Each signer demonstrated 164 phrases five times. This work covers the initial stages of the research, including data collection, data labeling, region-of-interest detection, and methods for informative feature extraction. The results of this study can later be used to create assistive technologies for deaf or hearing-impaired people.
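The abstract lists region-of-interest detection as one of the initial processing stages; the OpenCV sketch below illustrates one simple way such a lip ROI could be obtained, by detecting the face with a Haar cascade and cropping its lower third. The detector, crop rule, and output size are assumptions and are not taken from the TheRusLan pipeline.

```python
# Hedged sketch of a lip region-of-interest step: Haar-cascade face detection,
# then crop the lower third of the face as the mouth patch (OpenCV).
import cv2
import numpy as np

face_cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml")

def mouth_roi(frame_bgr: np.ndarray):
    gray = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2GRAY)
    faces = face_cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
    if len(faces) == 0:
        return None                                        # no face in this frame
    x, y, w, h = max(faces, key=lambda f: f[2] * f[3])     # largest detected face
    roi = frame_bgr[y + 2 * h // 3:y + h, x:x + w]         # lower third of the face
    return cv2.resize(roi, (64, 32))                       # fixed-size lip patch
```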


2021 ◽  
Vol 14 (4) ◽  
pp. 1-33
Author(s):  
Saad Hassan ◽  
Oliver Alonzo ◽  
Abraham Glasser ◽  
Matt Huenerfauth

Advances in sign-language recognition technology have enabled researchers to investigate various methods that can assist users in searching for an unfamiliar sign in ASL using sign-recognition technology. Users can generate a query by submitting a video of themselves performing the sign they believe they encountered somewhere and obtain a list of possible matches. However, there is disagreement among developers of such technology on how to report the performance of their systems, and prior research has not examined the relationship between the performance of search technology and users’ subjective judgements for this task. We conducted three studies using a Wizard-of-Oz prototype of a webcam-based ASL dictionary search system to investigate the relationship between the performance of such a system and user judgements. We found that, in addition to the position of the desired word in a list of results, the placement of the desired word above or below the fold and the similarity of the other words in the results list affected users’ judgements of the system. We also found that metrics that incorporate the precision of the overall list correlated better with users’ judgements than did metrics currently reported in prior ASL dictionary research.
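The finding that list-precision-aware metrics track user judgements better than position-only metrics can be made concrete with a small, purely illustrative comparison: reciprocal rank depends only on where the target sign appears, while the second score below also rewards how precise the rest of the list is. Neither function is claimed to be a metric actually used in the study.

```python
# Illustrative contrast between a position-only metric and a list-aware score.
def reciprocal_rank(results, target):
    return 1.0 / (results.index(target) + 1) if target in results else 0.0

def list_aware_score(results, target, relevant):
    # Mix position with the precision of the whole list (share of relevant items).
    rr = reciprocal_rank(results, target)
    precision = sum(r in relevant for r in results) / len(results)
    return 0.5 * rr + 0.5 * precision

results = ["HELP", "HOUSE", "HAT", "HAVE", "HAPPY"]
print(reciprocal_rank(results, "HAT"))                    # 0.333...
print(list_aware_score(results, "HAT", {"HAT", "HAVE"}))  # 0.366...
```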


Author(s):  
Srinivas K ◽  
Manoj Kumar Rajagopal

The goal is to recognize different hand gestures and achieve efficient classification of the static and dynamic hand movements used for communication. Static and dynamic hand movements are first captured using gesture recognition devices including the Kinect, hand movement sensors, connecting electrodes, and accelerometers. These gestures are processed using hand gesture recognition algorithms such as multivariate fuzzy decision trees, hidden Markov models (HMM), dynamic time warping, latent regression forests, support vector machines, and surface electromyography. Hand movements made with either one or both hands are captured by gesture capture devices under proper illumination conditions. The captured gestures are processed for occlusions and close finger interactions to identify the correct gesture, classify it, and ignore intermittent gestures. Real-time hand gesture recognition needs robust algorithms such as HMM to detect only the intended gesture. Classified gestures are then compared for effectiveness against standard training and test datasets such as sign language alphabets and the KTH dataset. Hand gesture recognition plays a very important role in applications such as sign language recognition, robotics, television control, rehabilitation, and music orchestration.
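Among the methods listed, dynamic time warping is easy to show in miniature; the sketch below computes a DTW alignment cost between two gesture trajectories and classifies a query by its nearest labelled template. The choice of per-frame features (here, 2-D hand positions) and the plain nearest-neighbour rule are assumptions for illustration.

```python
# Minimal dynamic time warping (DTW) sketch for matching gesture trajectories.
import numpy as np

def dtw_distance(a: np.ndarray, b: np.ndarray) -> float:
    """a, b: sequences of shape (T, D). Returns the DTW alignment cost."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(a[i - 1] - b[j - 1])
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return float(cost[n, m])

def classify_gesture(query, templates):
    # templates: list of (label, trajectory) pairs; nearest DTW neighbour wins.
    return min(templates, key=lambda t: dtw_distance(query, t[1]))[0]
```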

