Sinhala Sign Language Interpreter Optimized for Real – Time Implementation on a Mobile Device

We present a high-quality sky segmentation model for depth refinement and investigate residual architecture performance to inform optimally shrinking the network. We describe a model that runs in near real-time on mobile device, present a new, highquality dataset, and detail a unique weighing to trade off false positives and false negatives in binary classifiers. We show how the optimizations improve bokeh rendering by correcting stereo depth misprediction in sky regions. We detail techniques used to preserve edges, reject false positives, and ensure generalization to the diversity of sky scenes. Finally, we present a compact model and compare performance of four popular residual architectures (ShuffleNet, MobileNetV2, Resnet-101, and Resnet-34-like) at constant computational cost.

Download Full-text

Real-Time Bangla Sign Language Detection with Sentence and Speech Generation

2020 23rd International Conference on Computer and Information Technology (ICCIT) ◽

10.1109/iccit51783.2020.9392693 ◽

2020 ◽

Author(s):

Dipon Talukder ◽

Fatima Jahara

Keyword(s):

Real Time ◽

Sign Language ◽

Speech Generation ◽

Language Detection

Download Full-text

Real-time Sign Language Recognition using Computer Vision

2021 3rd International Conference on Signal Processing and Communication (ICPSC) ◽

10.1109/icspc51351.2021.9451709 ◽

2021 ◽

Author(s):

Jinalee Jayeshkumar Raval ◽

Ruchi Gajjar

Keyword(s):

Computer Vision ◽

Real Time ◽

Sign Language ◽

Language Recognition ◽

Sign Language Recognition

Download Full-text

Two Dimensional Convolutional Neural Network Approach for Real-Time Bangla Sign Language Characters Recognition and Translation

SN Computer Science ◽

10.1007/s42979-021-00783-6 ◽

2021 ◽

Vol 2 (5) ◽

Author(s):

Md. Shahinur Alam ◽

Mahib Tanvir ◽

Dip Kumar Saha ◽

Sajal K. Das

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Real Time ◽

Sign Language ◽

Two Dimensional ◽

Network Approach ◽

Neural Network Approach

Download Full-text

Enabling Real-time Sign Language Translation on Mobile Platforms with On-board Depth Cameras

Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies ◽

10.1145/3463498 ◽

2021 ◽

Vol 5 (2) ◽

pp. 1-30

Author(s):

HyeonJung Park ◽

Youngki Lee ◽

JeongGil Ko

Keyword(s):

Real Time ◽

Sign Language ◽

Data Augmentation ◽

Language Translation ◽

Mobile Platforms ◽

Depth Cameras ◽

Language Data ◽

In The Wild ◽

Environmental Robustness ◽

Cloud Servers

In this work we present SUGO, a depth video-based system for translating sign language to text using a smartphone's front camera. While exploiting depth-only videos offer benefits such as being less privacy-invasive compared to using RGB videos, it introduces new challenges which include dealing with low video resolutions and the sensors' sensitiveness towards user motion. We overcome these challenges by diversifying our sign language video dataset to be robust to various usage scenarios via data augmentation and design a set of schemes to emphasize human gestures from the input images for effective sign detection. The inference engine of SUGO is based on a 3-dimensional convolutional neural network (3DCNN) to classify a sequence of video frames as a pre-trained word. Furthermore, the overall operations are designed to be light-weight so that sign language translation takes place in real-time using only the resources available on a smartphone, with no help from cloud servers nor external sensing components. Specifically, to train and test SUGO, we collect sign language data from 20 individuals for 50 Korean Sign Language words, summing up to a dataset of ~5,000 sign gestures and collect additional in-the-wild data to evaluate the performance of SUGO in real-world usage scenarios with different lighting conditions and daily activities. Comprehensively, our extensive evaluations show that SUGO can properly classify sign words with an accuracy of up to 91% and also suggest that the system is suitable (in terms of resource usage, latency, and environmental robustness) to enable a fully mobile solution for sign language translation.

Download Full-text