Hand shape estimation under complex backgrounds for sign language recognition

Author(s):  
Y. Hamada ◽  
N. Shimada ◽  
Y. Shirai
2020 ◽  
Vol 34 (07) ◽  
pp. 13009-13016 ◽  
Author(s):  
Hao Zhou ◽  
Wengang Zhou ◽  
Yun Zhou ◽  
Houqiang Li

Despite the recent success of deep learning in continuous sign language recognition (CSLR), deep models typically focus on the most discriminative features, ignoring other potentially non-trivial and informative contents. Such characteristic heavily constrains their capability to learn implicit visual grammars behind the collaboration of different visual cues (i,e., hand shape, facial expression and body posture). By injecting multi-cue learning into neural network design, we propose a spatial-temporal multi-cue (STMC) network to solve the vision-based sequence learning problem. Our STMC network consists of a spatial multi-cue (SMC) module and a temporal multi-cue (TMC) module. The SMC module is dedicated to spatial representation and explicitly decomposes visual features of different cues with the aid of a self-contained pose estimation branch. The TMC module models temporal correlations along two parallel paths, i.e., intra-cue and inter-cue, which aims to preserve the uniqueness and explore the collaboration of multiple cues. Finally, we design a joint optimization strategy to achieve the end-to-end sequence learning of the STMC network. To validate the effectiveness, we perform experiments on three large-scale CSLR benchmarks: PHOENIX-2014, CSL and PHOENIX-2014-T. Experimental results demonstrate that the proposed method achieves new state-of-the-art performance on all three benchmarks.


Author(s):  
Yong Hu

With a wide variety of big data applications, Sign Language Recognition has become one of the most important research areas in the field of human-computer interaction. Despite recent progresses, the task of classifying finger spelling is still very challenging in Sign Language Recognition. The visually similarity of some signs, the invisibility of the thumb and the large amount of variation by different signers are all make the hand shape recognition very challenging. The work presented in this paper aims to evaluate the performance of some state-of-the-art features for static finger spelling of alphabets in sign language recognition. The comparison experiments were implemented and tested using two popular data sets. Based on the experimental results, analysis and recommendations are given on the efficiency and capabilities of the compared features.


Sign in / Sign up

Export Citation Format

Share Document