MC-LSTM: Real-time 3D Human Action Detection System for Intelligent Healthcare Applications

Author(s):  
Jun Yin ◽  
Jun Han ◽  
Ruiqi Xie ◽  
Chenghao Wang ◽  
Xuyang Duan ◽  
...  
2021 ◽  
Author(s):  
Edwin Kwadwo Tenagyei ◽  
Zongbo Hao ◽  
Kwadwo Kusi ◽  
Kwabena Sarpong

Digital Twin ◽  
2021 ◽  
Vol 1 ◽  
pp. 10
Author(s):  
Qing Hong ◽  
Yifeng Sun ◽  
Tingyu Liu ◽  
Liang Fu ◽  
Yunfeng Xie

Background: Intelligent monitoring of human action in production is an important step to help standardize production processes and construct a digital twin shop-floor rapidly. Human action has a significant impact on the production safety and efficiency of a shop-floor, however, because of the high individual initiative of humans, it is difficult to realize real-time action detection in a digital twin shop-floor. Methods: We proposed a real-time detection approach for shop-floor production action. This approach used the sequence data of continuous human skeleton joints sequences as the input. We then reconstructed the Joint Classification-Regression Recurrent Neural Networks (JCR-RNN) based on Temporal Convolution Network (TCN) and Graph Convolution Network (GCN). We called this approach the Temporal Action Detection Net (TAD-Net), which realized real-time shop-floor production action detection. Results: The results of the verification experiment showed that our approach has achieved a high temporal positioning score, recognition speed, and accuracy when applied to the existing Online Action Detection (OAD) dataset and the Nanjing University of Science and Technology 3 Dimensions (NJUST3D) dataset. TAD-Net can meet the actual needs of the digital twin shop-floor. Conclusions: Our method has higher recognition accuracy, temporal positioning accuracy, and faster running speed than other mainstream network models, it can better meet actual application requirements, and has important research value and practical significance for standardizing shop-floor production processes, reducing production security risks, and contributing to the understanding of real-time production action.


Author(s):  
Mohammadamin Barekatain ◽  
Miquel Marti ◽  
Hsueh-Fu Shih ◽  
Samuel Murray ◽  
Kotaro Nakayama ◽  
...  

Author(s):  
Dianting Liu ◽  
Yilin Yan ◽  
Mei-Ling Shyu ◽  
Guiru Zhao ◽  
Min Chen

Understanding semantic meaning of human actions captured in unconstrained environments has broad applications in fields ranging from patient monitoring, human-computer interaction, to surveillance systems. However, while great progresses have been achieved on automatic human action detection and recognition in videos that are captured in controlled/constrained environments, most existing approaches perform unsatisfactorily on videos with uncontrolled/unconstrained conditions (e.g., significant camera motion, background clutter, scaling, and light conditions). To address this issue, the authors propose a robust human action detection and recognition framework that works effectively on videos taken in controlled or uncontrolled environments. Specifically, the authors integrate the optical flow field and Harris3D corner detector to generate a new spatial-temporal information representation for each video sequence, from which the general Gaussian mixture model (GMM) is learned. All the mean vectors of the Gaussian components in the generated GMM model are concatenated to create the GMM supervector for video action recognition. They build a boosting classifier based on a set of sparse representation classifiers and hamming distance classifiers to improve the accuracy of action recognition. The experimental results on two broadly used public data sets, KTH and UCF YouTube Action, show that the proposed framework outperforms the other state-of-the-art approaches on both action detection and recognition.


Sign in / Sign up

Export Citation Format

Share Document