MC-LSTM: Real-time 3D Human Action Detection System for Intelligent Healthcare Applications

Background: Intelligent monitoring of human action in production is an important step to help standardize production processes and construct a digital twin shop-floor rapidly. Human action has a significant impact on the production safety and efficiency of a shop-floor, however, because of the high individual initiative of humans, it is difficult to realize real-time action detection in a digital twin shop-floor. Methods: We proposed a real-time detection approach for shop-floor production action. This approach used the sequence data of continuous human skeleton joints sequences as the input. We then reconstructed the Joint Classification-Regression Recurrent Neural Networks (JCR-RNN) based on Temporal Convolution Network (TCN) and Graph Convolution Network (GCN). We called this approach the Temporal Action Detection Net (TAD-Net), which realized real-time shop-floor production action detection. Results: The results of the verification experiment showed that our approach has achieved a high temporal positioning score, recognition speed, and accuracy when applied to the existing Online Action Detection (OAD) dataset and the Nanjing University of Science and Technology 3 Dimensions (NJUST3D) dataset. TAD-Net can meet the actual needs of the digital twin shop-floor. Conclusions: Our method has higher recognition accuracy, temporal positioning accuracy, and faster running speed than other mainstream network models, it can better meet actual application requirements, and has important research value and practical significance for standardizing shop-floor production processes, reducing production security risks, and contributing to the understanding of real-time production action.

Download Full-text

Fast action proposals for human action detection and search

2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) ◽

10.1109/cvpr.2015.7298735 ◽

2015 ◽

Cited By ~ 88

Author(s):

Gang Yu ◽

Junsong Yuan

Keyword(s):

Human Action ◽

Action Detection ◽

Human Action Detection

Download Full-text

Human Action Detection and Recognition Using SIFT and SVM

Communications in Computer and Information Science - Cognitive Computing and Information Processing ◽

10.1007/978-981-10-9059-2_42 ◽

2018 ◽

pp. 475-491 ◽

Cited By ~ 2

Author(s):

Praveen M. Dhulavvagol ◽

Niranjan C. Kundur

Keyword(s):

Human Action ◽

Action Detection ◽

Human Action Detection ◽

Detection And Recognition

Download Full-text

Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) ◽

10.1109/cvprw.2017.267 ◽

2017 ◽

Cited By ~ 35

Author(s):

Mohammadamin Barekatain ◽

Miquel Marti ◽

Hsueh-Fu Shih ◽

Samuel Murray ◽

Kotaro Nakayama ◽

...

Keyword(s):

Human Action ◽

Action Detection ◽

Human Action Detection ◽

Aerial View

Download Full-text

Human action detection using PNF propagation of temporal constraints

Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231) ◽

10.1109/cvpr.1998.698711 ◽

2002 ◽

Cited By ~ 36

Author(s):

C.S. Pinhanez ◽

A.F. Bobick

Keyword(s):

Human Action ◽

Temporal Constraints ◽

Action Detection ◽

Human Action Detection

Download Full-text

Efficient Human Action Detection Using a Transferable Distance Function

Computer Vision – ACCV 2009 - Lecture Notes in Computer Science ◽

10.1007/978-3-642-12304-7_39 ◽

2010 ◽

pp. 417-426 ◽

Cited By ~ 3

Author(s):

Weilong Yang ◽

Yang Wang ◽

Greg Mori

Keyword(s):

Distance Function ◽

Human Action ◽

Action Detection ◽

Human Action Detection

Download Full-text

Spatio-Temporal Analysis for Human Action Detection and Recognition in Uncontrolled Environments

International Journal of Multimedia Data Engineering and Management ◽

10.4018/ijmdem.2015010101 ◽

2015 ◽

Vol 6 (1) ◽

pp. 1-18 ◽

Cited By ~ 28

Author(s):

Dianting Liu ◽

Yilin Yan ◽

Mei-Ling Shyu ◽

Guiru Zhao ◽

Min Chen

Keyword(s):

Action Recognition ◽

Hamming Distance ◽

Gaussian Mixture ◽

Human Action ◽

Surveillance Systems ◽

Camera Motion ◽

Action Detection ◽

Public Data ◽

Human Action Detection ◽

Detection And Recognition

Understanding semantic meaning of human actions captured in unconstrained environments has broad applications in fields ranging from patient monitoring, human-computer interaction, to surveillance systems. However, while great progresses have been achieved on automatic human action detection and recognition in videos that are captured in controlled/constrained environments, most existing approaches perform unsatisfactorily on videos with uncontrolled/unconstrained conditions (e.g., significant camera motion, background clutter, scaling, and light conditions). To address this issue, the authors propose a robust human action detection and recognition framework that works effectively on videos taken in controlled or uncontrolled environments. Specifically, the authors integrate the optical flow field and Harris3D corner detector to generate a new spatial-temporal information representation for each video sequence, from which the general Gaussian mixture model (GMM) is learned. All the mean vectors of the Gaussian components in the generated GMM model are concatenated to create the GMM supervector for video action recognition. They build a boosting classifier based on a set of sparse representation classifiers and hamming distance classifiers to improve the accuracy of action recognition. The experimental results on two broadly used public data sets, KTH and UCF YouTube Action, show that the proposed framework outperforms the other state-of-the-art approaches on both action detection and recognition.

Download Full-text