Real-Time Human Action Recognition System Using Depth Map Sequences

This work presents a real-time human action recognition system that uses depth map sequence as input. The system contains the segmentation of human, the action modeling based on 3D shape context and the action graph algorithm. We effectively solve the problem of segmenting human from complex and cluttered scenes by combing a novel quadtree split-and-merge method and the codebook background modeling algorithm. We aims at recognizing actions that are used in games and interactions, especially complex actions that contain foot motion and body heave. By expanding the shape context descriptor into 3D space, we obtain translation and scale invariant features and get rid of normalization error, which is a common problem of real-life applications. Experiments in various scenarios demonstrate the high speed and excellent performance of our procedure.

Download Full-text

A Real-Time Human Action Recognition System Using Depth and Inertial Sensor Fusion

IEEE Sensors Journal ◽

10.1109/jsen.2015.2487358 ◽

2016 ◽

Vol 16 (3) ◽

pp. 773-781 ◽

Cited By ~ 73

Author(s):

Chen Chen ◽

Roozbeh Jafari ◽

Nasser Kehtarnavaz

Keyword(s):

Real Time ◽

Sensor Fusion ◽

Action Recognition ◽

Inertial Sensor ◽

Human Action Recognition ◽

Recognition System ◽

Human Action

Download Full-text

Real-Time Implementation of Human Action Recognition System Based on Motion Analysis

Artificial Intelligence and Computer Vision - Studies in Computational Intelligence ◽

10.1007/978-3-319-46245-5_9 ◽

2016 ◽

pp. 143-164 ◽

Cited By ~ 4

Author(s):

Kamal Sehairi ◽

Cherrad Benbouchama ◽

El Houari Kobzili ◽

Fatima Chouireb

Keyword(s):

Real Time ◽

Motion Analysis ◽

Action Recognition ◽

Human Action Recognition ◽

Recognition System ◽

Human Action

Download Full-text

A View-Based Real-Time Human Action Recognition System as an Interface for Human Computer Interaction

Virtual Systems and Multimedia - Lecture Notes in Computer Science ◽

10.1007/978-3-540-78566-8_10 ◽

2008 ◽

pp. 112-120 ◽

Cited By ~ 7

Author(s):

Jin Choi ◽

Yong-il Cho ◽

Taewoo Han ◽

Hyun S. Yang

Keyword(s):

Human Computer Interaction ◽

Real Time ◽

Action Recognition ◽

Human Action Recognition ◽

Recognition System ◽

Human Action ◽

Computer Interaction

Download Full-text

Low-Cost Embedded System Using Convolutional Neural Networks-Based Spatiotemporal Feature Map for Real-Time Human Action Recognition

Applied Sciences ◽

10.3390/app11114940 ◽

2021 ◽

Vol 11 (11) ◽

pp. 4940

Author(s):

Jinsoo Kim ◽

Jeongho Cho

Keyword(s):

Embedded System ◽

Real Time ◽

Action Recognition ◽

Processing Speed ◽

Recognition Accuracy ◽

Low Cost ◽

Human Action Recognition ◽

Human Action ◽

Video Data ◽

Feature Maps

The field of research related to video data has difficulty in extracting not only spatial but also temporal features and human action recognition (HAR) is a representative field of research that applies convolutional neural network (CNN) to video data. The performance for action recognition has improved, but owing to the complexity of the model, some still limitations to operation in real-time persist. Therefore, a lightweight CNN-based single-stream HAR model that can operate in real-time is proposed. The proposed model extracts spatial feature maps by applying CNN to the images that develop the video and uses the frame change rate of sequential images as time information. Spatial feature maps are weighted-averaged by frame change, transformed into spatiotemporal features, and input into multilayer perceptrons, which have a relatively lower complexity than other HAR models; thus, our method has high utility in a single embedded system connected to CCTV. The results of evaluating action recognition accuracy and data processing speed through challenging action recognition benchmark UCF-101 showed higher action recognition accuracy than the HAR model using long short-term memory with a small amount of video frames and confirmed the real-time operational possibility through fast data processing speed. In addition, the performance of the proposed weighted mean-based HAR model was verified by testing it in Jetson NANO to confirm the possibility of using it in low-cost GPU-based embedded systems.

Download Full-text

Exploring 3D Human Action Recognition Using STACOG on Multi-View Depth Motion Maps Sequences

Sensors ◽

10.3390/s21113642 ◽

2021 ◽

Vol 21 (11) ◽

pp. 3642

Author(s):

Mohammad Farhad Bulbul ◽

Sadiya Tabussum ◽

Hazrat Ali ◽

Wenli Zheng ◽

Mi Young Lee ◽

...

Keyword(s):

Action Recognition ◽

Depth Map ◽

Human Action Recognition ◽

Human Action ◽

Collaborative Representation ◽

Auto Correlation ◽

Time Operation ◽

Real Time Operation ◽

Benchmark Datasets ◽

Depth Motion Maps

This paper proposes an action recognition framework for depth map sequences using the 3D Space-Time Auto-Correlation of Gradients (STACOG) algorithm. First, each depth map sequence is split into two sets of sub-sequences of two different frame lengths individually. Second, a number of Depth Motion Maps (DMMs) sequences from every set are generated and are fed into STACOG to find an auto-correlation feature vector. For two distinct sets of sub-sequences, two auto-correlation feature vectors are obtained and applied gradually to L2-regularized Collaborative Representation Classifier (L2-CRC) for computing a pair of sets of residual values. Next, the Logarithmic Opinion Pool (LOGP) rule is used to combine the two different outcomes of L2-CRC and to allocate an action label of the depth map sequence. Finally, our proposed framework is evaluated on three benchmark datasets named MSR-action 3D dataset, DHA dataset, and UTD-MHAD dataset. We compare the experimental results of our proposed framework with state-of-the-art approaches to prove the effectiveness of the proposed framework. The computational efficiency of the framework is also analyzed for all the datasets to check whether it is suitable for real-time operation or not.

Download Full-text

Real-time human action recognition based on motion shapes

10.47749/t/unicamp.2014.932501 ◽

2014 ◽

Author(s):

Thierry Pinheiro Moreira

Keyword(s):

Real Time ◽

Action Recognition ◽

Human Action Recognition ◽

Human Action

Download Full-text

A review of real-time human action recognition involving vision sensing

Real-Time Image Processing and Deep Learning 2021 ◽

10.1117/12.2585680 ◽

2021 ◽

Author(s):

Sharmin Majumder ◽

Nasser Kehtarnavaz

Keyword(s):

Real Time ◽

Action Recognition ◽

Human Action Recognition ◽

Human Action ◽

Vision Sensing

Download Full-text

Real Time Human Action Recognition Using Full and Ultra High Definition Video

2015 International Conference on Computational Science and Computational Intelligence (CSCI) ◽

10.1109/csci.2015.12 ◽

2015 ◽

Cited By ~ 3

Author(s):

Gloria Castro-Munoz ◽

Jorge Martinez-Carballido

Keyword(s):

Real Time ◽

Action Recognition ◽

Human Action Recognition ◽

Human Action ◽

High Definition ◽

High Definition Video

Download Full-text

A Low-Dimensional Radial Silhouette-Based Feature for Fast Human Action Recognition Fusing Multiple Views

International Scholarly Research Notices ◽

10.1155/2014/547069 ◽

2014 ◽

Vol 2014 ◽

pp. 1-11 ◽

Cited By ~ 8

Author(s):

Alexandros Andre Chaaraoui ◽

Francisco Flórez-Revuelta

Keyword(s):

Real Time ◽

Action Recognition ◽

Assisted Living ◽

Learning Algorithm ◽

Ambient Assisted Living ◽

Human Action Recognition ◽

Human Action ◽

Sequence Matching ◽

Low Dimensional ◽

Video Frequency

This paper presents a novel silhouette-based feature for vision-based human action recognition, which relies on the contour of the silhouette and a radial scheme. Its low-dimensionality and ease of extraction result in an outstanding proficiency for real-time scenarios. This feature is used in a learning algorithm that by means of model fusion of multiple camera streams builds a bag of key poses, which serves as a dictionary of known poses and allows converting the training sequences into sequences of key poses. These are used in order to perform action recognition by means of a sequence matching algorithm. Experimentation on three different datasets returns high and stable recognition rates. To the best of our knowledge, this paper presents the highest results so far on the MuHAVi-MAS dataset. Real-time suitability is given, since the method easily performs above video frequency. Therefore, the related requirements that applications as ambient-assisted living services impose are successfully fulfilled.

Download Full-text