STOP: Space-Time Occupancy Patterns for 3D Action Recognition from Depth Map Sequences

Background & Objective: Identification of human actions from video has gathered much attention in past few years. Most of the computer vision tasks such as Health Care Activity Detection, Suspicious Activity detection, Human Computer Interactions etc. are based on the principle of activity detection. Automatic labelling of activity from videos frames is known as activity detection. Motivation of this work is to use most out of the data generated from sensors and use them for recognition of classes. Recognition of actions from videos sequences is a growing field with the upcoming trends of deep neural networks. Automatic learning capability of Convolutional Neural Network (CNN) make them good choice as compared to traditional handcrafted based approaches. With the increasing demand of RGB-D sensors combination of RGB and depth data is in great demand. This work comprises of the use of dynamic images generated from RGB combined with depth map for action recognition purpose. We have experimented our approach on pre trained VGG-F model using MSR Daily activity dataset and UTD MHAD Dataset. We achieve state of the art results. To support our research, we have calculated different parameters apart from accuracy such as precision, F score, recall. Conclusion: Accordingly, the investigation confirms improvement in term of accuracy, precision, F-Score and Recall. The proposed model is 4 Stream model is prone to occlusion, used in real time and also the data from the RGB-D sensor is fully utilized.

Download Full-text

Exploring 3D Human Action Recognition Using STACOG on Multi-View Depth Motion Maps Sequences

Sensors ◽

10.3390/s21113642 ◽

2021 ◽

Vol 21 (11) ◽

pp. 3642

Author(s):

Mohammad Farhad Bulbul ◽

Sadiya Tabussum ◽

Hazrat Ali ◽

Wenli Zheng ◽

Mi Young Lee ◽

...

Keyword(s):

Action Recognition ◽

Depth Map ◽

Human Action Recognition ◽

Human Action ◽

Collaborative Representation ◽

Auto Correlation ◽

Time Operation ◽

Real Time Operation ◽

Benchmark Datasets ◽

Depth Motion Maps

This paper proposes an action recognition framework for depth map sequences using the 3D Space-Time Auto-Correlation of Gradients (STACOG) algorithm. First, each depth map sequence is split into two sets of sub-sequences of two different frame lengths individually. Second, a number of Depth Motion Maps (DMMs) sequences from every set are generated and are fed into STACOG to find an auto-correlation feature vector. For two distinct sets of sub-sequences, two auto-correlation feature vectors are obtained and applied gradually to L2-regularized Collaborative Representation Classifier (L2-CRC) for computing a pair of sets of residual values. Next, the Logarithmic Opinion Pool (LOGP) rule is used to combine the two different outcomes of L2-CRC and to allocate an action label of the depth map sequence. Finally, our proposed framework is evaluated on three benchmark datasets named MSR-action 3D dataset, DHA dataset, and UTD-MHAD dataset. We compare the experimental results of our proposed framework with state-of-the-art approaches to prove the effectiveness of the proposed framework. The computational efficiency of the framework is also analyzed for all the datasets to check whether it is suitable for real-time operation or not.

Download Full-text

Space-Time Robust Representation for Action Recognition

2013 IEEE International Conference on Computer Vision ◽

10.1109/iccv.2013.336 ◽

2013 ◽

Cited By ~ 22

Author(s):

Nicolas Ballas ◽

Yi Yang ◽

Zhen-Zhong Lan ◽

Bertrand Delezoide ◽

Francoise Preteux ◽

...

Keyword(s):

Action Recognition ◽

Space Time ◽

Robust Representation

Download Full-text

Space-time tree ensemble for action recognition

2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) ◽

10.1109/cvpr.2015.7299137 ◽

2015 ◽

Cited By ~ 24

Author(s):

Shugao Ma ◽

Leonid Sigal ◽

Stan Sclaroff

Keyword(s):

Action Recognition ◽

Space Time

Download Full-text

A Hierarchical Bag-of-Words Model Based on Local Space-Time Features for Human Action Recognition

2013 International Conference on IT Convergence and Security (ICITCS) ◽

10.1109/icitcs.2013.6717776 ◽

2013 ◽

Cited By ~ 1

Author(s):

Jiangwei Wu ◽

Daobing Zhou ◽

Guoqiang Xiao

Keyword(s):

Action Recognition ◽

Human Action Recognition ◽

Human Action ◽

Space Time ◽

Bag Of Words ◽

Model Based ◽

Local Space

Download Full-text

Automatic Badminton Action Recognition Using RGB-D Sensor

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.1042.89 ◽

2014 ◽

Vol 1042 ◽

pp. 89-93 ◽

Cited By ~ 4

Author(s):

H.Y. Ting ◽

K.S. Sim ◽

F.S. Abas

Keyword(s):

Action Recognition ◽

Support Vector Machine Classifier ◽

Recognition Accuracy ◽

Depth Map ◽

Microsoft Kinect ◽

Experimental Result ◽

Support Vector ◽

Validation Test ◽

And Gender ◽

Microsoft Kinect Sensor

This paper presents a method to recognize badminton action from depth map sequences acquired by Microsoft Kinect sensor. Badminton is one of Malaysia’s most popular, but there is still lack of research on action recognition focusing on this sport. In this research, bone orientation details of badminton players are computed and extracted in order to form a bag of quaternions feature vectors. After conversion to log-covariance matrix, the system is trained and the badminton actions are classified by a support vector machine classifier. Our experimental dataset of depth map sequences composed of 300 badminton action samples of 10 badminton actions performed by six badminton players. The dataset varies in terms of human body size, clothes, speed, and gender. Experimental result has shown that nearly 92% of average recognition accuracy (ARA) was achieved in inter-class leave one sample out cross validation test. At the same time, 86% of ARA was achieved in inter-class cross subject validation test.

Download Full-text