Spatio-Temporal Image Representation of 3D Skeletal Movements for View-Invariant Action Recognition with Deep Convolutional Neural Networks

Designing motion representations for 3D human action recognition from skeleton sequences is an important yet challenging task. An effective representation should be robust to noise, invariant to viewpoint changes, and deliver good performance at low computational cost. Two main challenges are how to efficiently represent the spatio-temporal patterns of skeletal movements and how to learn their discriminative features for classification. This paper presents a novel skeleton-based representation and a deep learning framework for 3D action recognition using RGB-D sensors. We propose an action map called SPMF (Skeleton Posture-Motion Feature), a compact image representation built from skeleton poses and their motions. An Adaptive Histogram Equalization (AHE) algorithm is then applied to the SPMF to enhance its local patterns and form an enhanced action map, namely Enhanced-SPMF. For learning and classification, we exploit Deep Convolutional Neural Networks based on the DenseNet architecture to directly learn an end-to-end mapping between input skeleton sequences and their action labels via the Enhanced-SPMFs. The proposed method is evaluated on four challenging benchmark datasets covering individual actions, interactions, multiview and large-scale settings. The experimental results demonstrate that the proposed method outperforms previous state-of-the-art approaches on all benchmark tasks while requiring low computational time for training and inference.
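The abstract does not give the exact SPMF construction, so the following is only a minimal sketch of the general idea of turning a skeleton sequence into an image. It assumes each frame is a list of (x, y, z) joint coordinates, maps them to RGB-like values in [0, 255], computes motions as frame-to-frame joint displacements, and uses a plain global histogram equalization as a simplified stand-in for the adaptive variant (AHE). All function names are hypothetical and not taken from the paper.

```python
from typing import List, Tuple

Joint = Tuple[float, float, float]
Frame = List[Joint]


def normalize_to_bytes(values: List[float]) -> List[int]:
    """Linearly rescale values to integers in [0, 255]."""
    lo, hi = min(values), max(values)
    span = hi - lo
    if span == 0:
        return [0 for _ in values]
    return [round(255 * (v - lo) / span) for v in values]


def skeleton_to_image(frames: List[Frame]) -> List[List[Tuple[int, ...]]]:
    """Encode poses as image[joint][frame] = (R, G, B) derived from (x, y, z)."""
    n_frames, n_joints = len(frames), len(frames[0])
    flat = [c for frame in frames for joint in frame for c in joint]
    scaled = normalize_to_bytes(flat)
    image = [[(0, 0, 0)] * n_frames for _ in range(n_joints)]
    i = 0
    for t in range(n_frames):
        for j in range(n_joints):
            image[j][t] = tuple(scaled[i:i + 3])
            i += 3
    return image


def motion_frames(frames: List[Frame]) -> List[Frame]:
    """Joint displacements between consecutive frames (assumed motion cue)."""
    return [
        [tuple(b - a for a, b in zip(j0, j1)) for j0, j1 in zip(f0, f1)]
        for f0, f1 in zip(frames, frames[1:])
    ]


def equalize(channel: List[int]) -> List[int]:
    """Global histogram equalization of 0..255 values (not adaptive)."""
    hist = [0] * 256
    for v in channel:
        hist[v] += 1
    cdf, total = [], 0
    for h in hist:
        total += h
        cdf.append(total)
    cdf_min = next(c for c in cdf if c > 0)
    n = len(channel)
    if n == cdf_min:  # all pixels share one value; nothing to stretch
        return list(channel)
    return [round(255 * (cdf[v] - cdf_min) / (n - cdf_min)) for v in channel]
```

In such a pipeline, the pose image and the image built from `motion_frames` would be combined into one action map, enhanced, and then passed to an image classifier such as a DenseNet; those steps are omitted here.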



Sensors, 2019, Vol 19 (8), pp. 1932
DOI: 10.3390/s19081932
Cited By ~ 4

Author(s): Huy Hieu Pham, Houssam Salmane, Louahdi Khoudour, Alain Crouzil, Pablo Zegers, ...

Keyword(s): Neural Networks, Convolutional Neural Networks, Action Recognition, Large Scale, Image Representation, Human Action, Computational Time, Deep Convolutional Neural Networks, Classification Tasks, Spatio Temporal



Human action recognition based on improved motion history image and deep convolutional neural networks

2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), 2017
DOI: 10.1109/cisp-bmei.2017.8302061
Cited By ~ 2

Author(s): Qiuping Chun, Erhu Zhang

Keyword(s): Neural Networks, Convolutional Neural Networks, Action Recognition, Human Action Recognition, Human Action, Deep Convolutional Neural Networks, Motion History Image


Deep Convolutional Neural Networks for Human Action Recognition Using Depth Maps and Postures

IEEE Transactions on Systems Man and Cybernetics Systems, 2019, Vol 49 (9), pp. 1806-1819
DOI: 10.1109/tsmc.2018.2850149
Cited By ~ 36

Author(s): Aouaidjia Kamel, Bin Sheng, Po Yang, Ping Li, Ruimin Shen, ...

Keyword(s): Neural Networks, Convolutional Neural Networks, Action Recognition, Human Action Recognition, Human Action, Deep Convolutional Neural Networks, Depth Maps


Human action recognition using support vector machines and 3D convolutional neural networks

International Journal of Advances in Intelligent Informatics, 2017, Vol 3 (1), pp. 47
DOI: 10.26555/ijain.v3i1.89
Cited By ~ 10

Author(s): Majd Latah

Keyword(s): Neural Networks, Support Vector Machines, Convolutional Neural Networks, Action Recognition, Recognition Task, Human Action Recognition, Human Action, Support Vector, Deep Convolutional Neural Networks, Vector Machines

Recently, deep learning approaches have been widely used to improve recognition accuracy in different application areas. In this paper, both deep convolutional neural networks (CNNs) and support vector machines were employed for the human action recognition task. First, a 3D CNN was used to extract spatial and temporal features from adjacent video frames. Then, support vector machines were used to classify each instance based on the previously extracted features. Both the number of CNN layers and the resolution of the input frames were reduced to fit within limited memory. The proposed architecture was trained and evaluated on the KTH action recognition dataset and achieved good performance.
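To illustrate what it means for a 3D convolution to draw features from adjacent video frames, here is a minimal pure-Python sketch of a single 3D filter applied to a frames × height × width volume (no padding, stride 1). It is not the architecture from the paper: the function name, the example kernel and the toy input are assumptions, the filter is fixed rather than learned, and the SVM classification stage is omitted.

```python
from typing import List

Volume = List[List[List[float]]]  # indexed as [time][row][column]


def conv3d_valid(volume: Volume, kernel: Volume) -> Volume:
    """Cross-correlate a T x H x W volume with a kt x kh x kw kernel."""
    T, H, W = len(volume), len(volume[0]), len(volume[0][0])
    kt, kh, kw = len(kernel), len(kernel[0]), len(kernel[0][0])
    out = []
    for t in range(T - kt + 1):
        plane = []
        for y in range(H - kh + 1):
            row = []
            for x in range(W - kw + 1):
                acc = 0.0
                # Sum over a small window spanning space AND time.
                for dt in range(kt):
                    for dy in range(kh):
                        for dx in range(kw):
                            acc += volume[t + dt][y + dy][x + dx] * kernel[dt][dy][dx]
                row.append(acc)
            plane.append(row)
        out.append(plane)
    return out


# A 2 x 1 x 1 kernel that responds to intensity change between adjacent frames.
temporal_diff = [[[-1.0]], [[1.0]]]

# Three 2 x 2 frames: dark, then bright, then bright again.
clip = [
    [[0.0, 0.0], [0.0, 0.0]],
    [[1.0, 1.0], [1.0, 1.0]],
    [[1.0, 1.0], [1.0, 1.0]],
]
features = conv3d_valid(clip, temporal_diff)
```

Here `features[0]` is non-zero everywhere because the first two frames differ, while `features[1]` is zero because the last two frames are identical. In a two-stage setup like the one described, feature maps of this kind would be flattened into a vector and handed to a separate classifier.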
