Human Action Recognition Using Action Bank Features and Convolutional Neural Networks

Temporal information plays a significant role in video-based human action recognition, and effectively extracting the spatial–temporal characteristics of actions in videos remains a challenging problem. Most existing methods acquire spatial and temporal cues separately. In this article, we propose a new representation for depth video sequences, called hierarchical dynamic depth projected difference images, that aggregates the spatial and temporal information of an action simultaneously at different temporal scales. We first project depth video sequences onto three orthogonal Cartesian views to capture the 3D shape and motion information of human actions. Hierarchical dynamic depth projected difference images are then constructed with rank pooling in each projected view to hierarchically encode the spatial–temporal motion dynamics in depth videos. Convolutional neural networks can automatically learn discriminative features from images and have been extended to video classification because of their superior performance. To verify the effectiveness of the proposed representation, we construct an action recognition framework in which the hierarchical dynamic depth projected difference images from the three views are fed independently into three identical pretrained convolutional neural networks for fine-tuning. We design three classification schemes in the framework; each scheme uses different convolutional neural network layers, so that their effects on action recognition can be compared. In each scheme, the three views are combined to describe the actions more comprehensively. The proposed framework is evaluated on three challenging public human action data sets. Experiments indicate that our method achieves improved performance and provides discriminative spatial–temporal information for human action recognition in depth videos.
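The pipeline described in the abstract (project onto three views, take frame differences, pool over time at several scales) can be illustrated with a rough NumPy sketch. Everything below is an assumption made for illustration, not the authors' implementation: the function names, the binary side and top occupancy maps, the `depth_bins` and `max_depth` parameters, the use of absolute frame differences, and the split into halves at each level are all hypothetical. In place of true rank pooling, the sketch substitutes a simple linear weighting, alpha_t = 2t - T - 1, which is sometimes used as a cheap approximation.

```python
import numpy as np

def project_three_views(depth, depth_bins=64, max_depth=4000.0):
    """Project one depth frame (H x W, 0 = no reading) onto front, side, top views.

    Side and top views are binary occupancy maps (an assumption of this sketch).
    """
    h, w = depth.shape
    ys, xs = np.nonzero(depth > 0)
    # Quantise depth into integer bins so it can index an image axis.
    z = np.clip(depth[ys, xs] / max_depth * (depth_bins - 1), 0, depth_bins - 1).astype(int)
    front = depth.astype(np.float32)                    # x-y plane
    side = np.zeros((h, depth_bins), dtype=np.float32)  # y-z plane
    top = np.zeros((depth_bins, w), dtype=np.float32)   # z-x plane
    side[ys, z] = 1.0
    top[z, xs] = 1.0
    return front, side, top

def approx_rank_pool(frames):
    """Collapse T frames into one image with linear weights alpha_t = 2t - T - 1.

    This is a stand-in approximation, not an exact rank-pooling solver.
    """
    frames = np.asarray(frames, dtype=np.float32)
    T = frames.shape[0]
    alpha = 2.0 * np.arange(1, T + 1) - T - 1
    return np.tensordot(alpha, frames, axes=(0, 0))

def hierarchical_dynamic_images(frames, levels=2):
    """Pool difference images over the whole sequence, then halves, quarters, ..."""
    frames = np.asarray(frames, dtype=np.float32)
    diffs = np.abs(np.diff(frames, axis=0))  # assumed form of the difference images
    images = []
    for level in range(levels):
        for segment in np.array_split(diffs, 2 ** level, axis=0):
            if len(segment) > 0:
                images.append(approx_rank_pool(segment))
    return images

def build_representation(video, levels=2):
    """Return a dict mapping each view name to its list of pooled images."""
    views = zip(*(project_three_views(frame) for frame in video))
    return {name: hierarchical_dynamic_images(np.stack(v), levels)
            for name, v in zip(("front", "side", "top"), views)}
```

In a full system each view's images would be resized and passed to its own pretrained network; that step is omitted here because the abstract does not specify the architecture or the fusion rule.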

Towards Improved Human Action Recognition Using Convolutional Neural Networks and Multimodal Fusion of Depth and Inertial Sensor Data

2018 IEEE International Symposium on Multimedia (ISM), 2018
DOI: 10.1109/ism.2018.000-2
Cited By ~ 9
Author(s): Zeeshan Ahmad, Naimul Khan
Keyword(s): Neural Networks, Convolutional Neural Networks, Action Recognition, Inertial Sensor, Human Action Recognition, Human Action, Multimodal Fusion, Sensor Data

Real-time human action recognition using depth motion maps and convolutional neural networks

International Journal of High Performance Computing and Networking, 2016, Vol 1 (1), pp. 1
DOI: 10.1504/ijhpcn.2016.10011433
Cited By ~ 1
Author(s): Yitong Li, Xiaojuan Ban, Guang Yang, Jiang Li
Keyword(s): Neural Networks, Real Time, Convolutional Neural Networks, Action Recognition, Human Action Recognition, Human Action, Depth Motion Maps

Human Action Recognition by Fusion of Convolutional Neural Networks and Spatial-Temporal Information

Proceedings of the International Conference on Internet Multimedia Computing and Service - ICIMCS'16, 2016
DOI: 10.1145/3007669.3007702
Cited By ~ 1
Author(s): Weisheng Li, Yahui Ding
Keyword(s): Neural Networks, Convolutional Neural Networks, Action Recognition, Human Action Recognition, Human Action, Temporal Information

Skeleton-Based Human Action Recognition Using Spatial Temporal 3D Convolutional Neural Networks

2018 IEEE International Conference on Multimedia and Expo (ICME), 2018
DOI: 10.1109/icme.2018.8486566
Cited By ~ 3
Author(s): Juanhui Tu, Mengyuan Liu, Hong Liu
Keyword(s): Neural Networks, Convolutional Neural Networks, Action Recognition, Human Action Recognition, Human Action

Human Action Recognition Based on Recognition of Linear Patterns in Action Bank Features Using Convolutional Neural Networks

2014 13th International Conference on Machine Learning and Applications, 2014
DOI: 10.1109/icmla.2014.33
Cited By ~ 5
Author(s): Earnest Paul Ijjina, C. Krishna Mohan
Keyword(s): Neural Networks, Convolutional Neural Networks, Action Recognition, Human Action Recognition, Human Action
