Joint Dynamic Pose Image and Space Time Reversal for Human Action Recognition from Videos

Author(s):  
Mengyuan Liu ◽  
Fanyang Meng ◽  
Chen Chen ◽  
Songtao Wu

Human action recognition aims to classify a given video according to the type of action it contains. Disturbance from cluttered backgrounds and unrelated motions makes the task challenging for video-frame-based methods. To address this problem, this paper leverages pose estimation to enhance the performance of video frame features. First, we present a pose feature called the dynamic pose image (DPI), which describes a human action as the aggregation of a sequence of joint estimation maps. Unlike traditional pose features that use joint locations alone, the DPI suffers less from disturbance and provides richer information about body shape and movement. Second, we present attention-based dynamic texture images (att-DTIs) as a pose-guided video frame feature. Specifically, a video is treated as a space-time volume, and DTIs are obtained by observing the volume from different views. To alleviate the effect of disturbance on DTIs, we accumulate the joint estimation maps into an attention map and extend DTIs to attention-based DTIs (att-DTIs). Finally, we fuse the DPI and att-DTIs with multi-stream deep neural networks and a late-fusion scheme for action recognition. Experiments on the NTU RGB+D, UTD-MHAD, and Penn-Action datasets show the effectiveness of the DPI and att-DTIs, as well as their complementary properties.
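The two constructions described in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the paper's implementation: the linear temporal weighting for the DPI, the normalization, and the use of simple mean projections along the three axes of the space-time volume as the "different views" are all assumptions made here for clarity.

```python
import numpy as np

def dynamic_pose_image(joint_maps):
    """Aggregate per-frame joint estimation maps of shape (T, H, W) into a
    single dynamic pose image via temporally weighted summation.
    Assumption: later frames get larger weights (simple linear ramp)."""
    T = joint_maps.shape[0]
    weights = np.arange(1, T + 1, dtype=np.float64)
    dpi = np.tensordot(weights, joint_maps, axes=(0, 0))  # -> (H, W)
    return dpi / (dpi.max() + 1e-8)

def attention_dtis(volume, joint_maps):
    """Compute attention-based dynamic texture images from a grayscale
    space-time volume of shape (T, H, W). The attention map is the
    accumulated joint estimation maps, normalized to [0, 1]."""
    att = joint_maps.sum(axis=0)
    att = att / (att.max() + 1e-8)
    weighted = volume * att[None, :, :]   # suppress background pixels
    # Observe the weighted volume from three orthogonal views
    # (mean projection along each axis is an assumption here).
    dti_front = weighted.mean(axis=0)     # (H, W): collapse time
    dti_side = weighted.mean(axis=2)      # (T, H): collapse width
    dti_top = weighted.mean(axis=1)       # (T, W): collapse height
    return dti_front, dti_side, dti_top
```

In a full pipeline, the DPI and the three att-DTIs would each feed a separate stream of a deep network, with the class scores combined by late fusion.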

2017 ◽  
Vol 11 (7) ◽  
pp. 530-540 ◽  
Author(s):  
Bassem Seddik ◽  
Sami Gazzah ◽  
Najoua Essoukri Ben Amara

2014 ◽  
Vol 36 ◽  
pp. 221-227 ◽  
Author(s):  
Antonio W. Vieira ◽  
Erickson R. Nascimento ◽  
Gabriel L. Oliveira ◽  
Zicheng Liu ◽  
Mario F.M. Campos

Author(s):  
Maxime Devanne ◽  
Hazem Wannous ◽  
Stefano Berretti ◽  
Pietro Pala ◽  
Mohamed Daoudi ◽  
...  

IEEE Access ◽  
2018 ◽  
Vol 6 ◽  
pp. 17913-17922 ◽  
Author(s):  
Lei Wang ◽  
Yangyang Xu ◽  
Jun Cheng ◽  
Haiying Xia ◽  
Jianqin Yin ◽  
...  

Author(s):  
Prof. Rajeshwari J. Kodulkar

Abstract: Human action detection is one of the most demanding and complex tasks for deep neural networks. Human gesture recognition is a form of human action recognition: a gesture is a series of bodily motions that communicate a message. Gestures are a more natural and preferable way for humans to interact with computers, thereby bridging the gap between humans and machines. Human action recognition also provides a valuable communication aid for deaf and mute people. In this work, we propose a system for hand gesture identification that recognizes hand movements, extracts hand characteristics such as peak counts and finger angles, and then converts gesture images into text. Index Terms: Human action recognition, Deaf and dumb, CNN.
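The peak and angle calculations mentioned in the abstract can be illustrated with a short geometric sketch. The specific heuristic below (measuring the angle at a candidate fingertip peak between its two neighboring contour valleys, and counting a finger as extended when that angle falls under a 60° threshold) is a common convexity-defect-style approach and is an assumption here, not the authors' exact method.

```python
import numpy as np

def finger_angle(peak, valley_left, valley_right):
    """Angle in degrees at a candidate fingertip `peak`, formed by the two
    vectors pointing toward its neighboring contour valleys. Small angles
    indicate a sharp peak, i.e. a likely extended finger."""
    a = np.asarray(valley_left, dtype=float) - np.asarray(peak, dtype=float)
    b = np.asarray(valley_right, dtype=float) - np.asarray(peak, dtype=float)
    cos = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-8)
    return np.degrees(np.arccos(np.clip(cos, -1.0, 1.0)))

def count_extended_fingers(candidates, angle_thresh=60.0):
    """Count extended fingers from (peak, valley_left, valley_right)
    point triples extracted from a hand contour. The 60-degree threshold
    is an assumed tuning parameter."""
    return sum(1 for peak, vl, vr in candidates
               if finger_angle(peak, vl, vr) < angle_thresh)
```

In practice, the peak/valley triples would come from a contour-analysis step (for example, convex hull and convexity defects on a segmented hand mask), and the resulting finger count and angles would accompany the CNN features before mapping the gesture to text.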

