Interaction-Aware Spatio-Temporal Pyramid Attention Networks for Action Classification

Computer Vision – ECCV 2018 - Lecture Notes in Computer Science ◽

10.1007/978-3-030-01270-0_23 ◽

2018 ◽

pp. 388-404 ◽

Cited By ~ 14

Author(s):

Yang Du ◽

Chunfeng Yuan ◽

Bing Li ◽

Lili Zhao ◽

Yangxi Li ◽

...

Keyword(s):

Action Classification ◽

Attention Networks ◽

Spatio Temporal

Download Full-text

Human action classification using surf based spatio-temporal correlated descriptors

2012 19th IEEE International Conference on Image Processing ◽

10.1109/icip.2012.6467131 ◽

2012 ◽

Author(s):

A. Q. Md Sabri ◽

J. Boonaert ◽

S. Lecoeuche ◽

E. Mouaddib

Keyword(s):

Human Action ◽

Action Classification ◽

Spatio Temporal

Download Full-text

Multi-Term Attention Networks for Skeleton-Based Action Recognition

Applied Sciences ◽

10.3390/app10155326 ◽

2020 ◽

Vol 10 (15) ◽

pp. 5326

Author(s):

Xiaolei Diao ◽

Xiaoqiang Li ◽

Chen Huang

Keyword(s):

Neural Network ◽

Time Scales ◽

Action Recognition ◽

State Of The Art ◽

Attention Networks ◽

Weighted Fusion ◽

Temporal Features ◽

Benchmark Datasets ◽

Spatio Temporal ◽

Different Time Scales

The same action takes different time in different cases. This difference will affect the accuracy of action recognition to a certain extent. We propose an end-to-end deep neural network called “Multi-Term Attention Networks” (MTANs), which solves the above problem by extracting temporal features with different time scales. The network consists of a Multi-Term Attention Recurrent Neural Network (MTA-RNN) and a Spatio-Temporal Convolutional Neural Network (ST-CNN). In MTA-RNN, a method for fusing multi-term temporal features are proposed to extract the temporal dependence of different time scales, and the weighted fusion temporal feature is recalibrated by the attention mechanism. Ablation research proves that this network has powerful spatio-temporal dynamic modeling capabilities for actions with different time scales. We perform extensive experiments on four challenging benchmark datasets, including the NTU RGB+D dataset, UT-Kinect dataset, Northwestern-UCLA dataset, and UWA3DII dataset. Our method achieves better results than the state-of-the-art benchmarks, which demonstrates the effectiveness of MTANs.

Download Full-text

A 127mW 1.63TOPS sparse spatio-temporal cognitive SoC for action classification and motion tracking in videos

2017 Symposium on VLSI Circuits ◽

10.23919/vlsic.2017.8008488 ◽

2017 ◽

Cited By ~ 1

Author(s):

Ching-En Lee ◽

Thomas Chen ◽

Zhengya Zhang

Keyword(s):

Motion Tracking ◽

Action Classification ◽

Spatio Temporal

Download Full-text

Video Question Answering via Hierarchical Spatio-Temporal Attention Networks

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/492 ◽

2017 ◽

Cited By ~ 23

Author(s):

Zhou Zhao ◽

Qifan Yang ◽

Deng Cai ◽

Xiaofei He ◽

Yueting Zhuang

Keyword(s):

Visual Information ◽

Large Scale ◽

Question Answering ◽

Temporal Dynamics ◽

Temporal Attention ◽

Attention Networks ◽

Learning Framework ◽

Spatio Temporal ◽

The Given ◽

Video Question Answering

Open-ended video question answering is a challenging problem in visual information retrieval, which automatically generates the natural language answer from the referenced video content according to the question. However, the existing visual question answering works only focus on the static image, which may be ineffectively applied to video question answering due to the temporal dynamics of video contents. In this paper, we consider the problem of open-ended video question answering from the viewpoint of spatio-temporal attentional encoder-decoder learning framework. We propose the hierarchical spatio-temporal attention network for learning the joint representation of the dynamic video contents according to the given question. We then develop the encoder-decoder learning method with reasoning recurrent neural networks for open-ended video question answering. We construct a large-scale video question answering dataset. The extensive experiments show the effectiveness of our method.

Download Full-text

Spatio-temporal SIFT and Its Application to Human Action Classification

Computer Vision – ECCV 2012. Workshops and Demonstrations - Lecture Notes in Computer Science ◽

10.1007/978-3-642-33863-2_30 ◽

2012 ◽

pp. 301-310 ◽

Cited By ~ 8

Author(s):

Manal Al Ghamdi ◽

Lei Zhang ◽

Yoshihiko Gotoh

Keyword(s):

Human Action ◽

Action Classification ◽

Spatio Temporal

Download Full-text

Spatio-Temporal Attention Networks for Action Recognition and Detection

IEEE Transactions on Multimedia ◽

10.1109/tmm.2020.2965434 ◽

2020 ◽

Vol 22 (11) ◽

pp. 2990-3001 ◽

Cited By ~ 2

Author(s):

Jun Li ◽

Xianglong Liu ◽

Wenxuan Zhang ◽

Mingyuan Zhang ◽

Jingkuan Song ◽

...

Keyword(s):

Action Recognition ◽

Temporal Attention ◽

Attention Networks ◽

Spatio Temporal

Download Full-text

3D Action Classification Using Sparse Spatio-temporal Feature Representations

Advances in Visual Computing - Lecture Notes in Computer Science ◽

10.1007/978-3-642-33191-6_17 ◽

2012 ◽

pp. 166-175 ◽

Cited By ~ 5

Author(s):

Sherif Azary ◽

Andreas Savakis

Keyword(s):

Action Classification ◽

Feature Representations ◽

Spatio Temporal ◽

Temporal Feature

Download Full-text

Unified Spatio-Temporal Attention Networks for Action Recognition in Videos

IEEE Transactions on Multimedia ◽

10.1109/tmm.2018.2862341 ◽

2019 ◽

Vol 21 (2) ◽

pp. 416-428 ◽

Cited By ~ 19

Author(s):

Dong Li ◽

Ting Yao ◽

Ling-Yu Duan ◽

Tao Mei ◽

Yong Rui

Keyword(s):

Action Recognition ◽

Temporal Attention ◽

Attention Networks ◽

Spatio Temporal

Download Full-text

Video Action Classification: A New Approach Combining Spatio-temporal Krawtchouk Moments and Laplacian Eigenmaps

2011 Seventh International Conference on Signal Image Technology & Internet-Based Systems ◽

10.1109/sitis.2011.65 ◽

2011 ◽

Cited By ~ 5

Author(s):

Imen Lassoued ◽

Ezzeddine Zagrouba ◽

Youssef Chahir

Keyword(s):

Action Classification ◽

Laplacian Eigenmaps ◽

New Approach ◽

Krawtchouk Moments ◽

Spatio Temporal

Download Full-text