Human Action Recognition Based on Foreground Trajectory and Motion Difference Descriptors

Aimed at the problems of high redundancy of trajectory and susceptibility to background interference in traditional dense trajectory behavior recognition methods, a human action recognition method based on foreground trajectory and motion difference descriptors is proposed. First, the motion magnitude of each frame is estimated by optical flow, and the foreground region is determined according to each motion magnitude of the pixels; the trajectories are only extracted from behavior-related foreground regions. Second, in order to better describe the relative temporal information between different actions, a motion difference descriptor is introduced to describe the foreground trajectory, and the direction histogram of the motion difference is constructed by calculating the direction information of the motion difference per unit time of the trajectory point. Finally, a Fisher vector (FV) is used to encode histogram features to obtain video-level action features, and a support vector machine (SVM) is utilized to classify the action category. Experimental results show that this method can better extract the action-related trajectory, and it can improve the recognition accuracy by 7% compared to the traditional dense trajectory method.

Download Full-text

Human Action Recognition: A Dense Trajectory and Similarity Constrained Latent Support Vector Machine Approach

2013 2nd IAPR Asian Conference on Pattern Recognition ◽

10.1109/acpr.2013.65 ◽

2013 ◽

Cited By ~ 1

Author(s):

Sio-Long Lo ◽

Ah-Chung Tsoi

Keyword(s):

Support Vector Machine ◽

Action Recognition ◽

Human Action Recognition ◽

Human Action ◽

Support Vector ◽

Dense Trajectory

Download Full-text

Human action recognition method based on hierarchical framework via Kinect skeleton data

2017 International Conference on Machine Learning and Cybernetics (ICMLC) ◽

10.1109/icmlc.2017.8107747 ◽

2017 ◽

Cited By ~ 3

Author(s):

Benyue Su ◽

Huang Wu ◽

Min Sheng

Keyword(s):

Action Recognition ◽

Human Action Recognition ◽

Human Action ◽

Recognition Method ◽

Hierarchical Framework

Download Full-text

Human action recognition with group lasso regularized-support vector machine

Journal of Electronic Imaging ◽

10.1117/1.jei.25.3.033015 ◽

2016 ◽

Vol 25 (3) ◽

pp. 033015 ◽

Cited By ~ 2

Author(s):

Huiwu Luo ◽

Huanzhang Lu ◽

Yabei Wu ◽

Fei Zhao

Keyword(s):

Support Vector Machine ◽

Action Recognition ◽

Human Action Recognition ◽

Human Action ◽

Group Lasso ◽

Support Vector

Download Full-text

A Novel 3D Human Action Recognition Method Based on Part Affinity Fields

Communications in Computer and Information Science - Embedded Systems Technology ◽

10.1007/978-981-13-1026-3_14 ◽

2018 ◽

pp. 181-192

Author(s):

Haipeng Dong ◽

Qingqing Meng ◽

Tao Hu

Keyword(s):

Action Recognition ◽

Human Action Recognition ◽

Human Action ◽

Recognition Method

Download Full-text

Hybrid Feature Vector-Assisted Action Representation for Human Action Recognition Using Support Vector Machines

Methodologies and Applications of Computational Statistics for Machine Intelligence - Advances in Systems Analysis, Software Engineering, and High Performance Computing ◽

10.4018/978-1-7998-7701-1.ch001 ◽

2021 ◽

pp. 1-22

Author(s):

L. Nirmala Devi ◽

A.Nageswar Rao

Keyword(s):

Action Recognition ◽

Feature Vector ◽

Learning Algorithm ◽

Gabor Filter ◽

Principal Component ◽

Human Action Recognition ◽

Human Action ◽

Visual Surveillance ◽

Support Vector ◽

Significant Research

Human action recognition (HAR) is one of most significant research topics, and it has attracted the concentration of many researchers. Automatic HAR system is applied in several fields like visual surveillance, data retrieval, healthcare, etc. Based on this inspiration, in this chapter, the authors propose a new HAR model that considers an image as input and analyses and exposes the action present in it. Under the analysis phase, they implement two different feature extraction methods with the help of rotation invariant Gabor filter and edge adaptive wavelet filter. For every action image, a new vector called as composite feature vector is formulated and then subjected to dimensionality reduction through principal component analysis (PCA). Finally, the authors employ the most popular supervised machine learning algorithm (i.e., support vector machine [SVM]) for classification. Simulation is done over two standard datasets; they are KTH and Weizmann, and the performance is measured through an accuracy metric.

Download Full-text

Feature Fusion of Deep Spatial Features and Handcrafted Spatiotemporal Features for Human Action Recognition

Sensors ◽

10.3390/s19071599 ◽

2019 ◽

Vol 19 (7) ◽

pp. 1599 ◽

Cited By ~ 6

Author(s):

Md Uddin ◽

Young-Koo Lee

Keyword(s):

Action Recognition ◽

State Of The Art ◽

Human Action Recognition ◽

Human Action ◽

Support Vector ◽

Feature Descriptor ◽

Weber’S Law ◽

Weber's Law ◽

Spatiotemporal Features ◽

Spatial Features

Human action recognition plays a significant part in the research community due to its emerging applications. A variety of approaches have been proposed to resolve this problem, however, several issues still need to be addressed. In action recognition, effectively extracting and aggregating the spatial-temporal information plays a vital role to describe a video. In this research, we propose a novel approach to recognize human actions by considering both deep spatial features and handcrafted spatiotemporal features. Firstly, we extract the deep spatial features by employing a state-of-the-art deep convolutional network, namely Inception-Resnet-v2. Secondly, we introduce a novel handcrafted feature descriptor, namely Weber’s law based Volume Local Gradient Ternary Pattern (WVLGTP), which brings out the spatiotemporal features. It also considers the shape information by using gradient operation. Furthermore, Weber’s law based threshold value and the ternary pattern based on an adaptive local threshold is presented to effectively handle the noisy center pixel value. Besides, a multi-resolution approach for WVLGTP based on an averaging scheme is also presented. Afterward, both these extracted features are concatenated and feed to the Support Vector Machine to perform the classification. Lastly, the extensive experimental analysis shows that our proposed method outperforms state-of-the-art approaches in terms of accuracy.

Download Full-text

A Set of New Hermite Kernel Functions in Kernel Extreme Learning Machine and Application in Human Action Recognition

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001419550140 ◽

2019 ◽

Vol 33 (12) ◽

pp. 1955014 ◽

Cited By ~ 1

Author(s):

Xueping Liu ◽

Xingzuo Yue

Keyword(s):

Extreme Learning Machine ◽

Action Recognition ◽

Structural Information ◽

Image Data ◽

Human Action Recognition ◽

Human Action ◽

Kernel Functions ◽

Support Vector ◽

Learning Speed ◽

Learning Machine

The kernel function has been successfully utilized in the extreme learning machine (ELM) that provides a stabilized and generalized performance and greatly reduces the computational complexity. However, the selection and optimization of the parameters constituting the most common kernel functions are tedious and time-consuming. In this study, a set of new Hermit kernel functions derived from the generalized Hermit polynomials has been proposed. The significant contributions of the proposed kernel include only one parameter selected from a small set of natural numbers; thus, the parameter optimization is greatly facilitated and excessive structural information of the sample data is retained. Consequently, the new kernel functions can be used as optimal alternatives to other common kernel functions for ELM at a rapid learning speed. The experimental results showed that the proposed kernel ELM method tends to have similar or better robustness and generalized performance at a faster learning speed than the other common kernel ELM and support vector machine methods. Consequently, when applied to human action recognition by depth video sequence, the method also achieves excellent performance, demonstrating its time-based advantage on the video image data.

Download Full-text

Human Action Recognition Using Improved Salient Dense Trajectories

Computational Intelligence and Neuroscience ◽

10.1155/2016/6750459 ◽

2016 ◽

Vol 2016 ◽

pp. 1-11 ◽

Cited By ~ 3

Author(s):

Qingwu Li ◽

Haisu Cheng ◽

Yan Zhou ◽

Guanying Huo

Keyword(s):

Action Recognition ◽

State Of The Art ◽

Human Action Recognition ◽

Human Action ◽

Interest Points ◽

Dense Trajectories ◽

Dense Trajectory ◽

Sparse Coefficient ◽

Active Research ◽

Motion Saliency

Human action recognition in videos is a topic of active research in computer vision. Dense trajectory (DT) features were shown to be efficient for representing videos in state-of-the-art approaches. In this paper, we present a more effective approach of video representation using improved salient dense trajectories: first, detecting the motion salient region and extracting the dense trajectories by tracking interest points in each spatial scale separately and then refining the dense trajectories via the analysis of the motion saliency. Then, we compute several descriptors (i.e., trajectory displacement, HOG, HOF, and MBH) in the spatiotemporal volume aligned with the trajectories. Finally, in order to represent the videos better, we optimize the framework of bag-of-words according to the motion salient intensity distribution and the idea of sparse coefficient reconstruction. Our architecture is trained and evaluated on the four standard video actions datasets of KTH, UCF sports, HMDB51, and UCF50, and the experimental results show that our approach performs competitively comparing with the state-of-the-art results.

Download Full-text