Discriminative Action Recognition Using Supervised Latent Topic Model

2012 ◽  
Vol 190-191 ◽  
pp. 1125-1128
Author(s):  
Huan Xin Zou ◽  
Hao Sun ◽  
Ke Feng Ji

We present a discriminative learning method for human action recognition from video sequences. Our model combines a bag-of-words representation with a supervised latent topic model. The supervised latent Dirichlet allocation (sLDA) topic model, which performs discriminative learning on labeled data within a generative framework, is introduced to discover the latent topic structure that is most relevant to action categorization. We test our algorithm on two challenging datasets, and the experimental results demonstrate its effectiveness.
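The pipeline described here — local descriptors quantized into a bag of words, a topic layer, and a label-aware objective — can be approximated with off-the-shelf tools. Below is a minimal sketch in Python: since scikit-learn offers no supervised LDA, an unsupervised LatentDirichletAllocation followed by a logistic-regression classifier stands in for sLDA's joint training, and the vocabulary size, topic count, and `descriptor_sets` input are illustrative assumptions.

```python
# Sketch only: unsupervised LDA + classifier as a stand-in for sLDA's
# joint training; vocabulary/topic sizes are illustrative assumptions.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.linear_model import LogisticRegression

def bow_histograms(descriptor_sets, vocab_size=200, seed=0):
    """Quantize local spatio-temporal descriptors into bag-of-words counts."""
    all_desc = np.vstack(descriptor_sets)
    codebook = KMeans(n_clusters=vocab_size, random_state=seed).fit(all_desc)
    hists = np.zeros((len(descriptor_sets), vocab_size))
    for i, desc in enumerate(descriptor_sets):
        words = codebook.predict(desc)
        hists[i] = np.bincount(words, minlength=vocab_size)
    return hists

# descriptor_sets: one (n_i, d) array of local descriptors per video (assumed given)
# labels: one action label per video
def train(descriptor_sets, labels, n_topics=30):
    counts = bow_histograms(descriptor_sets)
    lda = LatentDirichletAllocation(n_components=n_topics, random_state=0)
    topics = lda.fit_transform(counts)   # per-video topic proportions
    clf = LogisticRegression(max_iter=1000).fit(topics, labels)
    return lda, clf
```

In sLDA proper, the class label is generated from the topic proportions, so the topics and the classifier are trained jointly rather than in two separate stages as in this sketch.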

2019 ◽  
Vol 16 (1) ◽  
pp. 172988141882509 ◽  
Author(s):  
Hanbo Wu ◽  
Xin Ma ◽  
Yibin Li

Temporal information plays a significant role in video-based human action recognition, and effectively extracting the spatial–temporal characteristics of actions in videos remains a challenging problem. Most existing methods acquire spatial and temporal cues in videos separately. In this article, we propose a new, effective representation for depth video sequences, called hierarchical dynamic depth projected difference images (HDDPDI), that aggregates the spatial and temporal information of actions simultaneously at different temporal scales. We first project depth video sequences onto three orthogonal Cartesian views to capture the 3D shape and motion information of human actions. HDDPDI are then constructed with rank pooling in each projected view to hierarchically encode the spatial–temporal motion dynamics in depth videos. Convolutional neural networks (CNNs) can automatically learn discriminative features from images and have been extended to video classification because of their superior performance. To verify the effectiveness of the HDDPDI representation, we construct an HDDPDI-based action recognition framework in which the HDDPDI of the three views are fed independently into three identical pretrained CNNs for fine-tuning. We design three classification schemes in the framework, each utilizing different CNN layers, to compare their effects on action recognition; in every scheme, the three views are combined to describe the actions more comprehensively. The proposed framework is evaluated on three challenging public human action datasets. Experiments indicate that our method outperforms existing approaches and provides discriminative spatial–temporal information for human action recognition in depth videos.
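As a concrete illustration of the first stage, the sketch below projects a single depth frame onto the three orthogonal Cartesian views and accumulates frame-to-frame differences over a view. The depth range, bin count, and binary occupancy encoding are illustrative assumptions, and the rank-pooling step that builds the hierarchical representation is omitted.

```python
# Sketch of projecting one depth frame onto front/side/top views;
# d_max, depth_bins, and binary occupancy are illustrative assumptions.
import numpy as np

def orthogonal_projections(depth, depth_bins=256, d_max=4000.0):
    """depth: (H, W) array of depth values in millimeters (0 = no reading)."""
    h, w = depth.shape
    front = depth.copy()              # front view: the depth map itself
    side = np.zeros((h, depth_bins))  # rows x quantized depth
    top = np.zeros((depth_bins, w))   # quantized depth x columns
    valid = depth > 0
    dq = np.clip((depth[valid] / d_max * depth_bins).astype(int), 0, depth_bins - 1)
    rows, cols = np.nonzero(valid)
    side[rows, dq] = 1                # binary occupancy, seen from the side
    top[dq, cols] = 1                 # binary occupancy, seen from above
    return front, side, top

def projected_difference(frames_view):
    """Accumulate absolute frame-to-frame motion within one projected view."""
    return sum(np.abs(b - a) for a, b in zip(frames_view, frames_view[1:]))
```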


2013 ◽  
Vol 2013 ◽  
pp. 1-9 ◽  
Author(s):  
Hong-bin Tu ◽  
Li-min Xia ◽  
Lun-zheng Tan

Human action recognition is an important area of computer vision research. Focusing on the problem of self-occlusion in human action recognition, we present a new adaptive occlusion-state behavior recognition approach based on a Markov random field and probabilistic latent semantic analysis (pLSA). First, the Markov random field is used to represent the occlusion relationships between human body parts in terms of an occlusion-state variable obtained from phase space. Then, we propose a hierarchical area variety model. Finally, we use the pLSA topic model to recognize human behavior. Experiments were performed on the KTH, Weizmann, and HumanEva datasets to evaluate the proposed method. The comparative results show that the proposed method is more effective than the compared methods.
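For the topic-model stage, pLSA can be fit with a short EM loop over word-count histograms. The sketch below is a generic pLSA implementation, not the authors' full pipeline (the Markov random field and the hierarchical area variety model are omitted); the topic count and iteration number are illustrative.

```python
# Generic pLSA via EM on bag-of-words counts; a stand-in sketch for the
# topic-model stage only, with illustrative hyperparameters.
import numpy as np

def plsa(counts, n_topics=20, n_iter=50, seed=0):
    """counts: (n_docs, n_words) bag-of-words matrix. Returns P(z|d), P(w|z)."""
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    p_z_d = rng.random((n_docs, n_topics)); p_z_d /= p_z_d.sum(1, keepdims=True)
    p_w_z = rng.random((n_topics, n_words)); p_w_z /= p_w_z.sum(1, keepdims=True)
    for _ in range(n_iter):
        # E-step: responsibilities P(z | d, w) for every doc-word pair
        joint = p_z_d[:, :, None] * p_w_z[None, :, :]   # (docs, topics, words)
        resp = joint / (joint.sum(1, keepdims=True) + 1e-12)
        # M-step: re-estimate both distributions from expected counts
        expected = counts[:, None, :] * resp            # (docs, topics, words)
        p_w_z = expected.sum(0); p_w_z /= p_w_z.sum(1, keepdims=True) + 1e-12
        p_z_d = expected.sum(2); p_z_d /= p_z_d.sum(1, keepdims=True) + 1e-12
    return p_z_d, p_w_z
```

The per-document topic proportions P(z|d) then serve as the feature on which behavior categories are recognized.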


Electronics ◽  
2020 ◽  
Vol 9 (12) ◽  
pp. 1993
Author(s):  
Malik Ali Gul ◽  
Muhammad Haroon Yousaf ◽  
Shah Nawaz ◽  
Zaka Ur Rehman ◽  
HyungWon Kim

Human action recognition has emerged as a challenging research domain for video understanding and analysis, and extensive research has been conducted to improve the recognition of human actions. Human activity recognition has various real-time applications, such as patient monitoring, in which patients are monitored among a group of healthy people and identified by their abnormal activities. Our goal is to provide multi-class abnormal action detection, in individuals as well as in groups, from video sequences in order to differentiate multiple abnormal human actions. In this paper, the You Only Look Once (YOLO) network is utilized as the backbone CNN model. For training the CNN model, we constructed a large dataset of patient videos by labeling each frame with a set of patient actions and the patient's positions. We retrained the backbone CNN model with 23,040 labeled images of patient actions for 32 epochs. The proposed model assigns a confidence score and an action label to each frame and labels a video sequence with its most recurrent frame-level action label. The present study shows that the accuracy of abnormal action recognition is 96.8%. Our proposed approach differentiated abnormal actions with an improved F1-score of 89.2%, which is higher than state-of-the-art techniques. The results indicate that the proposed framework can be beneficial to hospitals and elder care homes for patient monitoring.
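The video-level labeling step — per-frame predictions reduced to the most recurrent action label — can be sketched as follows; the `(label, confidence)` input format is an assumption standing in for the detector's actual per-frame output.

```python
# Sketch of majority voting over per-frame detector outputs; the
# (label, confidence) tuple format is an illustrative assumption.
from collections import Counter

def video_action_label(frame_predictions):
    """frame_predictions: list of (action_label, confidence), one per frame."""
    labels = [lab for lab, _ in frame_predictions]
    majority, count = Counter(labels).most_common(1)[0]
    # average detector confidence over the frames that voted for the winner
    conf = sum(c for lab, c in frame_predictions if lab == majority) / count
    return majority, conf

# Example: three frames predicted "falling", one "sitting"
preds = [("falling", 0.91), ("falling", 0.88), ("sitting", 0.55), ("falling", 0.93)]
print(video_action_label(preds))   # ('falling', 0.906...)
```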


Author(s):  
Anderson Carlos Sousa e Santos ◽  
Helio Pedrini

Due to rapid advances in the development of surveillance cameras with high sampling rates, low cost, small size, and high resolution, video-based action recognition systems have become more commonly used in various computer vision applications. Such systems can support human operators in detecting events of interest in video sequences, improving recognition results and reducing failure cases. In this work, we propose and evaluate a method to learn two-dimensional (2D) representations from video sequences based on an autoencoder framework. Spatial and temporal information is explored through a multi-stream convolutional neural network in the context of human action recognition. Experimental results on the challenging UCF101 and HMDB51 datasets demonstrate that our representation achieves accuracy competitive with other approaches in the literature.
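A convolutional autoencoder of the general kind described can be sketched in a few lines of PyTorch; the layer sizes, input resolution, and one-step training loop below are illustrative assumptions rather than the authors' exact architecture, and the multi-stream classification network is omitted.

```python
# Sketch of a convolutional autoencoder for learning compact 2D
# representations from frames; architecture details are assumptions.
import torch
import torch.nn as nn

class ConvAutoencoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.encoder = nn.Sequential(   # 3x64x64 input -> 32x16x16 code
            nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.decoder = nn.Sequential(   # mirror of the encoder
            nn.ConvTranspose2d(32, 16, 3, stride=2, padding=1, output_padding=1), nn.ReLU(),
            nn.ConvTranspose2d(16, 3, 3, stride=2, padding=1, output_padding=1), nn.Sigmoid(),
        )

    def forward(self, x):
        code = self.encoder(x)          # the learned 2D representation
        return self.decoder(code), code

model = ConvAutoencoder()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
frames = torch.rand(8, 3, 64, 64)       # a dummy batch of frames
recon, code = model(frames)
loss = nn.functional.mse_loss(recon, frames)  # reconstruction objective
loss.backward(); opt.step()
```

Once trained, the encoder output would be the 2D representation handed to the downstream multi-stream network.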

