A non-linear mapping representing human action recognition under missing modality problem in video data

Measurement ◽  
2021 ◽  
pp. 110123
Author(s):  
Aidin Gharahdaghi ◽  
Farbod Razzazi ◽  
Arash Amini
2021 ◽  
Vol 11 (11) ◽  
pp. 4940
Author(s):  
Jinsoo Kim ◽  
Jeongho Cho

Research on video data faces the difficulty of extracting temporal as well as spatial features, and human action recognition (HAR) is a representative field that applies convolutional neural networks (CNNs) to video data. Action recognition performance has improved, but owing to model complexity, some limitations on real-time operation still persist. Therefore, a lightweight CNN-based single-stream HAR model that can operate in real time is proposed. The proposed model extracts spatial feature maps by applying a CNN to the images that make up the video and uses the frame change rate of sequential images as temporal information. Spatial feature maps are weighted-averaged by frame change, transformed into spatiotemporal features, and input into multilayer perceptrons, which have relatively lower complexity than other HAR models; thus, our method has high utility in a single embedded system connected to CCTV. Evaluation of action recognition accuracy and data processing speed on the challenging action recognition benchmark UCF-101 showed higher accuracy than an HAR model using long short-term memory with a small number of video frames, and the fast data processing speed confirmed that real-time operation is possible. In addition, the performance of the proposed weighted mean-based HAR model was verified by testing it on a Jetson Nano to confirm that it can be used in low-cost GPU-based embedded systems.
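The weighted-averaging step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not say how the frame change rate is computed, so the mean absolute pixel difference between consecutive frames is assumed here. The CNN is replaced by precomputed per-frame feature vectors, and the multilayer perceptron stage is omitted. The function names `frame_change_rates` and `weighted_mean_features` are hypothetical.

```python
from typing import List, Sequence


def frame_change_rates(frames: Sequence[Sequence[float]]) -> List[float]:
    """Assumed change measure: mean absolute pixel difference to the previous frame.

    Each frame is a flattened list of pixel values. The first frame has no
    predecessor, so its change rate is set to 0.0.
    """
    rates = [0.0]
    for prev, cur in zip(frames, frames[1:]):
        diff = sum(abs(a - b) for a, b in zip(prev, cur)) / len(cur)
        rates.append(diff)
    return rates


def weighted_mean_features(
    features: Sequence[Sequence[float]], weights: Sequence[float]
) -> List[float]:
    """Average per-frame feature vectors, weighting each frame by its change rate."""
    total = sum(weights)
    if total == 0:
        # Static clip: fall back to a plain (unweighted) mean.
        weights = [1.0] * len(features)
        total = float(len(features))
    dim = len(features[0])
    return [
        sum(w * f[i] for w, f in zip(weights, features)) / total
        for i in range(dim)
    ]


# Toy data: three 2-pixel frames and three 2-dimensional feature vectors
# standing in for CNN outputs.
frames = [[0, 0], [0, 2], [4, 2]]
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
clip_descriptor = weighted_mean_features(features, frame_change_rates(frames))
print(clip_descriptor)
```

Frames that differ more from their predecessor contribute more to the clip-level descriptor, which is one plausible reading of "weighted-averaged by frame change".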


2021 ◽  
pp. 1-12
Author(s):  
Hongzhong Hei ◽  
Xianzhong Jian ◽  
Erliang Xiao

The widespread application of infrared human action recognition in intelligent surveillance has attracted significant attention. However, infrared action recognition datasets are limited, which hinders the development of infrared action recognition. Existing methods for infrared action recognition are based on features in the same sample, without paying attention to within-class differences. Motivated by the idea of weighting video information, this paper proposes a novel infrared action recognition framework, named REWS, that reweights the training-set samples to address the limited infrared action data and the large within-class differences in the infrared action recognition dataset. In the proposed framework, we first map infrared action video data to a low-dimensional feature space, and use the cosine similarity between the feature data of the training set and the testing set to determine the weight of the training set samples. Each training set sample has an independent weight. Then, a support vector machine (SVM) is trained on the weighted training set to recognize the infrared actions. Experimental results demonstrate that our approach achieves state-of-the-art performance compared with methods based on hand-crafted features on the benchmark InfAR dataset.
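The sample-weighting idea in this abstract can be sketched as follows. It is not the REWS implementation. The abstract does not say how the similarities are aggregated into a single weight, so this sketch assumes each training sample's weight is its mean cosine similarity to the test-set features, clipped at zero. The function names `cosine` and `sample_weights` are hypothetical.

```python
import math
from typing import List, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two vectors; 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def sample_weights(
    train: Sequence[Sequence[float]], test: Sequence[Sequence[float]]
) -> List[float]:
    """Assumed rule: weight = mean cosine similarity to the test features, floored at 0."""
    weights = []
    for x in train:
        mean_sim = sum(cosine(x, t) for t in test) / len(test)
        weights.append(max(mean_sim, 0.0))
    return weights


# Toy low-dimensional features: the first training sample points in the same
# direction as the test features, the second is orthogonal to them.
train_features = [[1.0, 0.0], [0.0, 1.0]]
test_features = [[1.0, 0.0], [1.0, 0.0]]
print(sample_weights(train_features, test_features))
```

Weights of this kind could then be passed to any SVM trainer that accepts per-sample weights, for example through the `sample_weight` argument of scikit-learn's `SVC.fit`.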


2014 ◽  
Author(s):  
Karla Brkić ◽  
Srđan Rašić ◽  
Axel Pinz ◽  
Siniša Šegvić ◽  
Zoran Kalafatić

2014 ◽  
Vol 44 (5) ◽  
pp. 650-663 ◽  
Author(s):  
Manoj Ramanathan ◽  
Wei-Yun Yau ◽  
Eam Khwang Teoh

2013 ◽  
Vol 18 (2-3) ◽  
pp. 49-60 ◽  
Author(s):  
Damian Dudziński ◽  
Tomasz Kryjak ◽  
Zbigniew Mikrut

Abstract: This paper describes a human action recognition algorithm that uses background generation with shadow elimination, silhouette description based on simple geometrical features, and a finite state machine for recognizing particular actions. The performed tests indicate that this approach obtains an 81% correct recognition rate while allowing real-time image processing of a 360 × 288 video stream.


2018 ◽  
Vol 6 (10) ◽  
pp. 323-328
Author(s):  
K. Kiruba ◽  
D. Shiloah Elizabeth ◽  
C Sunil Retmin Raj
