Action recognition with various speeds and timed-DMHI feature vectors

The feature vectors of speaker identification system plays a crucial role in the overall performance of the system. There are many new feature vectors extraction methods based on MFCC, but ultimately we want to maximize the performance of SID system. The objective of this paper to derive Gammatone Frequency Cepstral Coefficients (GFCC) based a new set of feature vectors using Gaussian Mixer model (GMM) for speaker identification. The MFCC are the default feature vectors for speaker recognition, but they are not very robust at the presence of additive noise. The GFCC features in recent studies have shown very good robustness against noise and acoustic change. The main idea is GFCC features based on GMM feature extraction is to improve the overall speaker identification performance in low signal to noise ratio (SNR) conditions.

Download Full-text

Human action recognition using simple geometric features and a finite state machine

Image Processing & Communications ◽

10.2478/v10248-012-0079-y ◽

2013 ◽

Vol 18 (2-3) ◽

pp. 49-60 ◽

Cited By ~ 2

Author(s):

Damian Dudzńiski ◽

Tomasz Kryjak ◽

Zbigniew Mikrut

Keyword(s):

Action Recognition ◽

Finite State Machine ◽

Recognition Rate ◽

Human Action Recognition ◽

Human Action ◽

Video Stream ◽

State Machine ◽

Recognition Algorithm ◽

Finite State ◽

Correct Recognition Rate

Abstract In this paper a human action recognition algorithm, which uses background generation with shadow elimination, silhouette description based on simple geometrical features and a finite state machine for recognizing particular actions is described. The performed tests indicate that this approach obtains a 81 % correct recognition rate allowing real-time image processing of a 360 X 288 video stream.

Download Full-text

Deep Learning for Human Action Recognition Survey

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i10.323328 ◽

2018 ◽

Vol 6 (10) ◽

pp. 323-328

Author(s):

K.Kiruba . ◽

D. Shiloah Elizabeth ◽

C Sunil Retmin Raj

Keyword(s):

Deep Learning ◽

Action Recognition ◽

Human Action Recognition ◽

Human Action

Download Full-text

Improving the Computational Cost for Copied Region Detection in Forensic Images

Journal of Science and Technology Issue on Information and Communications Technology ◽

10.31130/jst.2016.28 ◽

2016 ◽

Vol 2 (1) ◽

pp. 55

Author(s):

Tu Huynh-Kha ◽

Thuong Le-Tien ◽

Synh Ha ◽

Khoa Huynh-Van

Keyword(s):

Wavelet Transform ◽

Euclidean Distance ◽

Research Work ◽

Computational Cost ◽

Correlation Coefficients ◽

Zernike Moments ◽

Computational Time ◽

Discrete Wavelet ◽

Feature Vectors ◽

Region Detection

This research work develops a new method to detect the forgery in image by combining the Wavelet transform and modified Zernike Moments (MZMs) in which the features are defined from more pixels than in traditional Zernike Moments. The tested image is firstly converted to grayscale and applied one level Discrete Wavelet Transform (DWT) to reduce the size of image by a half in both sides. The approximation sub-band (LL), which is used for processing, is then divided into overlapping blocks and modified Zernike moments are calculated in each block as feature vectors. More pixels are considered, more sufficient features are extracted. Lexicographical sorting and correlation coefficients computation on feature vectors are next steps to find the similar blocks. The purpose of applying DWT to reduce the dimension of the image before using Zernike moments with updated coefficients is to improve the computational time and increase exactness in detection. Copied or duplicated parts will be detected as traces of copy-move forgery manipulation based on a threshold of correlation coefficients and confirmed exactly from the constraint of Euclidean distance. Comparisons results between proposed method and related ones prove the feasibility and efficiency of the proposed algorithm.

Download Full-text

Surgical Action Recognition with Spatiotemporal Convolutional Neural Networks

10.31256/hsmr2019.17 ◽

2019 ◽

Author(s):

Giacomo De Rossi ◽

◽

Nicola Piccinelli ◽

Francesco Setti ◽

Riccardo Muradore ◽

...

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Action Recognition

Download Full-text

Vertical View Human Action Recognition from Range Images

Proceedings of the International Display Workshops ◽

10.36463/idw.2019.1342 ◽

2019 ◽

pp. 1342

Author(s):

Akinobu Watanabe ◽

Keiichi Mitani

Keyword(s):

Action Recognition ◽

Human Action Recognition ◽

Human Action ◽

Range Images

Download Full-text

Vertical View Human Action Recognition from Range Images

Proceedings of the International Display Workshops ◽

10.36463/idw.2019.prj6_ais3-2 ◽

2019 ◽

pp. 1342

Author(s):

Akinobu Watanabe ◽

Keiichi Mitani

Keyword(s):

Action Recognition ◽

Human Action Recognition ◽

Human Action ◽

Range Images

Download Full-text

Effective video feature for action recognition

Journal of Computer Applications ◽

10.3724/sp.j.1087.2011.00406 ◽

2011 ◽

Vol 31 (2) ◽

pp. 406-409 ◽

Cited By ~ 1

Author(s):

Ying-jie LI ◽

Yi-xin YIN ◽

Fei DENG

Keyword(s):

Action Recognition ◽

Video Feature

Download Full-text

Human Action Recognition Based on Discriminative Sparse Coding Video Representation

ROBOT ◽

10.3724/sp.j.1218.2012.00745 ◽

2012 ◽

Vol 34 (6) ◽

pp. 745 ◽

Cited By ~ 5

Author(s):

Bin WANG ◽

Yuanyuan WANG ◽

Wenhua XIAO ◽

Wei WANG ◽

Maojun ZHANG

Keyword(s):

Action Recognition ◽

Sparse Coding ◽

Human Action Recognition ◽

Human Action ◽

Video Representation

Download Full-text

Fusing Dynamic Images and Depth Motion Maps for Action Recognition in Surveillance Systems

International Journal of Sensors Wireless Communications and Control ◽

10.2174/2210327909666191209155141 ◽

2019 ◽

Vol 09 ◽

Author(s):

Rajat Khurana ◽

Alok Kumar Singh Kushwaha

Keyword(s):

Action Recognition ◽

Depth Map ◽

Good Choice ◽

Surveillance Systems ◽

Activity Detection ◽

Learning Capability ◽

Proposed Model ◽

Care Activity ◽

Increasing Demand ◽

Depth Motion Maps

Background & Objective: Identification of human actions from video has gathered much attention in past few years. Most of the computer vision tasks such as Health Care Activity Detection, Suspicious Activity detection, Human Computer Interactions etc. are based on the principle of activity detection. Automatic labelling of activity from videos frames is known as activity detection. Motivation of this work is to use most out of the data generated from sensors and use them for recognition of classes. Recognition of actions from videos sequences is a growing field with the upcoming trends of deep neural networks. Automatic learning capability of Convolutional Neural Network (CNN) make them good choice as compared to traditional handcrafted based approaches. With the increasing demand of RGB-D sensors combination of RGB and depth data is in great demand. This work comprises of the use of dynamic images generated from RGB combined with depth map for action recognition purpose. We have experimented our approach on pre trained VGG-F model using MSR Daily activity dataset and UTD MHAD Dataset. We achieve state of the art results. To support our research, we have calculated different parameters apart from accuracy such as precision, F score, recall. Conclusion: Accordingly, the investigation confirms improvement in term of accuracy, precision, F-Score and Recall. The proposed model is 4 Stream model is prone to occlusion, used in real time and also the data from the RGB-D sensor is fully utilized.

Download Full-text