action detection
Recently Published Documents

TOTAL DOCUMENTS: 313 (FIVE YEARS: 151)
H-INDEX: 28 (FIVE YEARS: 6)

2022 ◽  
Vol 2022 ◽  
pp. 1-10
Author(s):  
Siyu Zhang

To further improve the accuracy of aerobics action detection, a detection method based on improved multiscale features is proposed. Building on Faster R-CNN and addressing its known limitations, the method uses a feature pyramid network (FPN) to extract aerobics action image features, so that low-level information in the images is preserved and fused into high-resolution, deep semantic feature maps. A target detector is then constructed from the anchors generated on these features to detect aerobics actions. The results show that the proposed method reduces the network's loss to 0.2 and reaches an accuracy of 96.5%, outperforming the compared methods and demonstrating the feasibility of the approach.
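For readers unfamiliar with the architecture, the following is a minimal sketch (not the paper's code) of a Faster R-CNN detector with an FPN backbone using PyTorch/torchvision; the two-class label set and input size are placeholders.

```python
import torch
from torchvision.models.detection import fasterrcnn_resnet50_fpn

# Faster R-CNN with a ResNet-50 + FPN backbone: the FPN merges shallow,
# high-resolution feature maps with deep semantic ones via a top-down path,
# which is the multiscale improvement the abstract describes.
# (torchvision may download backbone weights by default.)
model = fasterrcnn_resnet50_fpn(num_classes=2)  # background + action class
model.eval()

frame = torch.rand(3, 480, 640)  # stand-in for one aerobics video frame
with torch.no_grad():
    out = model([frame])  # per-image dict of boxes, labels, scores
print(out[0]["boxes"].shape, out[0]["scores"].shape)
```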


Micromachines ◽  
2021 ◽  
Vol 13 (1) ◽  
pp. 72
Author(s):  
Dengshan Li ◽  
Rujing Wang ◽  
Peng Chen ◽  
Chengjun Xie ◽  
Qiong Zhou ◽  
...  

Video object detection and human action detection are applied in many fields, such as video surveillance and face recognition. Video object detection covers both object classification and object localization within a frame, while human action recognition is the detection of human actions. Video detection is usually more challenging than image detection, since video frames are often blurrier than still images and suffer from additional difficulties such as video defocus, motion blur, and partial occlusion. Current video detection technology can achieve real-time detection, or highly accurate detection on blurry video frames. In this paper, various video object and human action detection approaches are reviewed and discussed, many of which have achieved state-of-the-art results. We mainly review classic video detection methods based on supervised learning, along with the frequently used video object detection and human action recognition datasets. Finally, we summarize the field: video object and human action detection methods can be classified into frame-by-frame (frame-based) detection, key-frame-based detection, and temporal-information-based detection, and the main ways of exploiting temporal information across adjacent frames are optical flow, Long Short-Term Memory, and convolution across adjacent frames.
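For illustration, the sketch below computes one of the temporal cues mentioned above, dense optical flow between adjacent frames, using OpenCV's Farneback method; the video path is a placeholder.

```python
import cv2

cap = cv2.VideoCapture("video.mp4")  # placeholder path
ok, prev = cap.read()
prev_gray = cv2.cvtColor(prev, cv2.COLOR_BGR2GRAY)

while True:
    ok, frame = cap.read()
    if not ok:
        break
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    # flow[y, x] = (dx, dy): per-pixel motion relative to the previous frame.
    flow = cv2.calcOpticalFlowFarneback(prev_gray, gray, None,
                                        0.5, 3, 15, 3, 5, 1.2, 0)
    magnitude, _ = cv2.cartToPolar(flow[..., 0], flow[..., 1])
    print("mean motion magnitude:", float(magnitude.mean()))
    prev_gray = gray

cap.release()
```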


2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Quanping Shen ◽  
Songzhong Ye

Technical movement analysis requires specialized domain knowledge and the processing of large amounts of data, and AI's strengths in data processing can improve the efficiency of this analysis. In this paper, we propose a feature pyramid network-based temporal action detection (FPN-TAD) algorithm to address the low recall of current video temporal action detection algorithms, whose action proposal modules tend to miss small-scale temporal action regions. The paper is organized in three parts: the first gives an overview of the algorithm, the second describes the network structure and working principle of FPN-TAD, and the third presents the experimental results and their analysis.
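The multiscale-recall idea can be illustrated with a hypothetical sketch of temporal anchor generation: candidate segments of several durations are placed along the time axis so that short actions are not missed. The scales and stride below are illustrative, not the paper's settings.

```python
def temporal_anchors(num_frames, scales=(8, 16, 32, 64), stride=4):
    """Return (start, end) frame proposals at several temporal scales."""
    proposals = []
    for scale in scales:
        for start in range(0, max(num_frames - scale, 1), stride):
            proposals.append((start, min(start + scale, num_frames)))
    return proposals

print(len(temporal_anchors(300)))   # number of candidates over a 300-frame clip
print(temporal_anchors(300)[:3])    # a few short-scale proposals
```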


Digital Twin ◽  
2021 ◽  
Vol 1 ◽  
pp. 10
Author(s):  
Qing Hong ◽  
Yifeng Sun ◽  
Tingyu Liu ◽  
Liang Fu ◽  
Yunfeng Xie

Background: Intelligent monitoring of human actions in production is an important step toward standardizing production processes and rapidly constructing a digital twin shop-floor. Human actions strongly affect the safety and efficiency of a shop-floor; however, because of the high individual initiative of human workers, real-time action detection in a digital twin shop-floor is difficult to realize. Methods: We propose a real-time detection approach for shop-floor production actions. The approach takes continuous sequences of human skeleton joints as input. We reconstructed the Joint Classification-Regression Recurrent Neural Network (JCR-RNN) using a Temporal Convolution Network (TCN) and a Graph Convolution Network (GCN), yielding what we call the Temporal Action Detection Net (TAD-Net), which achieves real-time detection of shop-floor production actions. Results: Verification experiments showed that the approach achieves high temporal localization scores, recognition speed, and accuracy on the existing Online Action Detection (OAD) dataset and the Nanjing University of Science and Technology 3 Dimensions (NJUST3D) dataset, so TAD-Net can meet the practical needs of the digital twin shop-floor. Conclusions: Our method offers higher recognition accuracy, better temporal localization accuracy, and faster running speed than other mainstream network models. It can better meet practical application requirements and has research value and practical significance for standardizing shop-floor production processes, reducing production safety risks, and understanding production actions in real time.
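As a rough illustration of skeleton-sequence input and temporal convolution (not TAD-Net itself), the sketch below applies a 1D temporal convolution stack to a stream of joint coordinates and classifies each frame; the joint count, channel widths, and class count are placeholders.

```python
import torch
import torch.nn as nn

num_joints, coords = 25, 3                 # e.g. 25 joints with (x, y, z)
in_channels = num_joints * coords

tcn = nn.Sequential(
    nn.Conv1d(in_channels, 128, kernel_size=9, padding=4),  # temporal conv over frames
    nn.ReLU(),
    nn.Conv1d(128, 128, kernel_size=9, padding=4),
    nn.ReLU(),
)
classifier = nn.Conv1d(128, 10, kernel_size=1)  # 10 action classes, scored per frame

seq = torch.rand(1, in_channels, 120)      # (batch, joints*coords, frames)
frame_logits = classifier(tcn(seq))        # per-frame class scores: (1, 10, 120)
print(frame_logits.shape)
```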


2021 ◽  
Vol 11 (23) ◽  
pp. 11171
Author(s):  
Shushi Namba ◽  
Wataru Sato ◽  
Sakiko Yoshikawa

Automatic facial action detection is important, but no previous study has evaluated how accurately pre-trained models detect facial actions as the face rotates from frontal to profile view. Using static facial images captured at several angles (0°, 15°, 30°, and 45°), we investigated the performance of three automated facial action detection systems (FaceReader, OpenFace, and Py-Feat). Overall performance was best for OpenFace, followed by FaceReader and Py-Feat. FaceReader's performance dropped significantly at 45° compared with the other angles, whereas Py-Feat's performance did not differ among the four angles. OpenFace's performance decreased as the target face turned sideways. Prediction accuracy and robustness to angle changes varied with the target facial component and the detection system.
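A hypothetical evaluation sketch mirroring this setup is given below: the same expression is scored at several yaw angles and per-angle accuracy is compared. `detect_aus` is a placeholder where FaceReader, OpenFace, or Py-Feat output would be plugged in; it is not a real API call.

```python
from typing import Dict, Set

def detect_aus(image_path: str) -> Set[int]:
    """Placeholder: return the set of action units detected in the image."""
    raise NotImplementedError("plug in FaceReader / OpenFace / Py-Feat output here")

def angle_accuracy(images_by_angle: Dict[int, str], ground_truth: Set[int]) -> Dict[int, float]:
    """Fraction of ground-truth AUs recovered at each head angle."""
    scores = {}
    for angle, path in images_by_angle.items():
        detected = detect_aus(path)
        scores[angle] = len(detected & ground_truth) / len(ground_truth)
    return scores

# e.g. angle_accuracy({0: "f0.jpg", 15: "f15.jpg", 30: "f30.jpg", 45: "f45.jpg"}, {6, 12})
```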


Author(s):  
Wen Wang ◽  
Xiaojiang Peng ◽  
Yu Qiao ◽  
Jian Cheng

Online action detection (OAD) is a practical yet challenging task that has attracted increasing attention in recent years. A typical OAD system consists of three modules: a frame-level feature extractor, usually based on pre-trained deep Convolutional Neural Networks (CNNs); a temporal modeling module; and an action classifier. Among them, the temporal modeling module is crucial, since it aggregates discriminative information from historical and current features. Although many temporal modeling methods have been developed for OAD and other tasks, their effects on OAD have not been investigated fairly. This paper provides an empirical study of temporal modeling for OAD covering four meta types of temporal modeling methods, i.e., temporal pooling, temporal convolution, recurrent neural networks, and temporal attention, and uncovers good practices for building a state-of-the-art OAD system. Many of these methods are explored in OAD for the first time and are extensively evaluated under various hyperparameters. Furthermore, based on our empirical study, we present several hybrid temporal modeling methods. Our best networks, i.e., the hybridization of DCC, LSTM, and M-NL, and the hybridization of DCC and M-NL, outperform previously published results by sizable margins on the THUMOS-14 dataset (48.6% vs. 47.2%) and the TVSeries dataset (84.3% vs. 83.7%).
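To make the meta types concrete, the sketch below shows two of them, temporal average pooling and an LSTM over per-frame CNN features, in PyTorch; the feature dimension, class count, and window length are placeholders.

```python
import torch
import torch.nn as nn

feat_dim, num_classes, T = 2048, 21, 64    # e.g. THUMOS-14: 20 actions + background
frame_feats = torch.rand(1, T, feat_dim)   # (batch, time, per-frame CNN feature)

# (a) temporal average pooling over the history window
pooled = frame_feats.mean(dim=1)                   # (1, feat_dim)

# (b) recurrent modeling: an LSTM carries history into the current step
lstm = nn.LSTM(feat_dim, 512, batch_first=True)
classifier = nn.Linear(512, num_classes)
hidden, _ = lstm(frame_feats)                      # (1, T, 512)
logits = classifier(hidden[:, -1])                 # action scores for the current frame
print(pooled.shape, logits.shape)
```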


2021 ◽  
Vol 11 (22) ◽  
pp. 10693
Author(s):  
Yujin Choi ◽  
Wookho Son ◽  
Yoon Sang Kim

Various studies have examined latency in remote mixed reality collaboration (remote MR collaboration), but studies on interaction latency are scarce. Interaction latency arises in a remote MR collaboration because detecting the action between a human and a virtual object (such as contact or collision) is required to identify the interaction being performed. In this paper, we therefore propose a method based on interaction prediction to reduce the time needed to detect the action between a human and a virtual object. The proposed method predicts an interaction from consecutive joint angles. To examine its effectiveness, we conducted an experiment; the results confirm that the proposed method reduces interaction latency compared with conventional methods.
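A hypothetical sketch of the prediction idea: extrapolate the recent trend of a joint angle to anticipate contact with a virtual object before the collision test fires. The linear-extrapolation model and thresholds are illustrative only, not the paper's method.

```python
import numpy as np

def predict_next_angle(angles, horizon=3):
    """Fit a line to recent joint angles and extrapolate `horizon` frames ahead."""
    t = np.arange(len(angles))
    slope, intercept = np.polyfit(t, angles, 1)
    return slope * (len(angles) - 1 + horizon) + intercept

recent = [40.0, 44.5, 49.1, 53.8, 58.2]   # elbow angle over the last frames (degrees)
predicted = predict_next_angle(recent)
if predicted > 65.0:                       # reach threshold for the target object
    print("pre-trigger interaction: predicted angle", round(predicted, 1))
```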

