Category-Level Multi-Attention based Boundary Refinement for Action Detection

Author(s):  
Peixiang Dong ◽  
Lisong Zhu ◽  
Yong Zhang
2020 ◽  
Vol 34 (07) ◽  
pp. 11612-11619
Author(s):  
Qinying Liu ◽  
Zilei Wang

Temporal action detection is a challenging task due to vagueness of action boundaries. To tackle this issue, we propose an end-to-end progressive boundary refinement network (PBRNet) in this paper. PBRNet belongs to the family of one-stage detectors and is equipped with three cascaded detection modules for localizing action boundary more and more precisely. Specifically, PBRNet mainly consists of coarse pyramidal detection, refined pyramidal detection, and fine-grained detection. The first two modules build two feature pyramids to perform the anchor-based detection, and the third one explores the frame-level features to refine the boundaries of each action instance. In the fined-grained detection module, three frame-level classification branches are proposed to augment the frame-level features and update the confidence scores of action instances. Evidently, PBRNet integrates the anchor-based and frame-level methods. We experimentally evaluate the proposed PBRNet and comprehensively investigate the effect of the main components. The results show PBRNet achieves the state-of-the-art detection performances on two popular benchmarks: THUMOS'14 and ActivityNet, and meanwhile possesses a high inference speed.


2021 ◽  
Vol 206 ◽  
pp. 103187
Author(s):  
Matteo Tomei ◽  
Lorenzo Baraldi ◽  
Simone Calderara ◽  
Simone Bronzin ◽  
Rita Cucchiara

2020 ◽  
Vol 14 (5) ◽  
pp. 177-184
Author(s):  
Ran Cui ◽  
Aichun Zhu ◽  
Jingran Wu ◽  
Gang Hua

2015 ◽  
Vol 17 (4) ◽  
pp. 512-525 ◽  
Author(s):  
Zhong Zhou ◽  
Feng Shi ◽  
Wei Wu

Digital Twin ◽  
2021 ◽  
Vol 1 ◽  
pp. 10
Author(s):  
Qing Hong ◽  
Yifeng Sun ◽  
Tingyu Liu ◽  
Liang Fu ◽  
Yunfeng Xie

Background: Intelligent monitoring of human action in production is an important step to help standardize production processes and construct a digital twin shop-floor rapidly. Human action has a significant impact on the production safety and efficiency of a shop-floor, however, because of the high individual initiative of humans, it is difficult to realize real-time action detection in a digital twin shop-floor. Methods: We proposed a real-time detection approach for shop-floor production action. This approach used the sequence data of continuous human skeleton joints sequences as the input. We then reconstructed the Joint Classification-Regression Recurrent Neural Networks (JCR-RNN) based on Temporal Convolution Network (TCN) and Graph Convolution Network (GCN). We called this approach the Temporal Action Detection Net (TAD-Net), which realized real-time shop-floor production action detection. Results: The results of the verification experiment showed that our approach has achieved a high temporal positioning score, recognition speed, and accuracy when applied to the existing Online Action Detection (OAD) dataset and the Nanjing University of Science and Technology 3 Dimensions (NJUST3D) dataset. TAD-Net can meet the actual needs of the digital twin shop-floor. Conclusions: Our method has higher recognition accuracy, temporal positioning accuracy, and faster running speed than other mainstream network models, it can better meet actual application requirements, and has important research value and practical significance for standardizing shop-floor production processes, reducing production security risks, and contributing to the understanding of real-time production action.


Sign in / Sign up

Export Citation Format

Share Document