action localization Latest Research Papers

COWO: towards real-time spatiotemporal action localization in videos

Assembly Automation ◽

10.1108/aa-07-2021-0098 ◽

2022 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Yang Yi ◽

Yang Sun ◽

Saimei Yuan ◽

Yiji Zhu ◽

Mengyi Zhang ◽

...

Keyword(s):

Real Time ◽

Loss Function ◽

Real World ◽

Pearson Correlation ◽

Safety Monitoring ◽

Data Sets ◽

Human Actions ◽

Content Type ◽

Collaborative Assembly ◽

Action Localization

Purpose The purpose of this paper is to provide a fast and accurate network for spatiotemporal action localization in videos. It detects human actions both in time and space simultaneously in real-time, which is applicable in real-world scenarios such as safety monitoring and collaborative assembly. Design/methodology/approach This paper design an end-to-end deep learning network called collaborator only watch once (COWO). COWO recognizes the ongoing human activities in real-time with enhanced accuracy. COWO inherits from the architecture of you only watch once (YOWO), known to be the best performing network for online action localization to date, but with three major structural modifications: COWO enhances the intraclass compactness and enlarges the interclass separability in the feature level. A new correlation channel fusion and attention mechanism are designed based on the Pearson correlation coefficient. Accordingly, a correction loss function is designed. This function minimizes the same class distance and enhances the intraclass compactness. Use a probabilistic K-means clustering technique for selecting the initial seed points. The idea behind this is that the initial distance between cluster centers should be as considerable as possible. CIOU regression loss function is applied instead of the Smooth L1 loss function to help the model converge stably. Findings COWO outperforms the original YOWO with improvements of frame mAP 3% and 2.1% at a speed of 35.12 fps. Compared with the two-stream, T-CNN, C3D, the improvement is about 5% and 14.5% when applied to J-HMDB-21, UCF101-24 and AGOT data sets. Originality/value COWO extends more flexibility for assembly scenarios as it perceives spatiotemporal human actions in real-time. It contributes to many real-world scenarios such as safety monitoring and collaborative assembly.

Enhancing Class-semantics Features' Locating Performance for Temporal Action Localization

10.1109/ic-nidc54101.2021.9660459 ◽

2021 ◽

Author(s):

Jianmin Zhang ◽

Jianqin Yin

Keyword(s):

Action Localization ◽

Temporal Action

Improved Spatio-temporal Action Localization for Surveillance Videos

10.1109/dicta52665.2021.9647106 ◽

2021 ◽

Author(s):

Morgan Liang ◽

Xun Li ◽

Sandersan Onie ◽

Mark Larsen ◽

Arcot Sowmya

Keyword(s):

Surveillance Videos ◽

Action Localization ◽

Spatio Temporal ◽

Temporal Action

Multi‐scale feature learning and temporal probing strategy for one‐stage temporal action localization

International Journal of Intelligent Systems ◽

10.1002/int.22713 ◽

2021 ◽

Author(s):

Leiyue Yao ◽

Wei Yang ◽

Wei Huang ◽

Nan Jiang ◽

Bingbing Zhou

Keyword(s):

Feature Learning ◽

Scale Feature ◽

Multi Scale ◽

One Stage ◽

Action Localization ◽

Temporal Action

Weakly-Supervised Temporal Action Localization via Cross-Stream Collaborative Learning

10.1145/3474085.3475261 ◽

2021 ◽

Author(s):

Yuan Ji ◽

Xu Jia ◽

Huchuan Lu ◽

Xiang Ruan

Keyword(s):

Collaborative Learning ◽

Action Localization ◽

Weakly Supervised ◽

Temporal Action

Action Relational Graph for Weakly-Supervised Temporal Action Localization

10.1109/icip42928.2021.9506622 ◽

2021 ◽

Author(s):

Yi Cheng ◽

Ying Sun ◽

Dongyun Lin ◽

Joo-Hwee Lim

Keyword(s):

Action Localization ◽

Weakly Supervised ◽

Temporal Action

Few-Shot Action Localization without Knowing Boundaries

10.1145/3460426.3463643 ◽

2021 ◽

Author(s):

Ting-Ting Xie ◽

Christos Tzelepis ◽

Fan Fu ◽

Ioannis Patras

Keyword(s):

Action Localization

Self-Supervised Video Action Localization with Adversarial Temporal Transforms

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/96 ◽

2021 ◽

Author(s):

Guoqiang Gong ◽

Liangfeng Zheng ◽

Wenhao Jiang ◽

Yadong Mu

Keyword(s):

State Of The Art ◽

Experimental Results ◽

Policy Network ◽

Video Classification ◽

Time Warping ◽

Action Localization ◽

Temporal Boundary ◽

Consistency Constraint ◽

Weakly Supervised ◽

Temporal Action

Weakly-supervised temporal action localization aims to locate intervals of action instances with only video-level action labels for training. However, the localization results generated from video classification networks are often not accurate due to the lack of temporal boundary annotation of actions. Our motivating insight is that the temporal boundary of action should be stably predicted under various temporal transforms. This inspires a self-supervised equivariant transform consistency constraint. We design a set of temporal transform operations, including naive temporal down-sampling to learnable attention-piloted time warping. In our model, a localization network aims to perform well under all transforms, and another policy network is designed to choose a temporal transform at each iteration that adversarially brings localization result inconsistent with the localization network's. Additionally, we devise a self-refine module to enhance the completeness of action intervals harnessing temporal and semantic contexts. Experimental results on THUMOS14 and ActivityNet demonstrate that our model consistently outperforms the state-of-the-art weakly-supervised temporal action localization methods.

SAPS: Self-attentive pathway search for weakly-supervised action localization with background-action augmentation

Computer Vision and Image Understanding ◽

10.1016/j.cviu.2021.103256 ◽

2021 ◽

pp. 103256

Author(s):

Xiao-Yu Zhang ◽

Yaru Zhang ◽

Haichao Shi ◽

Jing Dong

Keyword(s):

Action Localization ◽

Weakly Supervised ◽

Pathway Search

Deep feature enhancing and selecting network for weakly supervised temporal action localization

Journal of Visual Communication and Image Representation ◽

10.1016/j.jvcir.2021.103276 ◽

2021 ◽

Author(s):

Jiaruo Yu ◽

Yongxin Ge ◽

Xiaolei Qin ◽

Ziqiang Li ◽

Sheng Huang ◽

...

Keyword(s):

Deep Feature ◽

Action Localization ◽

Weakly Supervised ◽

Temporal Action

action localization
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

COWO: towards real-time spatiotemporal action localization in videos

Enhancing Class-semantics Features' Locating Performance for Temporal Action Localization

Improved Spatio-temporal Action Localization for Surveillance Videos

Multi‐scale feature learning and temporal probing strategy for one‐stage temporal action localization

Weakly-Supervised Temporal Action Localization via Cross-Stream Collaborative Learning

Action Relational Graph for Weakly-Supervised Temporal Action Localization

Few-Shot Action Localization without Knowing Boundaries

Self-Supervised Video Action Localization with Adversarial Temporal Transforms

SAPS: Self-attentive pathway search for weakly-supervised action localization with background-action augmentation

Deep feature enhancing and selecting network for weakly supervised temporal action localization

Export Citation Format

action localizationRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

COWO: towards real-time spatiotemporal action localization in videos

Enhancing Class-semantics Features' Locating Performance for Temporal Action Localization

Improved Spatio-temporal Action Localization for Surveillance Videos

Multi‐scale feature learning and temporal probing strategy for one‐stage temporal action localization

Weakly-Supervised Temporal Action Localization via Cross-Stream Collaborative Learning

Action Relational Graph for Weakly-Supervised Temporal Action Localization

Few-Shot Action Localization without Knowing Boundaries

Self-Supervised Video Action Localization with Adversarial Temporal Transforms

SAPS: Self-attentive pathway search for weakly-supervised action localization with background-action augmentation

Deep feature enhancing and selecting network for weakly supervised temporal action localization

action localization
Recently Published Documents