video structuring
Recently Published Documents

TOTAL DOCUMENTS

23

(FIVE YEARS 1)

H-INDEX

5

(FIVE YEARS 1)

Latest Documents Most Cited Documents Contributed Authors Related Sources Related Keywords

Pedestrian Attributes Recognition in Surveillance Scenarios Using Multi-Task Lightweight Convolutional Neural Network

Applied Sciences ◽

10.3390/app9194182 ◽

2019 ◽

Vol 9 (19) ◽

pp. 4182 ◽

Author(s):

Pu Yan ◽

Li Zhuo ◽

Jiafeng Li ◽

Hui Zhang ◽

Jing Zhang

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

State Of The Art ◽

Cross Entropy ◽

Semantic Features ◽

Recognition Method ◽

Relationship Model ◽

Fully Connected ◽

Video Structuring

Pedestrian attributes (such as gender, age, hairstyle, and clothing) can effectively represent the appearance of pedestrians. These are high-level semantic features that are robust to illumination, deformation, etc. Therefore, they can be widely used in person re-identification, video structuring analysis and other applications. In this paper, a pedestrian attributes recognition method for surveillance scenarios using a multi-task lightweight convolutional neural network is proposed. Firstly, the labels of the attributes for each pedestrian image are integrated into a label vector. Then, a multi-task lightweight Convolutional Neural Network (CNN) is designed, which consists of five convolutional layers, three pooling layers and two fully connected layers to extract the deep features of pedestrian images. Considering that the data distribution of the datasets is unbalanced, the loss function is improved based on the sigmoid cross-entropy, and the scale factor is added to balance the amount of various attributes data. Through training the network, the mapping relationship model between the deep features of pedestrian images and the integration label vector of their attributes is established, which can be used to predict each attribute of the pedestrian. The experiments were conducted on two public pedestrian attributes datasets in surveillance scenarios, namely PETA and RAP. The results show that, compared with the state-of-the-art pedestrian attributes recognition methods, the proposed method can achieve a superior accuracy by 91.88% on PETA and 87.44% on RAP respectively.

Download Full-text

Improving Cluster Selection and Event Modeling in Unsupervised Mining for Automatic Audiovisual Video Structuring

Lecture Notes in Computer Science - Advances in Multimedia Modeling ◽

10.1007/978-3-642-27355-1_49 ◽

2012 ◽

pp. 529-540 ◽

Author(s):

Anh-Phuong Ta ◽

Mathieu Ben ◽

Guillaume Gravier

Keyword(s):

Event Modeling ◽

Video Structuring

Download Full-text

Automatic Multilevel Temporal Video Structuring

2011 IEEE Fifth International Conference on Semantic Computing ◽

10.1109/icsc.2011.39 ◽

2011 ◽

Author(s):

Ruxandra Tapu ◽

Titus Zaharia

Keyword(s):

Video Structuring

Download Full-text

Video Structuring

10.1007/springerreference_66053 ◽

2011 ◽

Keyword(s):

Video Structuring

Download Full-text

Temporal video structuring for preservation and annotation of video content

2009 16th IEEE International Conference on Image Processing (ICIP) ◽

10.1109/icip.2009.5414114 ◽

2009 ◽

Author(s):

Christian Petersohn

Keyword(s):

Video Content ◽

Video Structuring

Download Full-text

Video Structuring

Encyclopedia of Database Systems ◽

10.1007/978-0-387-39940-9_3948 ◽

2009 ◽

pp. 3320-3320

Keyword(s):

Video Structuring

Download Full-text

Home video structuring with a two-layer shot clustering approach

2008 3rd International Symposium on Communications, Control and Signal Processing ◽

10.1109/isccsp.2008.4537277 ◽

2008 ◽

Author(s):

Yu-Jin Zhang ◽

Fan Jiang

Keyword(s):

Clustering Approach ◽

Video Structuring

Download Full-text

Detecting repeats for video structuring

Multimedia Tools and Applications ◽

10.1007/s11042-007-0180-1 ◽

2007 ◽

Vol 38 (2) ◽

pp. 233-252 ◽

Author(s):

Xavier Naturel ◽

Patrick Gros

Keyword(s):

Video Structuring

Download Full-text

Using Graphics Processor Units (GPUs) for Automatic Video Structuring

Eighth International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS '07) ◽

10.1109/wiamis.2007.85 ◽

2007 ◽

Author(s):

Peter Kehoe ◽

Alan F. Smeaton

Keyword(s):

Graphics Processor ◽

Video Structuring

Download Full-text

Score oriented Viterbi search in sport video structuring using HMM and segment models

2006 IEEE Workshop on Multimedia Signal Processing ◽

10.1109/mmsp.2006.285356 ◽

2006 ◽

Author(s):

M. Delakis ◽

G. Gravier ◽

P. Gros

Keyword(s):

Sport Video ◽

Viterbi Search ◽

Video Structuring

Download Full-text