attribute recognition
Recently Published Documents


TOTAL DOCUMENTS

207
(FIVE YEARS 122)

H-INDEX

14
(FIVE YEARS 4)

2022 ◽  
Vol 12 (2) ◽  
pp. 622
Author(s):  
Saadman Sakib ◽  
Kaushik Deb ◽  
Pranab Kumar Dhar ◽  
Oh-Jin Kwon

The pedestrian attribute recognition task is becoming more popular daily because of its significant role in surveillance scenarios. As the technological advances are significantly more than before, deep learning came to the surface of computer vision. Previous works applied deep learning in different ways to recognize pedestrian attributes. The results are satisfactory, but still, there is some scope for improvement. The transfer learning technique is becoming more popular for its extraordinary performance in reducing computation cost and scarcity of data in any task. This paper proposes a framework that can work in surveillance scenarios to recognize pedestrian attributes. The mask R-CNN object detector extracts the pedestrians. Additionally, we applied transfer learning techniques on different CNN architectures, i.e., Inception ResNet v2, Xception, ResNet 101 v2, ResNet 152 v2. The main contribution of this paper is fine-tuning the ResNet 152 v2 architecture, which is performed by freezing layers, last 4, 8, 12, 14, 20, none, and all. Moreover, data balancing techniques are applied, i.e., oversampling, to resolve the class imbalance problem of the dataset and analysis of the usefulness of this technique is discussed in this paper. Our proposed framework outperforms state-of-the-art methods, and it provides 93.41% mA and 89.24% mA on the RAP v2 and PARSE100K datasets, respectively.


2021 ◽  
Vol 7 (12) ◽  
pp. 264
Author(s):  
Sorn Sooksatra ◽  
Sitapa Rujikietgumjorn

This paper presents an extended model for a pedestrian attribute recognition network utilizing skeleton data as a soft attention model to extract a local feature corresponding to a specific attribute. This technique helped keep valuable information surrounding the target area and handle the variation of human posture. The attention masks were designed to focus on the partial and the whole-body regions. This research utilized an augmented layer for data augmentation inside the network to reduce over-fitting errors. Our network was evaluated in two datasets (RAP and PETA) with various backbone networks (ResNet-50, Inception V3, and Inception-ResNet V2). The experimental result shows that our network improves overall classification performance with a mean accuracy of about 2–3% in the same backbone network, especially local attributes and various human postures.


2021 ◽  
Author(s):  
Ye Li ◽  
Zhaoqian Jia ◽  
Yiyin Ding ◽  
Fangyan Shi ◽  
Guangqiang Yin

2021 ◽  
Author(s):  
Imran N. Junejo

We address the problem of Pedestrian Attribute Recognition (PAR) in this paper. Owing to the presence of surveillance cameras in almost all outdoor and indoor public spaces, keeping and eye on pedestrian is a sought-after task with many useful applications. The problem entails recognizing attributes such as age-group, clothing style, accessories, footwear style etc. This is a multi-label problem and challenging even for human observers. We propose using a convolution neural network (CNN) with trainable Gabor wavelets (TGW) layers. The proposed layers are learnable and adapt to the dataset for a better recognition. The proposed multi-branch neural network is a mix of TGW and convolutional layers and we show its effectiveness on a public dataset.


Author(s):  
Wendong Zhang ◽  
Sha Guan ◽  
Chunzhi Wang ◽  
Yucheng Zhang ◽  
Xianjing Zhou

2021 ◽  
Author(s):  
Xinze Dou ◽  
Yang Liu ◽  
Kai Lv ◽  
Zhang Xiong ◽  
Hao Sheng

Author(s):  
Elham Vahdati ◽  
Ching Y. Suen

Automatic analysis of facial beauty has become an emerging computer vision problem in recent years. Facial beauty prediction (FBP) aims at developing a human-like model that automatically makes facial attractiveness predictions. In this study, we present and evaluate a face attractiveness prediction approach using facial parts as well as a multi-task learning scheme. First, a deep convolutional neural network (CNN) pre-trained on massive face datasets is utilized for face attractiveness prediction, which is capable of automatic learning of high-level face representations. Next, we extend our deep model to other facial attribute recognition tasks. Hence, a multi-task learning scheme is leveraged by our deep model to learn optimal shared features for three correlated tasks (i.e. facial beauty assessment, gender recognition as well as ethnicity identification). To further enhance the attractiveness computation accuracy, specific regions of face images (i.e. left eye, nose and mouth) as well as the whole face are fed into multi-stream CNNs (i.e. three two-stream networks). Each two-stream network adopts a facial part as well as the full face as input. Extensive experiments are conducted on the SCUT-FBP5500 benchmark dataset, where our approach indicates significant improvement in accuracy over the other state-of-the-art methods.


Sign in / Sign up

Export Citation Format

Share Document