Joint Learning of Dictionary and Convolutional Network for Pedestrian Attribute Recognition

Author(s):  
Yan Sha ◽  
Congyan Lang ◽  
Peixi Peng ◽  
Junliang Xing ◽  
Danxia Li
2020 ◽  
Vol 34 (07) ◽  
pp. 12055-12062
Author(s):  
Zichang Tan ◽  
Yang Yang ◽  
Jun Wan ◽  
Guodong Guo ◽  
Stan Z. Li

In this paper, we propose a new end-to-end network, named Joint Learning of Attribute and Contextual relations (JLAC), to solve the task of pedestrian attribute recognition. It includes two novel modules: Attribute Relation Module (ARM) and Contextual Relation Module (CRM). For ARM, we construct an attribute graph with attribute-specific features which are learned by the constrained losses, and further use Graph Convolutional Network (GCN) to explore the correlations among multiple attributes. For CRM, we first propose a graph projection scheme to project the 2-D feature map into a set of nodes from different image regions, and then employ GCN to explore the contextual relations among those regions. Since the relation information in the above two modules is correlated and complementary, we incorporate them into a unified framework to learn both together. Experiments on three benchmarks, including PA-100K, RAP, PETA attribute datasets, demonstrate the effectiveness of the proposed JLAC.


2020 ◽  
pp. 1-1
Author(s):  
Haonan Fan ◽  
Hai-Miao Hu ◽  
Shuailing Liu ◽  
Weiqing Lu ◽  
Shiliang Pu

Author(s):  
Qiaozhe Li ◽  
Xin Zhao ◽  
Ran He ◽  
Kaiqi Huang

Pedestrian attribute recognition in surveillance is a challenging task due to poor image quality, significant appearance variations and diverse spatial distribution of different attributes. This paper treats pedestrian attribute recognition as a sequential attribute prediction problem and proposes a novel visual-semantic graph reasoning framework to address this problem. Our framework contains a spatial graph and a directed semantic graph. By performing reasoning using the Graph Convolutional Network (GCN), one graph captures spatial relations between regions and the other learns potential semantic relations between attributes. An end-to-end architecture is presented to perform mutual embedding between these two graphs to guide the relational learning for each other. We verify the proposed framework on three large scale pedestrian attribute datasets including PETA, RAP, and PA100k. Experiments show superiority of the proposed method over state-of-the-art methods and effectiveness of our joint GCN structures for sequential attribute prediction.


Geophysics ◽  
2021 ◽  
pp. 1-50
Author(s):  
Ahmad Mustafa ◽  
Motaz Alfarraj ◽  
Ghassan AlRegib

Seismic inversion plays a very useful role in detailed stratigraphic interpretation of migrated seismic volumes by enabling the estimation of reservoir properties over the complete volume. Traditional and machine learning-based seismic inversion workflows are limited to inverting each seismic trace independently of other traces to estimate impedance profiles, leading to lateral discontinuities in the presence of noise and large geological variations in the seismic data. In addition, machine learning-based approaches suffer the problem of overfitting if there is only a small number of wells on which the model is trained. We propose a two-pronged strategy to overcome these problems. We present a temporal convolutional network that models seismic traces temporally. We further inject spatial context for each trace into its estimations of the impedance profile. To counter the problem of limited labeled data, we also present a joint learning scheme whereby multiple datasets are simultaneously used for training, sharing beneficial information among each other. This results in the improvement in generalization performance on all datasets. We present a case study of acoustic impedance inversion using the open-source SEAM and Marmousi 2 datasets. Our evaluations show that our proposed approach is able to estimate impedance in the presence of noisy seismic data and a limited number of well logs with greater robustness and spatial consistency. We compare and contrast our approach to other learning-based seismic inversion methodologies in the literature. On SEAM, we are able to obtain an average MSE of 0.0476, the lowest among all other methodologies.


2019 ◽  
Vol 11 (11) ◽  
pp. 245 ◽  
Author(s):  
Xiangpeng Song ◽  
Hongbin Yang ◽  
Congcong Zhou

Pedestrian attribute recognition is to predict a set of attribute labels of the pedestrian from surveillance scenarios, which is a very challenging task for computer vision due to poor image quality, continual appearance variations, as well as diverse spatial distribution of imbalanced attributes. It is desirable to model the label dependencies between different attributes to improve the recognition performance as each pedestrian normally possesses many attributes. In this paper, we treat pedestrian attribute recognition as multi-label classification and propose a novel model based on the graph convolutional network (GCN). The model is mainly divided into two parts, we first use convolutional neural network (CNN) to extract pedestrian feature, which is a normal operation processing image in deep learning, then we transfer attribute labels to word embedding and construct a correlation matrix between labels to help GCN propagate information between nodes. This paper applies the object classifiers learned by GCN to the image representation extracted by CNN to enable the model to have the ability to be end-to-end trainable. Experiments on pedestrian attribute recognition dataset show that the approach obviously outperforms other existing state-of-the-art methods.


Author(s):  
Atsushi Ando ◽  
Ryo Masumura ◽  
Hosana Kamiyama ◽  
Satoshi Kobashikawa ◽  
Yushi Aono

Sign in / Sign up

Export Citation Format

Share Document