Meta Parsing Networks: Towards Generalized Few-shot Scene Parsing with Adaptive Metric Learning

Author(s):  
Peike Li ◽  
Yunchao Wei ◽  
Yi Yang
2020 ◽  
pp. 1-1
Author(s):  
Feiyu Chen ◽  
Jie Shao ◽  
Yonghui Zhang ◽  
Xing Xu ◽  
Heng Tao Shen

Author(s):  
Shuo Chen ◽  
Chen Gong ◽  
Jian Yang ◽  
Ying Tai ◽  
Le Hui ◽  
...  

The central problem for most existing metric learning methods is to find a suitable projection matrix on the differences of all pairs of data points. However, a single unified projection matrix can hardly characterize all data similarities accurately as the practical data are usually very complicated, and simply adopting one global projection matrix might ignore important local patterns hidden in the dataset. To address this issue, this paper proposes a novel method dubbed “Data-Adaptive Metric Learning” (DAML), which constructs a data-adaptive projection matrix for each data pair by selectively combining a set of learned candidate matrices. As a result, every data pair can obtain a specific projection matrix, enabling the proposed DAML to flexibly fit the training data and produce discriminative projection results. The model of DAML is formulated as an optimization problem which jointly learns candidate projection matrices and their sparse combination for every data pair. Nevertheless, the over-fitting problem may occur due to the large amount of parameters to be learned. To tackle this issue, we adopt the Total Variation (TV) regularizer to align the scales of data embedding produced by all candidate projection matrices, and thus the generated metrics of these learned candidates are generally comparable. Furthermore, we extend the basic linear DAML model to the kernerlized version (denoted “KDAML”) to handle the non-linear cases, and the Iterative Shrinkage-Thresholding Algorithm (ISTA) is employed to solve the optimization model. Intensive experimental results on various applications including retrieval, classification, and verification clearly demonstrate the superiority of our algorithm to other state-of-the-art metric learning methodologies.


2014 ◽  
Vol E97.D (11) ◽  
pp. 2888-2902 ◽  
Author(s):  
Guanwen ZHANG ◽  
Jien KATO ◽  
Yu WANG ◽  
Kenji MASE

2015 ◽  
Vol 24 (11) ◽  
pp. 3321-3331 ◽  
Author(s):  
Shuang Li ◽  
Huchuan Lu ◽  
Zhe Lin ◽  
Xiaohui Shen ◽  
Brian Price

Author(s):  
Rui Qian ◽  
Yunchao Wei ◽  
Honghui Shi ◽  
Jiachen Li ◽  
Jiaying Liu ◽  
...  

Semantic scene parsing is suffering from the fact that pixellevel annotations are hard to be collected. To tackle this issue, we propose a Point-based Distance Metric Learning (PDML) in this paper. PDML does not require dense annotated masks and only leverages several labeled points that are much easier to obtain to guide the training process. Concretely, we leverage semantic relationship among the annotated points by encouraging the feature representations of the intra- and intercategory points to keep consistent, i.e. points within the same category should have more similar feature representations compared to those from different categories. We formulate such a characteristic into a simple distance metric loss, which collaborates with the point-wise cross-entropy loss to optimize the deep neural networks. Furthermore, to fully exploit the limited annotations, distance metric learning is conducted across different training images instead of simply adopting an image-dependent manner. We conduct extensive experiments on two challenging scene parsing benchmarks of PASCALContext and ADE 20K to validate the effectiveness of our PDML, and competitive mIoU scores are achieved.


Sign in / Sign up

Export Citation Format

Share Document