Large Scale Landmark Recognition via Deep Metric Learning

(1) Background: The segmentation of cell nuclei is an essential task in a wide range of biomedical studies and clinical practices. The full automation of this process remains a challenge due to intra- and internuclear variations across a wide range of tissue morphologies, differences in staining protocols and imaging procedures. (2) Methods: A deep learning model with metric embeddings such as contrastive loss and triplet loss with semi-hard negative mining is proposed in order to accurately segment cell nuclei in a diverse set of microscopy images. The effectiveness of the proposed model was tested on a large-scale multi-tissue collection of microscopy image sets. (3) Results: The use of deep metric learning increased the overall segmentation prediction by 3.12% in the average value of Dice similarity coefficients as compared to no metric learning. In particular, the largest gain was observed for segmenting cell nuclei in H&E -stained images when deep learning network and triplet loss with semi-hard negative mining were considered for the task. (4) Conclusion: We conclude that deep metric learning gives an additional boost to the overall learning process and consequently improves the segmentation performance. Notably, the improvement ranges approximately between 0.13% and 22.31% for different types of images in the terms of Dice coefficients when compared to no metric deep learning.

Download Full-text

Large-scale singer recognition using deep metric learning: an experimental study

10.1109/ijcnn52387.2021.9533911 ◽

2021 ◽

Author(s):

Shichao Hu ◽

Beici Liang ◽

Zhouxuan Chen ◽

Xiao Lu ◽

Ethan Zhao ◽

...

Keyword(s):

Experimental Study ◽

Large Scale ◽

Metric Learning ◽

Deep Metric Learning

Download Full-text

Optimal Feature Transport for Cross-View Image Geo-Localization

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.6875 ◽

2020 ◽

Vol 34 (07) ◽

pp. 11990-11997 ◽

Cited By ~ 1

Author(s):

Yujiao Shi ◽

Xin Yu ◽

Liu Liu ◽

Tong Zhang ◽

Hongdong Li

Keyword(s):

Large Scale ◽

State Of The Art ◽

Metric Learning ◽

Satellite Image ◽

Geographic Location ◽

Ground Level ◽

Aerial Images ◽

Query Image ◽

Aerial View ◽

Deep Metric Learning

This paper addresses the problem of cross-view image geo-localization, where the geographic location of a ground-level street-view query image is estimated by matching it against a large scale aerial map (e.g., a high-resolution satellite image). State-of-the-art deep-learning based methods tackle this problem as deep metric learning which aims to learn global feature representations of the scene seen by the two different views. Despite promising results are obtained by such deep metric learning methods, they, however, fail to exploit a crucial cue relevant for localization, namely, the spatial layout of local features. Moreover, little attention is paid to the obvious domain gap (between aerial view and ground view) in the context of cross-view localization. This paper proposes a novel Cross-View Feature Transport (CVFT) technique to explicitly establish cross-view domain transfer that facilitates feature alignment between ground and aerial images. Specifically, we implement the CVFT as network layers, which transports features from one domain to the other, leading to more meaningful feature similarity comparison. Our model is differentiable and can be learned end-to-end. Experiments on large-scale datasets have demonstrated that our method has remarkably boosted the state-of-the-art cross-view localization performance, e.g., on the CVUSA dataset, with significant improvements for top-1 recall from 40.79% to 61.43%, and for top-10 from 76.36% to 90.49%. We expect the key insight of the paper (i.e., explicitly handling domain difference via domain transport) will prove to be useful for other similar problems in computer vision as well.

Download Full-text

High-Rankness Regularized Semi-Supervised Deep Metric Learning for Remote Sensing Imagery

Remote Sensing ◽

10.3390/rs12162603 ◽

2020 ◽

Vol 12 (16) ◽

pp. 2603

Author(s):

Jian Kang ◽

Rubén Fernández-Beltrán ◽

Zhen Ye ◽

Xiaohua Tong ◽

Pedram Ghamisi ◽

...

Keyword(s):

Remote Sensing ◽

Large Scale ◽

Semantic Information ◽

Metric Spaces ◽

Metric Learning ◽

Training Data ◽

Deep Metric Learning ◽

Class Labels ◽

Image Pairs ◽

Decision Boundaries

Deep metric learning has recently received special attention in the field of remote sensing (RS) scene characterization, owing to its prominent capabilities for modeling distances among RS images based on their semantic information. Most of the existing deep metric learning methods exploit pairwise and triplet losses to learn the feature embeddings with the preservation of semantic-similarity, which requires the construction of image pairs and triplets based on the supervised information (e.g., class labels). However, generating such semantic annotations becomes a completely unaffordable task in large-scale RS archives, which may eventually constrain the availability of sufficient training data for this kind of models. To address this issue, we reformulate the deep metric learning scheme in a semi-supervised manner to effectively characterize RS scenes. Specifically, we aim at learning metric spaces by utilizing the supervised information from a small number of labeled RS images and exploring the potential decision boundaries for massive sets of unlabeled aerial scenes. In order to reach this goal, a joint loss function, composed of a normalized softmax loss with margin and a high-rankness regularization term, is proposed, as well as its corresponding optimization algorithm. The conducted experiments (including different state-of-the-art methods and two benchmark RS archives) validate the effectiveness of the proposed approach for RS image classification, clustering and retrieval tasks. The codes of this paper are publicly available.

Download Full-text

Dysarthric Speech Recognition Based on Deep Metric Learning

10.21437/interspeech.2020-2267 ◽

2020 ◽

Author(s):

Yuki Takashima ◽

Ryoichi Takashima ◽

Tetsuya Takiguchi ◽

Yasuo Ariki

Keyword(s):

Speech Recognition ◽

Metric Learning ◽

Deep Metric Learning ◽

Dysarthric Speech

Download Full-text

Deep Metric Learning-based Image Retrieval System for Chest Radiograph and its Clinical Applications in COVID-19

Medical Image Analysis ◽

10.1016/j.media.2021.101993 ◽

2021 ◽

pp. 101993

Author(s):

Aoxiao Zhong ◽

Xiang Li ◽

Dufan Wu ◽

Hui Ren ◽

Kyungsang Kim ◽

...

Keyword(s):

Image Retrieval ◽

Chest Radiograph ◽

Retrieval System ◽

Metric Learning ◽

Clinical Applications ◽

Image Retrieval System ◽

Deep Metric Learning

Download Full-text

A Ranked Similarity Loss Function with pair Weighting for Deep Metric Learning

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp39728.2021.9414668 ◽

2021 ◽

Author(s):

Jian Wang ◽

Zhichao Zhang ◽

Dongmei Huang ◽

Wei Song ◽

Quanmiao Wei ◽

...

Keyword(s):

Loss Function ◽

Metric Learning ◽

Deep Metric Learning

Download Full-text

Ranked List Loss for Deep Metric Learning

IEEE Transactions on Pattern Analysis and Machine Intelligence ◽

10.1109/tpami.2021.3068449 ◽

2021 ◽

pp. 1-1

Author(s):

Xinshao Wang ◽

Yang Hua ◽

Elyor Kodirov ◽

Neil M Robertson

Keyword(s):

Metric Learning ◽

Deep Metric Learning ◽

Ranked List

Download Full-text

Predicting TCR-Epitope Binding Specificity Using Deep Metric Learning and Multimodal Learning

Genes ◽

10.3390/genes12040572 ◽

2021 ◽

Vol 12 (4) ◽

pp. 572

Author(s):

Alan M. Luu ◽

Jacob R. Leistico ◽

Tim Miller ◽

Somang Kim ◽

Jun S. Song

Keyword(s):

Neural Network ◽

Amino Acid ◽

Cytotoxic T Cells ◽

Metric Learning ◽

Binding Specificity ◽

Class I ◽

Multimodal Learning ◽

Binding Prediction ◽

Deep Metric Learning ◽

Epitope Binding

Understanding the recognition of specific epitopes by cytotoxic T cells is a central problem in immunology. Although predicting binding between peptides and the class I Major Histocompatibility Complex (MHC) has had success, predicting interactions between T cell receptors (TCRs) and MHC class I-peptide complexes (pMHC) remains elusive. This paper utilizes a convolutional neural network model employing deep metric learning and multimodal learning to perform two critical tasks in TCR-epitope binding prediction: identifying the TCRs that bind a given epitope from a TCR repertoire, and identifying the binding epitope of a given TCR from a list of candidate epitopes. Our model can perform both tasks simultaneously and reveals that inconsistent preprocessing of TCR sequences can confound binding prediction. Applying a neural network interpretation method identifies key amino acid sequence patterns and positions within the TCR, important for binding specificity. Contrary to common assumption, known crystal structures of TCR-pMHC complexes show that the predicted salient amino acid positions are not necessarily the closest to the epitopes, implying that physical proximity may not be a good proxy for importance in determining TCR-epitope specificity. Our work thus provides an insight into the learned predictive features of TCR-epitope binding specificity and advances the associated classification tasks.

Download Full-text