Confident Semantic Ranking Loss for Part Parsing

In this paper, aiming at two key problems of instance-level image retrieval, i.e., the distinctiveness of image representation and the generalization ability of the model, we propose a novel deep architecture - Multiple Saliency and Channel Sensitivity Network(MSCNet). Specifically, to obtain distinctive global descriptors, an attention-based multiple saliency learning is first presented to highlight important details of the image, and then a simple but effective channel sensitivity module based on Gram matrix is designed to boost the channel discrimination and suppress redundant information. Additionally, in contrast to most existing feature aggregation methods, employing pre-trained deep networks, MSCNet can be trained in two modes: the first one is an unsupervised manner with an instance loss, and another is a supervised manner, which combines classification and ranking loss and only relies on very limited training data. Experimental results on several public benchmark datasets, i.e., Oxford buildings, Paris buildings and Holidays, indicate that the proposed MSCNet outperforms the state-of-the-art unsupervised and supervised methods.

Download Full-text

Multimodal Learning with Triplet Ranking Loss for Visual Semantic Embedding Learning

Knowledge Science, Engineering and Management - Lecture Notes in Computer Science ◽

10.1007/978-3-030-29551-6_67 ◽

2019 ◽

pp. 763-773

Author(s):

Zhanbo Yang ◽

Li Li ◽

Jun He ◽

Zixi Wei ◽

Li Liu ◽

...

Keyword(s):

Multimodal Learning ◽

Semantic Embedding ◽

Ranking Loss

Download Full-text

Centralized Ranking Loss with Weakly Supervised Localization for Fine-Grained Object Retrieval

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/171 ◽

2018 ◽

Cited By ~ 9

Author(s):

Xiawu Zheng ◽

Rongrong Ji ◽

Xiaoshuai Sun ◽

Yongjian Wu ◽

Feiyue Huang ◽

...

Keyword(s):

State Of The Art ◽

Feature Learning ◽

Target Object ◽

Object Retrieval ◽

Unified Framework ◽

Fine Grained ◽

Discriminative Feature ◽

Triplet Loss ◽

Weakly Supervised ◽

Ranking Loss

Fine-grained object retrieval has attracted extensive research focus recently. Its state-of-the-art schemesare typically based upon convolutional neural network (CNN) features. Despite the extensive progress, two issues remain open. On one hand, the deep features are coarsely extracted at image level rather than precisely at object level, which are interrupted by background clutters. On the other hand, training CNN features with a standard triplet loss is time consuming and incapable to learn discriminative features. In this paper, we present a novel fine-grained object retrieval scheme that conquers these issues in a unified framework. Firstly, we introduce a novel centralized ranking loss (CRL), which achieves a very efficient (1,000times training speedup comparing to the triplet loss) and discriminative feature learning by a ?centralized? global pooling. Secondly, a weakly supervised attractive feature extraction is proposed, which segments object contours with top-down saliency. Consequently, the contours are integrated into the CNN response map to precisely extract features ?within? the target object. Interestingly, we have discovered that the combination of CRL and weakly supervised learning can reinforce each other. We evaluate the performance ofthe proposed scheme on widely-used benchmarks including CUB200-2011 and CARS196. We havereported significant gains over the state-of-the-art schemes, e.g., 5.4% over SCDA [Wei et al., 2017]on CARS196, and 3.7% on CUB200-2011.

Download Full-text

Matching anticancer compounds and tumor cell lines by neural networks with ranking loss

NAR Genomics and Bioinformatics ◽

10.1093/nargab/lqab128 ◽

2022 ◽

Vol 4 (1) ◽

Author(s):

Paul Prasse ◽

Pascal Iversen ◽

Matthias Lienhard ◽

Kristina Thedinga ◽

Chris Bauer ◽

...

Keyword(s):

Tumor Cell ◽

Cell Line ◽

Cancer Cell ◽

Drug Sensitivity ◽

Inhibitory Concentration ◽

Cancer Cell Line ◽

Tumor Cell Line ◽

Ranking Problem ◽

Ranking Loss

ABSTRACT Computational drug sensitivity models have the potential to improve therapeutic outcomes by identifying targeted drug components that are likely to achieve the highest efficacy for a cancer cell line at hand at a therapeutic dose. State of the art drug sensitivity models use regression techniques to predict the inhibitory concentration of a drug for a tumor cell line. This regression objective is not directly aligned with either of these principal goals of drug sensitivity models: We argue that drug sensitivity modeling should be seen as a ranking problem with an optimization criterion that quantifies a drug’s inhibitory capacity for the cancer cell line at hand relative to its toxicity for healthy cells. We derive an extension to the well-established drug sensitivity regression model PaccMann that employs a ranking loss and focuses on the ratio of inhibitory concentration and therapeutic dosage range. We find that the ranking extension significantly enhances the model’s capability to identify the most effective anticancer drugs for unseen tumor cell profiles based in on in-vitro data.

Download Full-text

Feature Re-Learning for Video Recommendation

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.35350 ◽

2021 ◽

Vol 9 (VI) ◽

pp. 3143-3149

Author(s):

Chanjal C

Keyword(s):

Data Augmentation ◽

Video Retrieval ◽

Poor Performance ◽

Training Data ◽

Video Annotation ◽

Video Copy Detection ◽

Copy Detection ◽

Augmentation Strategy ◽

Ranking Loss ◽

Video Recommendation

Predicting the relevance between two given videos with respect to their visual content is a key component for content-based video recommendation and retrieval. The application is in video recommendation, video annotation, Category or near-duplicate video retrieval, video copy detection and so on. In order to estimate video relevance previous works utilize textual content of videos and lead to poor performance. The proposed method is feature re-learning for video relevance prediction. This work focus on the visual contents to predict the relevance between two videos. A given feature is projected into a new space by an affine transformation. Different from previous works this use a standard triplet ranking loss that optimize the projection process by a novel negative-enhanced triplet ranking loss. In order to generate more training data, propose a data augmentation strategy which works directly on video features. The multi-level augmentation strategy works for video features, which benefits the feature relearning. The proposed augmentation strategy can be flexibly used for frame-level or video-level features. The loss function that consider the absolute similarity of positive pairs and supervise the feature re-learning process and a new formula for video relevance computation.

Download Full-text