Joint learning hash codes and distance metric for visual tracking

Recently, it has been observed that $\{0,\pm1\}$-ternary codes which are simply generated from deep features by hard thresholding, tend to outperform $\{-1, 1\}$-binary codes in image retrieval. To obtain better ternary codes, we for the first time propose to jointly learn the features with the codes by appending a smoothed function to the networks. During training, the function could evolve into a non-smoothed ternary function by a continuation method, and then generate ternary codes. The method circumvents the difficulty of directly training discrete functions and reduces the quantization errors of ternary codes. Experiments show that the proposed joint learning indeed could produce better ternary codes.

Download Full-text

Deep Learning to Ternary Hash Codes by Continuation

10.36227/techrxiv.17083019 ◽

2021 ◽

Author(s):

Mingrui Chen ◽

Weiyu Li ◽

weizhi lu

Keyword(s):

Deep Learning ◽

Image Retrieval ◽

Continuation Method ◽

Binary Codes ◽

Hard Thresholding ◽

Joint Learning ◽

Ternary Codes ◽

First Time ◽

Hash Codes ◽

Discrete Functions

Recently, it has been observed that $\{0,\pm1\}$-ternary codes which are simply generated from deep features by hard thresholding, tend to outperform $\{-1, 1\}$-binary codes in image retrieval. To obtain better ternary codes, we for the first time propose to jointly learn the features with the codes by appending a smoothed function to the networks. During training, the function could evolve into a non-smoothed ternary function by a continuation method, and then generate ternary codes. The method circumvents the difficulty of directly training discrete functions and reduces the quantization errors of ternary codes. Experiments show that the proposed joint learning indeed could produce better ternary codes.

Download Full-text

Joint Learning of Distance Metric and Kernel Classifier via Multiple Kernel Learning

Communications in Computer and Information Science - Pattern Recognition ◽

10.1007/978-981-10-3002-4_48 ◽

2016 ◽

pp. 586-600

Author(s):

Weiqi Zhang ◽

Zifei Yan ◽

Hongzhi Zhang ◽

Wangmeng Zuo

Keyword(s):

Multiple Kernel Learning ◽

Kernel Learning ◽

Distance Metric ◽

Joint Learning ◽

Multiple Kernel

Download Full-text

Joint Learning of Distance Metric and Query Model for Posteriorgram-Based Keyword Search

IEEE Journal of Selected Topics in Signal Processing ◽

10.1109/jstsp.2017.2762080 ◽

2017 ◽

Vol 11 (8) ◽

pp. 1318-1328 ◽

Cited By ~ 3

Author(s):

Batuhan Gundogdu ◽

Bolaji Yusuf ◽

Murat Saraclar

Keyword(s):

Keyword Search ◽

Distance Metric ◽

Joint Learning ◽

Query Model

Download Full-text

TLPG-Tracker: Joint Learning of Target Localization and Proposal Generation for Visual Tracking

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/99 ◽

2020 ◽

Author(s):

Siyuan Li ◽

Zhi Zhang ◽

Ziyu Liu ◽

Anna Wang ◽

Linglong Qiu ◽

...

Keyword(s):

Online Learning ◽

Visual Tracking ◽

State Of The Art ◽

Target Localization ◽

High Quality ◽

Two Stage ◽

Joint Learning ◽

Improve Accuracy ◽

Multi Level ◽

Two Stages

Target localization and proposal generation are two essential subtasks in generic visual tracking, and it is a challenge to address both the two efficiently. In this paper, we propose an efficient two-stage architecture which makes full use of the complementarity of two subtasks to achieve robust localization and high-quality proposals generation of the target jointly. Specifically, our model performs a novel deformable central correlation operation by an online learning model in both two stages to locate new target centers while generating target proposals in the vicinity of these centers. The proposals are refined in the refinement stage to further improve accuracy and robustness. Moreover, the model benefits from multi-level features aggregation in a neck module and a feature enhancement module. We conduct extensive ablation studies to demonstrate the effectiveness of our proposed methods. Our tracker runs at over 30 FPS and sets a new state-of-the-art on five tracking benchmarks, including LaSOT, VOT2018, TrackingNet, GOT10k, OTB2015.

Download Full-text