Joint learning hash codes and distance metric for visual tracking

Author(s):  
Luning Liu ◽  
Huchuan Lu ◽  
Xue Mei
Author(s):  
Donglin Zhang ◽  
Xiao-Jun Wu ◽  
He-Feng Yin ◽  
Josef Kittler

2021 ◽  
Author(s):  
Mingrui Chen ◽  
Weiyu Li ◽  
weizhi lu

Recently, it has been observed that $\{0,\pm1\}$-ternary codes which are simply generated from deep features by hard thresholding, tend to outperform $\{-1, 1\}$-binary codes in image retrieval. To obtain better ternary codes, we for the first time propose to jointly learn the features with the codes by appending a smoothed function to the networks. During training, the function could evolve into a non-smoothed ternary function by a continuation method, and then generate ternary codes. The method circumvents the difficulty of directly training discrete functions and reduces the quantization errors of ternary codes. Experiments show that the proposed joint learning indeed could produce better ternary codes.


2021 ◽  
Author(s):  
Mingrui Chen ◽  
Weiyu Li ◽  
weizhi lu

Recently, it has been observed that $\{0,\pm1\}$-ternary codes which are simply generated from deep features by hard thresholding, tend to outperform $\{-1, 1\}$-binary codes in image retrieval. To obtain better ternary codes, we for the first time propose to jointly learn the features with the codes by appending a smoothed function to the networks. During training, the function could evolve into a non-smoothed ternary function by a continuation method, and then generate ternary codes. The method circumvents the difficulty of directly training discrete functions and reduces the quantization errors of ternary codes. Experiments show that the proposed joint learning indeed could produce better ternary codes.


Author(s):  
Siyuan Li ◽  
Zhi Zhang ◽  
Ziyu Liu ◽  
Anna Wang ◽  
Linglong Qiu ◽  
...  

Target localization and proposal generation are two essential subtasks in generic visual tracking, and it is a challenge to address both the two efficiently. In this paper, we propose an efficient two-stage architecture which makes full use of the complementarity of two subtasks to achieve robust localization and high-quality proposals generation of the target jointly. Specifically, our model performs a novel deformable central correlation operation by an online learning model in both two stages to locate new target centers while generating target proposals in the vicinity of these centers. The proposals are refined in the refinement stage to further improve accuracy and robustness. Moreover, the model benefits from multi-level features aggregation in a neck module and a feature enhancement module. We conduct extensive ablation studies to demonstrate the effectiveness of our proposed methods. Our tracker runs at over 30 FPS and sets a new state-of-the-art on five tracking benchmarks, including LaSOT, VOT2018, TrackingNet, GOT10k, OTB2015.


Author(s):  
Bo Liu ◽  
Meng Wang ◽  
Richang Hong ◽  
Zhengjun Zha ◽  
Xian-Sheng Hua

2019 ◽  
Vol 28 (05) ◽  
pp. 1
Author(s):  
Xiaohe Wu ◽  
Weisong Wang ◽  
Fei Yang ◽  
Hongzhi Zhang ◽  
Wangmeng Zuo

2018 ◽  
Vol 19 (1) ◽  
pp. 187-198 ◽  
Author(s):  
Shengping Zhang ◽  
Yuankai Qi ◽  
Feng Jiang ◽  
Xiangyuan Lan ◽  
Pong C. Yuen ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document