The Multi-task Fully Convolutional Siamese Network with Correlation Filter Layer for Real-Time Visual Tracking

Author(s): Shiyu Xuan, Shengyang Li, Zifei Zhao, Mingfei Han
2019, Vol 55 (13), pp. 742-745

Author(s): Kang Yang, Huihui Song, Kaihua Zhang, Jiaqing Fan

Sensors, 2019, Vol 19 (22), pp. 4904

Author(s): Yeongbin Kim, Joongchol Shin, Hasil Park, Joonki Paik

Online training frameworks based on discriminative correlation filters for visual tracking have recently shown significant improvement in both accuracy and speed. However, correlation filter-based discriminative approaches share a common problem: tracking performance degrades when the local structure of a target is distorted by the boundary effect. This shape distortion is mainly caused by the circulant structure implied by Fourier-domain processing, which makes the correlation filter learn from distorted training samples. In this paper, we present a structure–attention network to preserve the target structure against the distortion caused by the boundary effect. More specifically, we adopt a variational auto-encoder as the structure–attention network to generate diverse and representative target structures. We also propose two denoising criteria using a novel reconstruction loss for the variational auto-encoding framework to capture more robust structures even under the boundary condition. Through the proposed structure–attention framework, discriminative correlation filters can learn robust structure information about targets during online training, with enhanced discriminative performance and adaptability. Experimental results on major visual tracking benchmark datasets show that the proposed method produces better or comparable performance compared with state-of-the-art tracking methods, at a real-time processing speed of more than 80 frames per second.
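The boundary effect discussed in the abstract above stems from the circulant training structure of correlation filters: learning the filter in the Fourier domain implicitly treats all cyclic shifts of the patch as training samples. The sketch below is a minimal single-channel illustration of the MOSSE/KCF-style closed form; all function names are ours, not the paper's code.

```python
import numpy as np

def gaussian_label(size, sigma=2.0):
    # Desired response: a Gaussian peak centred on the patch,
    # shifted so the peak sits at index (0, 0) to match circular correlation.
    h, w = size
    ys, xs = np.mgrid[0:h, 0:w]
    g = np.exp(-((ys - h // 2) ** 2 + (xs - w // 2) ** 2) / (2 * sigma ** 2))
    return np.fft.fftshift(g)

def train_filter(patch, label, lam=1e-2):
    # Closed-form ridge regression in the Fourier domain. The circulant
    # structure means every cyclic shift of `patch` acts as a training
    # sample -- the source of the boundary effect described above.
    X = np.fft.fft2(patch)
    Y = np.fft.fft2(label)
    return (np.conj(X) * Y) / (np.conj(X) * X + lam)

def detect(filt, patch):
    # Correlation response; the argmax gives the estimated translation.
    return np.real(np.fft.ifft2(filt * np.fft.fft2(patch)))

rng = np.random.default_rng(0)
patch = rng.standard_normal((64, 64))
f = train_filter(patch, gaussian_label((64, 64)))
resp = detect(f, patch)
# On the training patch itself, the response peaks at zero shift.
print(np.unravel_index(resp.argmax(), resp.shape))  # (0, 0)
```

The structure–attention network proposed in the paper operates on top of this basic pipeline; the sketch only shows the baseline filter whose training samples get distorted.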


Author(s): Yunhua Zhang, Lijun Wang, Jinqing Qi, Dong Wang, Mengyang Feng, ...

2020, Vol 103, pp. 104002

Author(s): Mu Zhu, Hui Zhang, Jing Zhang, Li Zhuo

Author(s): Lei Pu, Xinxi Feng, Zhiqiang Hou, Wangsheng Yu, Yufei Zha

Author(s): Xiaoliang Wang, Marie O'Brien, Changle Xiang, Bin Xu, Homayoun Najjaran

Author(s): Lei Pu, Xinxi Feng, Zhiqiang Hou, Wangsheng Yu, Yufei Zha, ...

2019, Vol 32 (18), pp. 14335-14346

Author(s): Kang Yang, Huihui Song, Kaihua Zhang, Qingshan Liu

2020, Vol 2020, pp. 1-19

Author(s): Chenpu Li, Qianjian Xing, Zhenguo Ma, Ke Zang

With the development of deep learning, trackers based on convolutional neural networks (CNNs) have made significant achievements in visual tracking over the years. The fully convolutional Siamese network (SiamFC) is a typical representative of these trackers. SiamFC designs a two-branch CNN architecture and models visual tracking as a general similarity-learning problem. However, the feature maps it uses for tracking come only from the last layer of the CNN. These features contain high-level semantic information but lack sufficiently detailed texture information, so the SiamFC tracker tends to drift when other same-category objects appear or when the contrast between the target and the background is very low. To address this problem, we design a novel tracking algorithm that combines a correlation filter tracker and the SiamFC tracker in one framework. In this framework, the correlation filter tracker uses Histogram of Oriented Gradients (HOG) and color name (CN) features to guide the SiamFC tracker. The framework also contains an evaluation criterion, which we design to assess the tracking results of the two trackers. If this criterion finds that the SiamFC tracker has failed, our framework uses the tracking result from the correlation filter tracker to correct SiamFC. In this way, the HOG and CN features remedy the defects of SiamFC's high-level semantic features. Our algorithm thus provides a framework that combines two trackers and makes them complement each other in visual tracking. To the best of our knowledge, it is also the first to design an evaluation criterion that uses a correlation filter and zero padding to evaluate the tracking result. Comprehensive experiments are conducted on the Online Tracking Benchmark (OTB), Temple Color (TC128), Benchmark for UAV Tracking (UAV-123), and Visual Object Tracking (VOT) benchmarks.
The results show that our algorithm achieves competitive performance compared with the baseline tracker and several other state-of-the-art trackers.
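The similarity-learning step at the heart of SiamFC can be illustrated with a plain cross-correlation: a template feature map is slid over a larger search-region feature map, and the peak of the response locates the target. The sketch below substitutes raw values for CNN features and uses our own hypothetical names, so it is only a schematic of the idea, not the paper's implementation.

```python
import numpy as np

def cross_correlate(template, search):
    # Valid-mode 2-D cross-correlation: slide the template over the
    # search region and record the inner product at each offset.
    th, tw = template.shape
    sh, sw = search.shape
    out = np.empty((sh - th + 1, sw - tw + 1))
    for y in range(out.shape[0]):
        for x in range(out.shape[1]):
            out[y, x] = np.sum(template * search[y:y + th, x:x + tw])
    return out

rng = np.random.default_rng(1)
search = rng.standard_normal((32, 32))        # stand-in for a search-region feature map
template = search[10:18, 12:20].copy()        # "target" appearance cut from it
resp = cross_correlate(template, search)
# The response peaks where the template matches the search region.
print(np.unravel_index(resp.argmax(), resp.shape))  # (10, 12)
```

In SiamFC both inputs would first pass through the shared CNN branches; the framework proposed here additionally runs a HOG/CN correlation filter tracker in parallel and falls back to it when the evaluation criterion flags a SiamFC failure.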


2019, Vol 127, pp. 138-145

Author(s): Lei Qu, Kuixiang Liu, Baochen Yao, Jun Tang, Wei Zhang
