Visual Tracking Based on Complementary Learners with Distractor Handling

The representation of the object is an important factor in building a robust visual object tracking algorithm. To resolve this problem, complementary learners that use color histogram- and correlation filter-based representation to represent the target object can be used since they each have advantages that can be exploited to compensate the other’s drawback in visual tracking. Further, a tracking algorithm can fail because of the distractor, even when complementary learners have been implemented for the target object representation. In this study, we show that, in order to handle the distractor, first the distractor must be detected by learning the responses from the color-histogram- and correlation-filter-based representation. Then, to determine the target location, we can decide whether the responses from each representation should be merged or only the response from the correlation filter should be used. This decision depends on the result obtained from the distractor detection process. Experiments were performed on the widely used VOT2014 and VOT2015 benchmark datasets. It was verified that our proposed method performs favorably as compared with several state-of-the-art visual tracking algorithms.

Download Full-text

A Robust Visual Tracking Algorithm Based on Spatial-Temporal Context Hierarchical Response Fusion

Algorithms ◽

10.3390/a12010008 ◽

2018 ◽

Vol 12 (1) ◽

pp. 8 ◽

Cited By ~ 2

Author(s):

Wancheng Zhang ◽

Yanmin Luo ◽

Zhi Chen ◽

Yongzhao Du ◽

Daxin Zhu ◽

...

Keyword(s):

Visual Tracking ◽

Correlation Filter ◽

Temporal Context ◽

Visual Object ◽

Correlation Filters ◽

Visual Object Tracking ◽

Illumination Changes ◽

Model Update ◽

Benchmark Datasets ◽

Hierarchical Features

Discriminative correlation filters (DCFs) have been shown to perform superiorly in visual object tracking. However, visual tracking is still challenging when the target objects undergo complex scenarios such as occlusion, deformation, scale changes and illumination changes. In this paper, we utilize the hierarchical features of convolutional neural networks (CNNs) and learn a spatial-temporal context correlation filter on convolutional layers. Then, the translation is estimated by fusing the response score of the filters on the three convolutional layers. In terms of scale estimation, we learn a discriminative correlation filter to estimate scale from the best confidence results. Furthermore, we proposed a re-detection activation discrimination method to improve the robustness of visual tracking in the case of tracking failure and an adaptive model update method to reduce tracking drift caused by noisy updates. We evaluate the proposed tracker with DCFs and deep features on OTB benchmark datasets. The tracking results demonstrated that the proposed algorithm is superior to several state-of-the-art DCF methods in terms of accuracy and robustness.

Download Full-text

Distractor-Aware Deep Regression for Visual Tracking

Sensors ◽

10.3390/s19020387 ◽

2019 ◽

Vol 19 (2) ◽

pp. 387 ◽

Cited By ~ 1

Author(s):

Ming Du ◽

Yan Ding ◽

Xiuyun Meng ◽

Hua-Liang Wei ◽

Yifan Zhao

Keyword(s):

Object Tracking ◽

Visual Tracking ◽

Test Data ◽

Loss Function ◽

State Of The Art ◽

Target Object ◽

Visual Object ◽

Visual Object Tracking ◽

Training Samples ◽

Better Than

In recent years, regression trackers have drawn increasing attention in the visual-object tracking community due to their favorable performance and easy implementation. The tracker algorithms directly learn mapping from dense samples around the target object to Gaussian-like soft labels. However, in many real applications, when applied to test data, the extreme imbalanced distribution of training samples usually hinders the robustness and accuracy of regression trackers. In this paper, we propose a novel effective distractor-aware loss function to balance this issue by highlighting the significant domain and by severely penalizing the pure background. In addition, we introduce a full differentiable hierarchy-normalized concatenation connection to exploit abstractions across multiple convolutional layers. Extensive experiments were conducted on five challenging benchmark-tracking datasets, that is, OTB-13, OTB-15, TC-128, UAV-123, and VOT17. The experimental results are promising and show that the proposed tracker performs much better than nearly all the compared state-of-the-art approaches.

Download Full-text

CFNN: Correlation Filter Neural Network for Visual Object Tracking

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/309 ◽

2017 ◽

Cited By ~ 2

Author(s):

Yang Li ◽

Zhan Xu ◽

Jianke Zhu

Keyword(s):

Neural Network ◽

Visual Tracking ◽

Network Architecture ◽

Back Propagation ◽

Correlation Filter ◽

Visual Object ◽

Neural Network Architecture ◽

Visual Object Tracking ◽

Single Target ◽

Wide Range

Albeit convolutional neural network (CNN) has shown promising capacity in many computer vision tasks, applying it to visual tracking is yet far from solved. Existing methods either employ a large external dataset to undertake exhaustive pre-training or suffer from less satisfactory results in terms of accuracy and robustness. To track single target in a wide range of videos, we present a novel Correlation Filter Neural Network architecture, as well as a complete visual tracking pipeline, The proposed approach is a special case of CNN, whose initialization does not need any pre-training on the external dataset. The initialization of network enjoys the merits of cyclic sampling to achieve the appealing discriminative capability, while the network updating scheme adopts advantages from back-propagation in order to capture new appearance variations. The tracking pipeline integrates both aspects well by making them complementary to each other. We validate our tracker on OTB-2013 benchmark. The proposed tracker obtains the promising results compared to most of existing representative trackers.

Download Full-text

Robust Scale Adaptive Visual Tracking with Correlation Filters

Applied Sciences ◽

10.3390/app8112037 ◽

2018 ◽

Vol 8 (11) ◽

pp. 2037 ◽

Cited By ~ 1

Author(s):

Chunbao Li ◽

Bo Yang

Keyword(s):

Visual Tracking ◽

State Of The Art ◽

Estimation Method ◽

Color Naming ◽

Target Object ◽

Correlation Filter ◽

Correlation Filters ◽

Object Proposals ◽

Benchmark Datasets ◽

Candidate Object

Visual tracking is a challenging task in computer vision due to various appearance changes of the target object. In recent years, correlation filter plays an important role in visual tracking and many state-of-the-art correlation filter based trackers are proposed in the literature. However, these trackers still have certain limitations. Most of existing trackers cannot well deal with scale variation, and they may easily drift to the background in the case of occlusion. To overcome the above problems, we propose a Correlation Filters based Scale Adaptive (CFSA) visual tracker. In the tracker, a modified EdgeBoxes generator, is proposed to generate high-quality candidate object proposals for tracking. The pool of generated candidate object proposals is adopted to estimate the position of the target object using a kernelized correlation filter based tracker with HOG and color naming features. In order to deal with changes in target scale, a scale estimation method is proposed by combining the water flow driven MBD (minimum barrier distance) algorithm with the estimated position. Furthermore, an online updating schema is adopted to reduce the interference of the surrounding background. Experimental results on two large benchmark datasets demonstrate that the CFSA tracker achieves favorable performance compared with the state-of-the-art trackers.

Download Full-text

High speed long-term visual object tracking algorithm for real robot systems

Neurocomputing ◽

10.1016/j.neucom.2020.12.113 ◽

2021 ◽

Vol 434 ◽

pp. 268-284

Author(s):

Muxi Jiang ◽

Rui Li ◽

Qisheng Liu ◽

Yingjing Shi ◽

Esteban Tlelo-Cuautle

Keyword(s):

Object Tracking ◽

High Speed ◽

Tracking Algorithm ◽

Visual Object ◽

Visual Object Tracking ◽

Robot Systems ◽

Real Robot

Download Full-text

Multipath Based Correlation Filter for Visual Object Tracking

Lecture Notes in Computer Science - Pattern Recognition and Machine Intelligence ◽

10.1007/978-3-030-34872-4_54 ◽

2019 ◽

pp. 490-498

Author(s):

Himadri Sekhar Bhunia ◽

Alok Kanti Deb ◽

Jayanta Mukhopadhyay

Keyword(s):

Object Tracking ◽

Correlation Filter ◽

Visual Object ◽

Visual Object Tracking

Download Full-text

Correlation Filter with Deep Feature for Visual Object Tracking

10.1007/978-981-16-6242-3_4 ◽

2021 ◽

pp. 85-127

Author(s):

Weiwei Xing ◽

Weibin Liu ◽

Jun Wang ◽

Shunli Zhang ◽

Lihui Wang ◽

...

Keyword(s):

Object Tracking ◽

Correlation Filter ◽

Visual Object ◽

Visual Object Tracking ◽

Deep Feature

Download Full-text

Soft Mask Correlation Filter for Visual Object Tracking

2018 25th IEEE International Conference on Image Processing (ICIP) ◽

10.1109/icip.2018.8451607 ◽

2018 ◽

Author(s):

Yang Huo ◽

Yuehuan Wang ◽

Xiaoyun Yan ◽

Kaiheng Dai

Keyword(s):

Object Tracking ◽

Correlation Filter ◽

Visual Object ◽

Visual Object Tracking

Download Full-text

Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking

Sensors ◽

10.3390/s20144021 ◽

2020 ◽

Vol 20 (14) ◽

pp. 4021 ◽

Cited By ~ 2

Author(s):

Mustansar Fiaz ◽

Arif Mahmood ◽

Soon Ki Jung

Keyword(s):

Object Tracking ◽

Spatial Attention ◽

Feature Fusion ◽

State Of The Art ◽

Feature Representation ◽

Visual Object ◽

Target Feature ◽

Visual Object Tracking ◽

Low Level ◽

Benchmark Datasets

We propose to improve the visual object tracking by introducing a soft mask based low-level feature fusion technique. The proposed technique is further strengthened by integrating channel and spatial attention mechanisms. The proposed approach is integrated within a Siamese framework to demonstrate its effectiveness for visual object tracking. The proposed soft mask is used to give more importance to the target regions as compared to the other regions to enable effective target feature representation and to increase discriminative power. The low-level feature fusion improves the tracker robustness against distractors. The channel attention is used to identify more discriminative channels for better target representation. The spatial attention complements the soft mask based approach to better localize the target objects in challenging tracking scenarios. We evaluated our proposed approach over five publicly available benchmark datasets and performed extensive comparisons with 39 state-of-the-art tracking algorithms. The proposed tracker demonstrates excellent performance compared to the existing state-of-the-art trackers.

Download Full-text