Long-Term Visual Object Tracking Benchmark

The Discriminative Correlation Filter (DCF) has been universally recognized in visual object tracking, thanks to its excellent accuracy and high speed. Nevertheless, these DCF-based trackers perform poorly in long-term tracking. The reasons include the following aspects—first, they have low adaptability to significant appearance changes in long-term tracking and are prone to tracking failure; second, these trackers lack a practical re-detection module to find the target again after tracking failure. In our work, we propose a new long-term tracking strategy to solve these issues. First, we make the best of the static and dynamic information of the target by introducing the motion features to our long-term tracker and obtain a more robust tracker. Second, we introduce a low-rank sparse dictionary learning method for re-detection. This re-detection module can exploit a correlation among these training samples and alleviate the impact of occlusion and noise. Third, we propose a new reliability evaluation method to model an adaptive update, which can switch expediently between the tracking module and the re-detection module. Massive experiments demonstrate that our proposed approach has an obvious improvement in precision and success rate over these state-of-the-art trackers.

Download Full-text

Re-identification framework for long term visual object tracking based on object detection and classification

Signal Processing Image Communication ◽

10.1016/j.image.2020.115969 ◽

2020 ◽

Vol 88 ◽

pp. 115969 ◽

Cited By ~ 1

Author(s):

Paraskevi Nousi ◽

Danai Triantafyllidou ◽

Anastasios Tefas ◽

Ioannis Pitas

Keyword(s):

Object Detection ◽

Object Tracking ◽

Visual Object ◽

Visual Object Tracking

Download Full-text

Re2EMA: Regularized and Reinitialized Exponential Moving Average for Target Model Update in Object Tracking

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33018457 ◽

2019 ◽

Vol 33 ◽

pp. 8457-8464

Author(s):

Jianglei Huang ◽

Wengang Zhou

Keyword(s):

Object Tracking ◽

Moving Average ◽

Transformation Matrix ◽

Visual Object ◽

Optimal Model ◽

Visual Object Tracking ◽

Target Model ◽

Model Update ◽

Optimal Target

Target model update plays an important role in visual object tracking. However, performing optimal model update is challenging. In this work, we propose to achieve an optimal target model by learning a transformation matrix from the last target model to the newly generated one, which results into a minimization objective. In this objective, there exists two challenges. The first is that the newly generated target model is unreliable. To overcome this problem, we propose to impose a penalty to limit the distance between the learned target model and the last one. The second is that as time evolves, we can not decide whether the last target model has been corrupted or not. To get out of this dilemma, we propose a reinitialization term. Besides, to control the complexity of the transformation matrix, we also add a regularizer. We find that the optimization formula’s solution, with some simplifications, degenerates to EMA. Finally, despite the simplicity, extensive experiments conducted on several commonly used benchmarks demonstrate the effectiveness of our proposed approach in relatively long term scenarios.

Download Full-text

Improved Hierarchical Convolutional Features for Robust Visual Object Tracking

Complexity ◽

10.1155/2021/6690237 ◽

2021 ◽

Vol 2021 ◽

pp. 1-16

Author(s):

Jinping Sun

Keyword(s):

Object Tracking ◽

Target Position ◽

Feature Representation ◽

Correlation Filter ◽

Low Rank ◽

Visual Object ◽

Threshold Condition ◽

Current Frame ◽

Visual Object Tracking

The target and background will change continuously in the long-term tracking process, which brings great challenges to the accurate prediction of targets. The correlation filter algorithm based on manual features is difficult to meet the actual needs due to its limited feature representation ability. Thus, to improve the tracking performance and robustness, an improved hierarchical convolutional features model is proposed into a correlation filter framework for visual object tracking. First, the objective function is designed by lasso regression modeling, and a sparse, time-series low-rank filter is learned to increase the interpretability of the model. Second, the features of the last layer and the second pool layer of the convolutional neural network are extracted to realize the target position prediction from coarse to fine. In addition, using the filters learned from the first frame and the current frame to calculate the response maps, respectively, the target position is obtained by finding the maximum response value in the response map. The filter model is updated only when these two maximum responses meet the threshold condition. The proposed tracker is evaluated by simulation analysis on TC-128/OTB2015 benchmarks including more than 100 video sequences. Extensive experiments demonstrate that the proposed tracker achieves competitive performance against state-of-the-art trackers. The distance precision rate and overlap success rate of the proposed algorithm on OTB2015 are 0.829 and 0.695, respectively. The proposed algorithm effectively solves the long-term object tracking problem in complex scenes.

Download Full-text

Siamese networks with distractor-reduction method for long-term visual object tracking

Pattern Recognition ◽

10.1016/j.patcog.2020.107698 ◽

2020 ◽

pp. 107698

Author(s):

Shiyu Xuan ◽

Shengyang Li ◽

Zifei Zhao ◽

Longxuan Kou ◽

Zhuang Zhou ◽

...

Keyword(s):

Object Tracking ◽

Reduction Method ◽

Visual Object ◽

Visual Object Tracking ◽

Siamese Networks

Download Full-text

Long-Term Visual Object Tracking via Continual Learning

IEEE Access ◽

10.1109/access.2019.2960321 ◽

2019 ◽

Vol 7 ◽

pp. 182548-182558

Author(s):

Hui Zhang ◽

Mu Zhu ◽

Jing Zhang ◽

Li Zhuo

Keyword(s):

Object Tracking ◽

Visual Object ◽

Visual Object Tracking ◽

Continual Learning

Download Full-text

Visual object tracking using Fourier domain phase information

Signal Image and Video Processing ◽

10.1007/s11760-021-01968-5 ◽

2021 ◽

Author(s):

Serdar Cakir ◽

A. Enis Cetin

Keyword(s):

Object Tracking ◽

Visual Object ◽

Fourier Domain ◽

Phase Information ◽

Visual Object Tracking ◽

Domain Phase

Download Full-text

Adaptive Channel Selection for Robust Visual Object Tracking with Discriminative Correlation Filters

International Journal of Computer Vision ◽

10.1007/s11263-021-01435-1 ◽

2021 ◽

Author(s):

Tianyang Xu ◽

Zhenhua Feng ◽

Xiao-Jun Wu ◽

Josef Kittler

Keyword(s):

Object Tracking ◽

Augmented Lagrangian Method ◽

Channel Selection ◽

Image Feature ◽

Superior Performance ◽

Appearance Model ◽

Visual Object ◽

Correlation Filters ◽

Visual Object Tracking ◽

Feature Representations

AbstractDiscriminative Correlation Filters (DCF) have been shown to achieve impressive performance in visual object tracking. However, existing DCF-based trackers rely heavily on learning regularised appearance models from invariant image feature representations. To further improve the performance of DCF in accuracy and provide a parsimonious model from the attribute perspective, we propose to gauge the relevance of multi-channel features for the purpose of channel selection. This is achieved by assessing the information conveyed by the features of each channel as a group, using an adaptive group elastic net inducing independent sparsity and temporal smoothness on the DCF solution. The robustness and stability of the learned appearance model are significantly enhanced by the proposed method as the process of channel selection performs implicit spatial regularisation. We use the augmented Lagrangian method to optimise the discriminative filters efficiently. The experimental results obtained on a number of well-known benchmarking datasets demonstrate the effectiveness and stability of the proposed method. A superior performance over the state-of-the-art trackers is achieved using less than $$10\%$$ 10 % deep feature channels.

Download Full-text

Towards Accurate Estimation for Visual Object Tracking with Multi-hierarchy Feature Aggregation

Neurocomputing ◽

10.1016/j.neucom.2021.04.075 ◽

2021 ◽

Author(s):

Jingjing Wu ◽

Jianguo Jiang ◽

Meibin Qi ◽

Xiaohong Li

Keyword(s):

Object Tracking ◽

Accurate Estimation ◽

Visual Object ◽

Visual Object Tracking ◽

Feature Aggregation

Download Full-text