Single Object Tracking in Satellite Videos: Deep Siamese Network Incorporating an Interframe Difference Centroid Inertia Motion Model

Single object tracking in satellite videos has attracted wide attention. Advances in remote sensing platforms for earth observation make it increasingly convenient to acquire high-resolution satellite videos, which greatly facilitates ground target tracking. However, very large images containing small objects, high similarity among multiple moving targets, and poor distinguishability between objects and the background make this task highly challenging. To address these problems, this paper proposes a deep Siamese network (DSN) incorporating an interframe difference centroid inertia motion (ID-CIM) model. For object tracking, the DSN comprises a template branch and a search branch; it extracts features from both branches and employs a Siamese region proposal network to locate the target in the search branch. The ID-CIM mechanism is proposed to alleviate model drift. Together, these two modules form the ID-DSN framework and mutually reinforce the final tracking results. In addition, we adapted existing object detection datasets for remotely sensed images to generate training datasets suitable for satellite video single object tracking. Ablation experiments were performed on six high-resolution satellite videos acquired from the International Space Station and “Jilin-1” satellites. We compared the proposed ID-DSN with 11 other state-of-the-art trackers, covering different networks and backbones. The comparison results show that ID-DSN achieved a precision criterion of 0.927 and a success criterion of 0.694 at 32.117 frames per second (FPS) on a single NVIDIA GTX1070Ti GPU.
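
The abstract does not spell out how ID-CIM is computed, so the following is only a minimal sketch of the general idea its name suggests: take the centroid of pixels that changed between two frames, then smooth the motion estimate with an inertia term. The function names, the `threshold` and `inertia` parameters, and the blending rule are all assumptions made for illustration, not the authors' formulation.

```python
def difference_centroid(prev_frame, curr_frame, threshold=20):
    """Centroid (row, col) of pixels whose absolute interframe difference
    exceeds `threshold`; returns None when no pixel changed enough.
    Frames are 2D lists of grayscale values (hypothetical input format)."""
    count = row_sum = col_sum = 0
    for r, (prev_row, curr_row) in enumerate(zip(prev_frame, curr_frame)):
        for c, (p, q) in enumerate(zip(prev_row, curr_row)):
            if abs(q - p) > threshold:
                count += 1
                row_sum += r
                col_sum += c
    if count == 0:
        return None
    return (row_sum / count, col_sum / count)


def inertia_predict(prev_pos, prev_velocity, centroid, inertia=0.5):
    """Blend the previous velocity with the displacement toward the
    difference centroid (assumed rule); fall back to pure inertia when
    no centroid is available. Returns (new_position, new_velocity)."""
    if centroid is None:
        velocity = prev_velocity
    else:
        observed = (centroid[0] - prev_pos[0], centroid[1] - prev_pos[1])
        velocity = (
            inertia * prev_velocity[0] + (1.0 - inertia) * observed[0],
            inertia * prev_velocity[1] + (1.0 - inertia) * observed[1],
        )
    new_pos = (prev_pos[0] + velocity[0], prev_pos[1] + velocity[1])
    return new_pos, velocity
```

In a tracker of this kind, such a motion estimate would typically serve as a sanity check on the network's predicted position, for example when the Siamese response drifts toward a similar-looking neighbor.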

Research of single object tracking method based on Siamese Network and Level Set

2020 The 4th International Conference on Video and Image Processing, 2020. DOI: 10.1145/3447450.3447477

Author(s): Tianbo Liu, Li Su, Shuai Yuan, Gong Cheng, Feng Zhang

Keyword(s): Object Tracking, Level Set, Single Object, Tracking Method, Siamese Network

F-Siamese Tracker: A Frustum-based Double Siamese Network for 3D Single Object Tracking

2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020. DOI: 10.1109/iros45743.2020.9341120

Author(s): Hao Zou, Jinhao Cui, Xin Kong, Chujuan Zhang, Yong Liu, ...

Keyword(s): Object Tracking, Single Object, Siamese Network

Multiple objects tracking in the UAV system based on hierarchical deep high-resolution network

Multimedia Tools and Applications, 2021. DOI: 10.1007/s11042-020-10427-1

Author(s): Wei Huang, Xiaoshu Zhou, Mingchao Dong, Huaiyu Xu

Keyword(s): High Resolution, Object Tracking, High Performance, State Of The Art, Class Imbalance, Unified Framework, Multiple Objects, Tracking Process, Objects Tracking, Different Types

Robust, high-performance visual multi-object tracking is a major challenge in computer vision, especially in drone scenarios. This paper proposes an online Multi-Object Tracking (MOT) approach for UAV systems that handles small-target detection and class imbalance by integrating a deep high-resolution representation network and a data association method in a unified framework. Specifically, within a tracking-by-detection architecture, a Hierarchical Deep High-resolution network (HDHNet) is proposed, which encourages the model to handle targets of different types and scales and to extract more effective and comprehensive features during online learning. The extracted features are then fed into separate prediction networks to recognize the targets of interest. In addition, an adjustable fusion loss function that combines focal loss and GIoU loss is proposed to address class imbalance and hard samples. During tracking, the detection results in each frame are passed to an improved DeepSORT MOT algorithm, which makes full use of target appearance features to match detections to tracks one by one. Experimental results on the VisDrone2019 MOT benchmark show that the proposed UAV MOT system achieves the highest accuracy and the best robustness compared with state-of-the-art methods.
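
To make the loss combination concrete, here is a minimal sketch using the standard definitions of binary focal loss and GIoU. The abstract does not say how the two terms are balanced, so the weighted sum and its `weight` parameter are assumptions for illustration, as are the function names and the default `alpha` and `gamma` values.

```python
import math


def focal_loss(p, y, alpha=0.25, gamma=2.0):
    """Binary focal loss for predicted probability p and label y in {0, 1}.
    Easy, well-classified samples are down-weighted by (1 - p_t) ** gamma."""
    p_t = p if y == 1 else 1.0 - p
    alpha_t = alpha if y == 1 else 1.0 - alpha
    p_t = min(max(p_t, 1e-12), 1.0)  # clamp to avoid log(0)
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


def giou(box_a, box_b):
    """Generalized IoU of two boxes given as (x1, y1, x2, y2).
    Assumes both boxes have positive area."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    # Smallest axis-aligned box enclosing both inputs.
    enclose = (max(ax2, bx2) - min(ax1, bx1)) * (max(ay2, by2) - min(ay1, by1))
    return inter / union - (enclose - union) / enclose


def fusion_loss(p, y, pred_box, true_box, weight=1.0):
    """Hypothetical adjustable fusion: focal loss plus weighted GIoU loss."""
    return focal_loss(p, y) + weight * (1.0 - giou(pred_box, true_box))
```

Unlike plain IoU, GIoU stays informative when boxes do not overlap: it becomes more negative as the boxes move apart, so the box term still provides a training signal for poorly localized predictions.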

HRSiam: High-Resolution Siamese Network, Towards Space-Borne Satellite Video Tracking

IEEE Transactions on Image Processing, 2021, Vol 30, pp. 3056-3068. DOI: 10.1109/tip.2020.3045634

Author(s): Jia Shao, Bo Du, Chen Wu, Mingming Gong, Tongliang Liu

Keyword(s): High Resolution, Video Tracking, Siamese Network

Single-scale Siamese Network Based RGB-D Object Tracking With Adaptive Bounding Boxes

Neurocomputing, 2021. DOI: 10.1016/j.neucom.2021.04.016

Author(s): Feng Xiao, Qiuxia Wu, Han Huang

Keyword(s): Object Tracking, Siamese Network, Single Scale, Bounding Boxes

An Anchor-Free Siamese Network with Multi-Template Update for Object Tracking

Electronics, 2021, Vol 10 (9), pp. 1067. DOI: 10.3390/electronics10091067

Author(s): Tongtong Yuan, Wenzhu Yang, Qian Li, Yuxia Wang

Keyword(s): Object Tracking, Correlation Energy, Feature Maps, Siamese Network, Template Update, Free Network, Multiple Prediction, Bounding Boxes, High Level, Speed And Accuracy

Siamese trackers are widely used in various fields because they balance speed and accuracy. Compared with anchor-based methods, the anchor-free approach can run faster without any drop in precision. Inspired by the Siamese network and the anchor-free idea, an anchor-free Siamese network (AFSN) with multi-template update for object tracking is proposed. To improve tracking performance, a dual-fusion method is adopted that combines multi-layer features and multiple prediction results, respectively. The low-level feature maps are concatenated with the high-level feature maps to make full use of both spatial and semantic information. To make the results as stable as possible, the final result is obtained by combining multiple prediction results. For template update, a high-confidence multi-template update mechanism is used, in which the average peak-to-correlation energy determines whether the template should be updated. The anchor-free network performs object tracking in a per-pixel manner, computing the object category and bounding boxes directly. Experimental results indicate that the average overlap and success rate of the proposed algorithm increase by about 5% and 10%, respectively, compared with the SiamRPN++ algorithm on the GOT-10k (Generic Object Tracking Benchmark) dataset.
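
The update criterion can be illustrated with the commonly used definition of average peak-to-correlation energy (APCE) over a response map. This is a minimal sketch: the abstract does not give the decision rule, so `should_update_template` and its `apce_threshold` parameter are hypothetical.

```python
def apce(response):
    """Average peak-to-correlation energy of a 2D response map (list of
    rows): (F_max - F_min)^2 / mean((F - F_min)^2). A single sharp peak
    yields a high value; a flat or multi-peak map yields a low one."""
    values = [v for row in response for v in row]
    f_max, f_min = max(values), min(values)
    mean_energy = sum((v - f_min) ** 2 for v in values) / len(values)
    if mean_energy == 0.0:
        return 0.0  # constant map: no usable peak
    return (f_max - f_min) ** 2 / mean_energy


def should_update_template(response, apce_threshold):
    """Hypothetical rule: update only when the response is confident."""
    return apce(response) >= apce_threshold


# A sharp single peak scores higher than a broad, ambiguous response.
sharp = [[0.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 0.0]]
broad = [[0.0, 1.0, 0.0], [1.0, 1.0, 1.0], [0.0, 1.0, 0.0]]
print(apce(sharp) > apce(broad))  # True
```

Gating updates on a confidence measure like this is meant to keep occluded or ambiguous frames from contaminating the stored templates.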
