A Fast and Accurate Approach to Multiple-Vehicle Localization and Tracking from Monocular Aerial Images

In this work we present a novel end-to-end solution for tracking objects (i.e., vessels) in dynamic maritime environments using video streams from aerial drones. Our method relies on deep features, learned from realistic simulation data, for robust object detection, segmentation, and tracking. Furthermore, we propose rotated bounding-box representations, computed from pixel-level object segmentation, which improve tracking accuracy by reducing erroneous data associations when combined with the appearance-based features. A thorough set of experiments in a realistic shipyard simulation environment demonstrates that our method can detect and track dynamic objects seen from a top view both accurately and quickly.
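
The abstract does not say how the rotated boxes are derived from the segmentation masks. As a minimal sketch only, one common way is to align a box with the principal axis of the foreground pixels. The function name `rotated_box_from_mask` and the PCA-based fit are assumptions for illustration, not the authors' method, and the result is not guaranteed to be the minimum-area box.

```python
import math

def rotated_box_from_mask(mask):
    """Fit a PCA-aligned rotated box to a binary mask (list of rows).

    Returns (cx, cy, width, height, angle_rad), where width is the extent
    along the principal axis. Hypothetical helper, not from the paper.
    """
    pts = [(x, y) for y, row in enumerate(mask) for x, v in enumerate(row) if v]
    if not pts:
        raise ValueError("mask has no foreground pixels")
    n = len(pts)
    mx = sum(x for x, _ in pts) / n
    my = sum(y for _, y in pts) / n
    # Second-order central moments of the foreground pixels.
    sxx = sum((x - mx) ** 2 for x, _ in pts) / n
    syy = sum((y - my) ** 2 for _, y in pts) / n
    sxy = sum((x - mx) * (y - my) for x, y in pts) / n
    # Orientation of the principal axis.
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    c, s = math.cos(theta), math.sin(theta)
    # Project pixels onto the principal (u) and orthogonal (v) axes.
    us = [(x - mx) * c + (y - my) * s for x, y in pts]
    vs = [-(x - mx) * s + (y - my) * c for x, y in pts]
    u_mid = (max(us) + min(us)) / 2
    v_mid = (max(vs) + min(vs)) / 2
    # Map the box centre back to image coordinates.
    cx = mx + u_mid * c - v_mid * s
    cy = my + u_mid * s + v_mid * c
    return cx, cy, max(us) - min(us), max(vs) - min(vs), theta
```

For an elongated object such as a vessel, the returned angle follows the hull direction, so the box covers far less background than an axis-aligned one when the object is seen diagonally.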

Localization and Tracking of an Indoor Autonomous Vehicle Based on the Phase Difference of Passive UHF RFID Signals

Sensors ◽

10.3390/s21093286 ◽

2021 ◽

Vol 21 (9) ◽

pp. 3286

Author(s):

Yunlei Zhang ◽

Xiaolin Gong ◽

Kaihua Liu ◽

Shuai Zhang

Keyword(s):

Phase Difference ◽

Radio Frequency Identification ◽

Autonomous Vehicles ◽

Autonomous Vehicle ◽

Localization Algorithm ◽

UHF RFID ◽

Dual Frequency ◽

Tracking Accuracy ◽

Vehicle Localization ◽

Localization And Tracking

State-of-the-art radio frequency identification (RFID)-based indoor autonomous vehicle localization methods are mostly based on received signal strength indicator (RSSI) measurements. However, the accuracy of these methods is not high enough for real-world scenarios. To overcome this problem, a novel dual-frequency phase difference of arrival (PDOA) ranging-based indoor autonomous vehicle localization and tracking scheme was developed. First, the method obtains the distance between the RFID reader and the tag by dual-frequency PDOA ranging. Then, a maximum likelihood estimation and semi-definite programming (SDP)-based localization algorithm calculates the position of the autonomous vehicle, which mitigates the multipath ranging error and yields a more accurate positioning result. Finally, vehicle traveling information and the position obtained by RFID localization are fused with a Kalman filter (KF). The proposed method can work in a low-density tag deployment environment. Simulation results showed that the proposed vehicle localization and tracking method achieves centimeter-level mean tracking accuracy.
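
The ranging step can be illustrated with the textbook backscatter relation d = c·Δφ / (4π·Δf), where Δφ is the phase difference measured at two carrier frequencies separated by Δf. The sketch below assumes an ideal, multipath-free channel and a round-trip phase of 4π·d·f/c; the function names and the example frequencies are illustrative assumptions, not taken from the paper.

```python
import math

C = 299_792_458.0  # speed of light in m/s

def simulate_phase(distance_m, freq_hz):
    """Ideal wrapped round-trip phase for a backscatter link (assumed model)."""
    return (4.0 * math.pi * distance_m * freq_hz / C) % (2.0 * math.pi)

def pdoa_range(phi1, phi2, f1_hz, f2_hz):
    """Estimate reader-to-tag distance from phases at two frequencies.

    Valid only up to the unambiguous range c / (2 * (f2 - f1)).
    """
    delta_f = f2_hz - f1_hz
    if delta_f <= 0:
        raise ValueError("f2_hz must be greater than f1_hz")
    # Wrap the phase difference into [0, 2*pi) before converting to distance.
    delta_phi = (phi2 - phi1) % (2.0 * math.pi)
    return C * delta_phi / (4.0 * math.pi * delta_f)

def max_unambiguous_range(f1_hz, f2_hz):
    """Largest distance that does not alias for the given frequency spacing."""
    return C / (2.0 * (f2_hz - f1_hz))
```

With a 2 MHz spacing the unambiguous range is roughly 75 m, which comfortably covers typical indoor distances; a wider spacing improves phase sensitivity but shortens that range.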

Real-time Vehicle Localization and Tracking Using Monocular Panomorph Panoramic Vision

2018 24th International Conference on Pattern Recognition (ICPR) ◽

10.1109/icpr.2018.8546104 ◽

2018 ◽

Author(s):

B. Akdemir ◽

A.N. Belbachir ◽

L.M. Svendsen

Keyword(s):

Real Time ◽

Vehicle Localization ◽

Panoramic Vision ◽

Localization And Tracking

CRITICAL ASSESSMENT OF OBJECT SEGMENTATION IN AERIAL IMAGE USING GEO-HAUSDORFF DISTANCE

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprsarchives-xli-b4-187-2016 ◽

2016 ◽

Vol XLI-B4 ◽

pp. 187-194 ◽

Cited By ~ 1

Author(s):

H. Sun ◽

Y. Ding ◽

Y. Huang ◽

G. Wang

Keyword(s):

Hausdorff Distance ◽

Object Segmentation ◽

Signal To Noise Ratio ◽

Limited Resource ◽

Ground Truth ◽

Aerial Images ◽

Aerial Image ◽

Spatial Distance ◽

Valuable Insight ◽

Spatial Correspondence

Aerial imagery records large-range earth objects with ever-improving spatial and radiometric resolution. It has become a powerful tool for earth observation, land-coverage survey, geographical census, etc., and helps delineate the boundaries of different kinds of objects on the earth both manually and automatically. In light of the geo-spatial correspondence between the pixel locations of an aerial image and the spatial coordinates of ground objects, there is an increasing need for super-pixel segmentation and high-accuracy positioning of objects in aerial images. Besides the commercial software packages eCognition and ENVI, many algorithms have been developed in the literature to segment objects in aerial images. But how to evaluate the segmentation results remains a challenge, especially in the context of the geo-spatial correspondence. The Geo-Hausdorff Distance (GHD) is proposed to measure the geo-spatial distance between the results of various object segmentations, whether produced manually as ground truth or by automatic algorithms. Based on an early-breaking and random-sampling design, the GHD calculates the geographical Hausdorff distance with nearly linear complexity. Segmentation results of several state-of-the-art algorithms, including those of the commercial packages, are evaluated with a diverse set of aerial images. These images have different signal-to-noise ratios around the object boundaries, which are hard to trace correctly even for human operators. The GHD value is analyzed to comprehensively measure the suitability of different object segmentation methods for aerial images of different spatial resolution. By critically assessing the strengths and limitations of the existing algorithms, the paper provides valuable insight and guidelines for extensive research in automating object detection and classification of aerial images in the nation-wide geographic census. It is also promising for the optimal design of operational specifications for remote sensing interpretation under the constraints of limited resources.
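
The early-breaking and random-sampling idea can be sketched for ordinary planar points as follows. This is an illustration with plain Euclidean distance, not the paper's GHD, which works on geographic coordinates; the function names are assumptions.

```python
import math
import random

def directed_hausdorff(a_pts, b_pts, rng=None):
    """Directed Hausdorff distance from a_pts to b_pts with early breaking."""
    if not a_pts or not b_pts:
        raise ValueError("both point sets must be non-empty")
    rng = rng or random.Random(0)
    a_pts, b_pts = list(a_pts), list(b_pts)
    # Random ordering makes an early break likely after only a few inner steps.
    rng.shuffle(a_pts)
    rng.shuffle(b_pts)
    cmax = 0.0
    for ax, ay in a_pts:
        cmin = math.inf
        for bx, by in b_pts:
            d = math.hypot(ax - bx, ay - by)
            if d < cmax:
                # This point's nearest neighbour cannot raise the maximum.
                break
            cmin = min(cmin, d)
        else:
            # No break: cmin is the true nearest distance and is >= cmax.
            cmax = max(cmax, cmin)
    return cmax

def hausdorff(a_pts, b_pts):
    """Symmetric Hausdorff distance between two boundary point sets."""
    return max(directed_hausdorff(a_pts, b_pts),
               directed_hausdorff(b_pts, a_pts))
```

The result does not depend on the shuffle; the random order only affects how often the inner loop exits early, which is where the near-linear average running time comes from.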

Unsupervised Learning in RSS-Based DFLT Using an EM Algorithm

Sensors ◽

10.3390/s21165549 ◽

2021 ◽

Vol 21 (16) ◽

pp. 5549

Author(s):

Ossi Kaltiokallio ◽

Roland Hostettler ◽

Hüseyin Yiğitler ◽

Mikko Valkama

Keyword(s):

EM Algorithm ◽

Expectation Maximization ◽

Tracking System ◽

Model Parameters ◽

Calibration Data ◽

Tracking Accuracy ◽

Time Period ◽

The EM Algorithm ◽

Device Free ◽

Localization And Tracking

Received signal strength (RSS) changes of static wireless nodes can be used for device-free localization and tracking (DFLT). Most RSS-based DFLT systems require access to calibration data, either RSS measurements from a time period when the area was not occupied by people, or measurements while a person stands in known locations. Such calibration periods can be very expensive in terms of time and effort, making system deployment and maintenance challenging. This paper develops an Expectation-Maximization (EM) algorithm based on Gaussian smoothing for estimating the unknown RSS model parameters, liberating the system from supervised training and calibration periods. To fully use the EM algorithm’s potential, a novel localization-and-tracking system is presented to estimate a target’s arbitrary trajectory. To demonstrate the effectiveness of the proposed approach, it is shown that: (i) the system requires no calibration period; (ii) the EM algorithm improves the accuracy of existing DFLT methods; (iii) it is computationally very efficient; and (iv) the system outperforms a state-of-the-art adaptive DFLT system in terms of tracking accuracy.

Estimation for Motion in Tracking and Detection Objects with Kalman Filter

Dynamic Data Assimilation - Beating the Uncertainties ◽

10.5772/intechopen.92863 ◽

2020 ◽

Author(s):

Afef Salhi ◽

Fahmi Ghozzi ◽

Ahmed Fakhfakh

Keyword(s):

Computer Vision ◽

Kalman Filter ◽

Visual Motion ◽

Optimal Solution ◽

Block Matching ◽

Detection And Tracking ◽

Tracking Object ◽

Tracking Objects ◽

Localization And Tracking ◽

Prediction Parameters

The Kalman filter has long been regarded as the optimal solution to many computer vision problems, for example object tracking, prediction, and correction tasks. Its use in the analysis of visual motion has been documented frequently, and it can be applied with computer vision libraries such as OpenCV in many real-world settings, for example robotics, military imaging and video, medical applications, and public security and privacy. In this paper, we investigate a Matlab implementation of a Kalman filter combined with three algorithms for detecting and tracking objects in video sequences: block matching (motion estimation) and Camshift and Meanshift (object localization, detection, and tracking). The Kalman filter is presented in three steps: prediction, estimation (correction), and update. The first step predicts the parameters of the tracked and detected objects. The second step corrects and estimates the predicted parameters. Results are given for the main application of the Kalman filter, the localization and tracking of single and multiple objects. This work presents the extension of an integrated modeling and simulation tool for tracking and detecting objects in computer vision, described for different algorithm models in implementation systems.
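
The prediction and correction steps can be sketched for a single coordinate with a constant-velocity model. This is a minimal illustration in Python rather than the Matlab code the chapter describes; the class name, the diagonal process-noise simplification, and the default noise values are assumptions.

```python
class ConstantVelocityKalman1D:
    """Kalman filter with state [position, velocity] and position measurements."""

    def __init__(self, pos=0.0, vel=0.0, p_var=1.0, q=1e-3, r=0.25):
        self.x = [pos, vel]                      # state estimate
        self.P = [[p_var, 0.0], [0.0, p_var]]    # state covariance
        self.q = q                               # process noise (assumed diagonal)
        self.r = r                               # measurement noise variance

    def predict(self, dt=1.0):
        """Prediction: propagate state and covariance with F = [[1, dt], [0, 1]]."""
        p, v = self.x
        self.x = [p + v * dt, v]
        (a, b), (c, d) = self.P
        # P <- F P F^T + Q, written out for the 2x2 case.
        self.P = [[a + dt * (b + c) + dt * dt * d + self.q, b + dt * d],
                  [c + dt * d, d + self.q]]

    def update(self, z):
        """Correction: blend in a position measurement z with H = [1, 0]."""
        (a, b), (c, d) = self.P
        s = a + self.r                 # innovation variance
        k0, k1 = a / s, c / s          # Kalman gain
        y = z - self.x[0]              # innovation
        self.x = [self.x[0] + k0 * y, self.x[1] + k1 * y]
        # P <- (I - K H) P
        self.P = [[(1 - k0) * a, (1 - k0) * b],
                  [c - k1 * a, d - k1 * b]]
```

In a tracker, `predict` would run once per frame and `update` whenever a detector such as Camshift reports a position; running one filter per image axis gives a simple 2D tracker.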

Mask R-CNN Refitting Strategy for Plant Counting and Sizing in UAV Imagery

Remote Sensing ◽

10.3390/rs12183015 ◽

2020 ◽

Vol 12 (18) ◽

pp. 3015 ◽

Cited By ~ 1

Author(s):

Mélissande Machefer ◽

François Lemarchand ◽

Virginie Bonnefond ◽

Alasdair Hitchins ◽

Panagiotis Sidiropoulos

Keyword(s):

Deep Learning ◽

Fine Tuning ◽

Aerial Images ◽

Individual Plant ◽

Low Density ◽

Tracking Accuracy ◽

Potato Plants ◽

The Individual ◽

Segmentation Task ◽

Vision Algorithm

This work introduces a method that combines remote sensing and deep learning into a framework tailored for accurate, reliable, and efficient counting and sizing of plants in aerial images. The investigated task focuses on two low-density crops, potato and lettuce. This double objective of counting and sizing is achieved through the detection and segmentation of individual plants by fine-tuning an existing deep learning architecture called Mask R-CNN. This paper includes a thorough discussion of the optimal parametrisation to adapt the Mask R-CNN architecture to this novel task. As we examine the correlation of the Mask R-CNN performance to the annotation volume and granularity (coarse or refined) of remotely sensed images of plants, we conclude that transfer learning can be effectively used to reduce the required amount of labelled data. Indeed, a Mask R-CNN previously trained on a low-density crop can improve performance after training on new crops. Once trained for a given crop, the Mask R-CNN solution is shown to outperform a manually tuned computer vision algorithm. Model performance is assessed using intuitive metrics such as Mean Average Precision (mAP) from Intersection over Union (IoU) of the masks for individual plant segmentation and Multiple Object Tracking Accuracy (MOTA) for detection. The presented model reaches an mAP of 0.418 for potato plants and 0.660 for lettuces for the individual plant segmentation task. In detection, we obtain a MOTA of 0.781 for potato plants and 0.918 for lettuces.
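
The two underlying quantities can be sketched as follows: mask IoU as the ratio of overlapping to combined foreground pixels, and MOTA in its standard CLEAR MOT form, 1 − (FN + FP + ID switches) / ground-truth count. The function names are illustrative, and how the paper counts errors on still images is not specified here, so the inputs are left to the caller.

```python
def mask_iou(mask_a, mask_b):
    """Intersection over Union of two equally sized binary masks (lists of rows)."""
    inter = union = 0
    for row_a, row_b in zip(mask_a, mask_b):
        for a, b in zip(row_a, row_b):
            inter += 1 if (a and b) else 0
            union += 1 if (a or b) else 0
    # Two empty masks have no union; report 0.0 by convention (an assumption).
    return inter / union if union else 0.0

def mota(false_negatives, false_positives, id_switches, num_gt):
    """Multiple Object Tracking Accuracy from accumulated error counts."""
    if num_gt <= 0:
        raise ValueError("num_gt must be positive")
    return 1.0 - (false_negatives + false_positives + id_switches) / num_gt
```

MOTA can become negative when the total number of errors exceeds the number of ground-truth objects, so it should not be read as a percentage of correct detections.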

An intelligent video system for vehicle localization and tracking in police cars

Proceedings of the 2009 ACM symposium on Applied Computing - SAC '09 ◽

10.1145/1529282.1529486 ◽

2009 ◽

Cited By ~ 3

Author(s):

Amirali Jazayeri ◽

Hongyuan Cai ◽

Jiang Yu Zheng ◽

Mihran Tuceryan ◽

Herbert Blitzer

Keyword(s):

Vehicle Localization ◽

Video System ◽

Police Cars ◽

Localization And Tracking

METHODS OF IMAGE RECOGNITION IN A VIDEO STREAM

Applied Mathematics and Fundamental Informatics ◽

10.25206/2311-4908-2021-8-1-42-49 ◽

2021 ◽

Vol 8 (1) ◽

pp. 42-49

Author(s):

D. I. Ivanov

Keyword(s):

Video Processing ◽

Moving Objects ◽

Video Stream ◽

Multiple Instance Learning ◽

Learning Tools ◽

Tracking Accuracy ◽

Correlation Filters ◽

Advantages And Disadvantages ◽

Kernelized Correlation Filters ◽

Tracking Objects

The article examines the problem of automatic object recognition using a video stream as a digital image. Algorithms for recognizing and tracking objects in the video stream are considered, methods used in video processing are analyzed, and the use of machine learning tools in working with video is described. The main approaches to recognizing moving objects in a video stream are investigated: the detection-based approach and the tracking-based approach. Arguments are made in favor of the tracking-based approach, and modern methods of tracking objects in the video stream are considered, in particular the following algorithms: the Online Boosting Tracker, one of the first object tracking algorithms with high tracking accuracy; the MIL Tracker (Multiple Instance Learning Tracker), which develops the idea of supervised learning and the Online Boosting algorithm; and the KCF Tracker (Kernelized Correlation Filters Tracker), a method that uses the mathematical properties of overlapping areas of positive examples. As a result, the advantages and disadvantages of the considered methods and algorithms for recognizing and tracking objects for various applications are highlighted.

A Face Tracking Method Based on Camshift and SCKF in Rapid Moving Process

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.668-669.1025 ◽

2014 ◽

Vol 668-669 ◽

pp. 1025-1028

Author(s):

Fu Cheng Cao ◽

Xiao Xue Xing

Keyword(s):

Nonlinear Effect ◽

Face Tracking ◽

Uniform Motion ◽

Square Root ◽

Robust Tracking ◽

Tracking Accuracy ◽

Tracking Method ◽

Camshift Algorithm ◽

Moving Process ◽

Localization And Tracking

To address the problem of face tracking during rapid movement, a fast and robust tracking method is proposed. The likely position in the next frame of the face detected by the Camshift algorithm is predicted by the square-root cubature Kalman filter (SCKF). The face is then localized and tracked frame by frame. The experimental results show that using the SCKF resolves the nonlinear effect caused by non-uniform motion of the face and overcomes the target-loss problem of the linear Kalman algorithm. The proposed method greatly improves the accuracy of face tracking during rapid movement.

Real-Time Multiobject Tracking Based on Multiway Concurrency

Sensors ◽

10.3390/s21030685 ◽

2021 ◽

Vol 21 (3) ◽

pp. 685

Author(s):

Xuan Gong ◽

Zichun Le ◽

Yukun Wu ◽

Hui Wang

Keyword(s):

Real Time ◽

System Level ◽

Depth Image ◽

Tracking Accuracy ◽

Video Streams ◽

The Real ◽

Practical Applications ◽

Time Performance ◽

Tracking By Detection ◽

Multiobject Tracking

This paper explored a pragmatic approach to studying the real-time performance of a multiway concurrent multiobject tracking (MOT) system. At present, most research has focused on tracking single image sequences, but in practical applications, multiway video streams need to be processed in parallel by MOT systems. There have been few studies on the real-time performance of multiway concurrent MOT systems. In this paper, we proposed a new MOT framework that handles the multiway concurrency scenario based on a tracking-by-detection (TBD) model. The new framework mainly focuses on concurrency and real-time operation with limited computing and storage resources, while also considering algorithm performance. For the former, three aspects were studied: (1) Expanded width and depth of the tracking-by-detection model. In terms of width, the MOT system can process multiple video sequences at the same time; in terms of depth, image collectors and bounding box collectors were introduced to support batch processing. (2) Considering real-time performance and multiway concurrency ability, we proposed a real-time MOT algorithm based on directly driven detection. (3) System-level optimization: we also utilized the inference optimization features of NVIDIA TensorRT to accelerate the deep neural network (DNN) in the tracking algorithm. To balance algorithm performance against speed, a negative sample (false detection sample) filter was designed to ensure tracking accuracy. Meanwhile, the factors that affect the system's real-time performance and concurrency were studied. The experimental results showed that our method performs well in processing multiple concurrent real-time video streams.
