scholarly journals Object detection and tracking in video sequences: formalization, metrics and results

Informatics ◽  
2021 ◽  
Vol 18 (1) ◽  
pp. 43-60
Author(s):  
R. P. Bohush ◽  
S. V. Ablameyko

One of the promising areas of development and implementation of artificial intelligence is the automatic detection and tracking of moving objects in video sequence. The paper presents a formalization of the detection and tracking of one and many objects in video. The following metrics are considered: the quality of detection of tracked objects, the accuracy of determining the location of the object in a frame, the trajectory of movement, the accuracy of tracking multiple objects. Based on the considered generalization, an algorithm for tracking people has been developed that uses the tracking through detection method and convolutional neural networks to detect people and form features. Neural network features are included in a composite descriptor that also contains geometric and color features to describe each detected person in the frame. The results of experiments based on the considered criteria are presented, and it is experimentally confirmed that the improvement of the detector operation makes it possible to increase the accuracy of tracking objects. Examples of frames of processed video sequences with visualization of human movement trajectories are presented.

2008 ◽  
Vol 20 (6) ◽  
pp. 1452-1472 ◽  
Author(s):  
Xiangyu Tang ◽  
Christoph von der Malsburg

This letter presents an improved cue integration approach to reliably separate coherent moving objects from their background scene in video sequences. The proposed method uses a probabilistic framework to unify bottom-up and top-down cues in a parallel, “democratic” fashion. The algorithm makes use of a modified Bayes rule where each pixel's posterior probabilities of figure or ground layer assignment are derived from likelihood models of three bottom-up cues and a prior model provided by a top-down cue. Each cue is treated as independent evidence for figure-ground separation. They compete with and complement each other dynamically by adjusting relative weights from frame to frame according to cue quality measured against the overall integration. At the same time, the likelihood or prior models of individual cues adapt toward the integrated result. These mechanisms enable the system to organize under the influence of visual scene structure without manual intervention. A novel contribution here is the incorporation of a top-down cue. It improves the system's robustness and accuracy and helps handle difficult and ambiguous situations, such as abrupt lighting changes or occlusion among multiple objects. Results on various video sequences are demonstrated and discussed. (Video demos are available at http://organic.usc.edu:8376/∼tangx/neco/index.html .)


Video analytics plays a very important role in identification or detection and tracking of objects, this intern find application in many fields and domains. Novel learning methods or techniques built on Neural Networks requires larger dataset for training the results, the output obtained depends on how well the training is done. The proposed method of Weighted Cumulative Summation (WCS) is an approach based on background modelling to segment the moving objects. This method adapts and tunes the background variations instantaneously as the video frame arrives. The segmentation obtained is compared with other basic methods. The result obtained infers improvements in segmentation and in removal of ghost effect in the video. Extended Kalman Filter (EKF) is used to track the detector response. The responses of the detection from WCS are provided as input to EKF to track the moving object. The results are tabulated and represented in the form of graphs for analysis. The results are compared with three different video datasets and the results are noticeably good. The methods WCS can be used in the applications were data set is not available.


2020 ◽  
Vol 9 (4) ◽  
pp. 1394-1403
Author(s):  
Ehsan Akbari Sekehravani ◽  
Eduard Babulak ◽  
Mehdi Masoodi

Tracking of moving objects in a sequence of images is one of the important and functional branches of machine vision technology. Detection and tracking of a flying object with unknown features are important issues in detecting and tracking objects. This paper consists of two basic parts. The first part involves tracking multiple flying objects. At first, flying objects are detected and tracked, using the particle filter algorithm. The second part is to classify tracked objects (military or nonmilitary), based on four criteria; Size (center of mass) of objects, object speed vector, the direction of motion of objects, and thermal imagery identifies the type of tracked flying objects. To demonstrate the efficiency and the strength of the algorithm and the above system, several scenarios in different videos have been investigated that include challenges such as the number of objects (aircraft), different paths, the diverse directions of motion, different speeds and various objects. One of the most important challenges is the speed of processing and the angle of imaging.


2015 ◽  
Vol 734 ◽  
pp. 600-603
Author(s):  
Zhi Hai Sun ◽  
Bin Hu ◽  
Ying Meng ◽  
Wen Hui Zhou

Visual object detection and tracking have become an important step between computer vision and video analysis. Recent methods almost use mean shift for tracking problems, which are difficult to overcome the shortcoming with the initial object model. Initial object model is almost initialized manually by user, which is not smart enough and inconvenient. This paper considers the integration strategy for elliptical subtractive clustering and mean shift, and proposes a novel tracking method based on elliptical subtractive clustering.


Author(s):  
Othman Omran Khalifa ◽  
Norun Abdul Malek ◽  
Kazi Istiaque Ahmed

<span lang="EN-US">Detection of Moving Objects and Tracking is one of the most concerned issue and is being vastly used at home, business and modern applications. It is used to identify and track of an entity in a significant way. This paper illustrates the way to detect multiple objects using background subtraction methods and extract each object features by using Speed-Up Robust Feature algorithm and track the features through k-Nearest Neighbor processing from different surveillance videos sequentially. In the detection of object of each frame, pixel difference is calculated with respect to the reference background frame for the detection of an object which is only suitable for any ideal static condition with the consideration of lights from the environment. Thus, this method will detect the complete object and the extracted feature will be carried out for the tracking of the object in the multiple videos by one by one video. It is expected that this proposed method can commendably abolish the impact of the changing of lights</span>


2018 ◽  
Vol 2 (1) ◽  
Author(s):  
Fatima Ameen ◽  
Ziad Mohammed ◽  
Abdulrahman Siddiq

Tracking systems of moving objects provide a useful means to better control, manage and secure them. Tracking systems are used in different scales of applications such as indoors, outdoors and even used to track vehicles, ships and air planes moving over the globe. This paper presents the design and implementation of a system for tracking objects moving over a wide geographical area. The system depends on the Global Positioning System (GPS) and Global System for Mobile Communications (GSM) technologies without requiring the Internet service. The implemented system uses the freely available GPS service to determine the position of the moving objects. The tests of the implemented system in different regions and conditions show that the maximum uncertainty in the obtained positions is a circle with radius of about 16 m, which is an acceptable result for tracking the movement of objects in wide and open environments.


2020 ◽  
Vol 71 (7) ◽  
pp. 868-880
Author(s):  
Nguyen Hong-Quan ◽  
Nguyen Thuy-Binh ◽  
Tran Duc-Long ◽  
Le Thi-Lan

Along with the strong development of camera networks, a video analysis system has been become more and more popular and has been applied in various practical applications. In this paper, we focus on person re-identification (person ReID) task that is a crucial step of video analysis systems. The purpose of person ReID is to associate multiple images of a given person when moving in a non-overlapping camera network. Many efforts have been made to person ReID. However, most of studies on person ReID only deal with well-alignment bounding boxes which are detected manually and considered as the perfect inputs for person ReID. In fact, when building a fully automated person ReID system the quality of the two previous steps that are person detection and tracking may have a strong effect on the person ReID performance. The contribution of this paper are two-folds. First, a unified framework for person ReID based on deep learning models is proposed. In this framework, the coupling of a deep neural network for person detection and a deep-learning-based tracking method is used. Besides, features extracted from an improved ResNet architecture are proposed for person representation to achieve a higher ReID accuracy. Second, our self-built dataset is introduced and employed for evaluation of all three steps in the fully automated person ReID framework.


Sign in / Sign up

Export Citation Format

Share Document