Semi-supervised Deep Neural Networks for Object Detection in Video Surveillance Systems

This article presents an analysis of the effectiveness of object detection in digital images with the application of a limited quantity of input. The possibility of using a limited set of learning data was achieved by developing a detailed scenario of the task, which strictly defined the conditions of detector operation in the considered case of a convolutional neural network. The described solution utilizes known architectures of deep neural networks in the process of learning and object detection. The article presents comparisons of results from detecting the most popular deep neural networks while maintaining a limited training set composed of a specific number of selected images from diagnostic video. The analyzed input material was recorded during an inspection flight conducted along high-voltage lines. The object detector was built for a power insulator. The main contribution of the presented papier is the evidence that a limited training set (in our case, just 60 training frames) could be used for object detection, assuming an outdoor scenario with low variability of environmental conditions. The decision of which network will generate the best result for such a limited training set is not a trivial task. Conducted research suggests that the deep neural networks will achieve different levels of effectiveness depending on the amount of training data. The most beneficial results were obtained for two convolutional neural networks: the faster region-convolutional neural network (faster R-CNN) and the region-based fully convolutional network (R-FCN). Faster R-CNN reached the highest AP (average precision) at a level of 0.8 for 60 frames. The R-FCN model gained a worse AP result; however, it can be noted that the relationship between the number of input samples and the obtained results has a significantly lower influence than in the case of other CNN models, which, in the authors’ assessment, is a desired feature in the case of a limited training set.

Download Full-text

Real-Time Object Detection in Embedded Video Surveillance Systems

2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services ◽

10.1109/wiamis.2008.20 ◽

2008 ◽

Cited By ~ 9

Author(s):

Liliana Lo Presti ◽

Marco La Cascia

Keyword(s):

Object Detection ◽

Real Time ◽

Video Surveillance ◽

Surveillance Systems

Download Full-text

Contrast-Oriented Deep Neural Networks for Salient Object Detection

IEEE Transactions on Neural Networks and Learning Systems ◽

10.1109/tnnls.2018.2817540 ◽

2018 ◽

Vol 29 (12) ◽

pp. 6038-6051 ◽

Cited By ~ 30

Author(s):

Guanbin Li ◽

Yizhou Yu

Keyword(s):

Neural Networks ◽

Object Detection ◽

Deep Neural Networks ◽

Salient Object Detection ◽

Salient Object

Download Full-text

Deep Neural Networks Based Object Detection for Road Safety Using YOLO-V3

Smart Computing Techniques and Applications - Smart Innovation, Systems and Technologies ◽

10.1007/978-981-16-0878-0_71 ◽

2021 ◽

pp. 731-738

Author(s):

Jalaja Tattari ◽

Vineeth Reddy Donthi ◽

Dheeraj Mukirala ◽

S. Komal Kour

Keyword(s):

Neural Networks ◽

Object Detection ◽

Road Safety ◽

Deep Neural Networks

Download Full-text

Efficient Foreign Object Detection Between PSDs and Metro Doors via Deep Neural Networks

IEEE Access ◽

10.1109/access.2020.2978912 ◽

2020 ◽

Vol 8 ◽

pp. 46723-46734 ◽

Cited By ~ 1

Author(s):

Yuan Dai ◽

Weiming Liu ◽

Haiyu Li ◽

Lan Liu

Keyword(s):

Neural Networks ◽

Object Detection ◽

Deep Neural Networks ◽

Foreign Object

Download Full-text

A deep learning approach to building an intelligent video surveillance system

Multimedia Tools and Applications ◽

10.1007/s11042-020-09964-6 ◽

2020 ◽

Cited By ~ 1

Author(s):

Jie Xu

Keyword(s):

Face Recognition ◽

Object Detection ◽

Video Surveillance ◽

Surveillance System ◽

Video Surveillance System ◽

Surveillance Systems ◽

Single Shot ◽

Convolutional Networks ◽

Starting Point ◽

Intelligent Video Surveillance System

Abstract Recent advances in the field of object detection and face recognition have made it possible to develop practical video surveillance systems with embedded object detection and face recognition functionalities that are accurate and fast enough for commercial uses. In this paper, we compare some of the latest approaches to object detection and face recognition and provide reasons why they may or may not be amongst the best to be used in video surveillance applications in terms of both accuracy and speed. It is discovered that Faster R-CNN with Inception ResNet V2 is able to achieve some of the best accuracies while maintaining real-time rates. Single Shot Detector (SSD) with MobileNet, on the other hand, is incredibly fast and still accurate enough for most applications. As for face recognition, FaceNet with Multi-task Cascaded Convolutional Networks (MTCNN) achieves higher accuracy than advances such as DeepFace and DeepID2+ while being faster. An end-to-end video surveillance system is also proposed which could be used as a starting point for more complex systems. Various experiments have also been attempted on trained models with observations explained in detail. We finish by discussing video object detection and video salient object detection approaches which could potentially be used as future improvements to the proposed system.

Download Full-text