Object detection in real time based on improved single shot multi-box detector algorithm

Abstract At present, the fastest algorithm that uses a single convolutional network to detect objects in an image is the single shot multi-box detector (SSD) algorithm. This paper studies object detection techniques for detecting objects in real time on any device running the proposed model in any environment. We increase the classification accuracy of object detection by improving the SSD algorithm while keeping its speed constant. The improvements are made in its convolutional layers, using depth-wise separable convolutions together with spatially separable convolutions, referred to here as multilayer convolutional neural networks. The proposed method uses these multilayer convolutional neural networks to build a multilayer system model that classifies the given objects into one of the defined classes. The scheme then takes multiple images and detects the objects in them, labeling each with its class label. To speed up computation, the proposed algorithm is applied together with the multilayer convolutional neural network, which uses a larger number of default boxes and yields more accurate detection. Detection accuracy is assessed with several measures, including the loss function, frames per second (FPS), mean average precision (mAP), and aspect ratio. Experimental results confirm that the proposed improved SSD algorithm has high accuracy.
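As a rough illustration of why depth-wise separable convolutions can add capacity without slowing a detector down, the sketch below compares the number of weights in a standard convolution with that of a depth-wise separable one. The function names and the layer sizes (64 input channels, 128 output channels, 3 x 3 kernel) are assumptions chosen for illustration; they are not taken from the paper, and biases are ignored.

```python
def standard_conv_params(c_in, c_out, k):
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def depthwise_separable_params(c_in, c_out, k):
    """Weights in a depth-wise k x k conv followed by a 1 x 1 point-wise conv."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 conv that mixes the channels
    return depthwise + pointwise


if __name__ == "__main__":
    # Hypothetical layer sizes, for illustration only.
    c_in, c_out, k = 64, 128, 3
    std = standard_conv_params(c_in, c_out, k)
    sep = depthwise_separable_params(c_in, c_out, k)
    print(std)                    # 73728
    print(sep)                    # 8768
    print(f"{std / sep:.2f}x")    # 8.41x fewer weights
```

The same ratio applies to the multiply-accumulate count per output position, which is why such layers are a common choice when speed has to be preserved.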


Dual-Resolution Dual-Path Convolutional Neural Networks for Fast Object Detection

Sensors ◽

10.3390/s19143111 ◽

2019 ◽

Vol 19 (14) ◽

pp. 3111 ◽

Cited By ~ 1

Author(s):

Jing Pan ◽

Hanqing Sun ◽

Zhanjie Song ◽

Jungong Han

Keyword(s):

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Visual Object ◽

Single Shot ◽

Feature Maps ◽

Robotic Vision ◽

Detection Algorithms ◽

Speed Up ◽

Small Feature

Downsampling input images is a simple trick to speed up visual object-detection algorithms, especially on robotic vision and applied mobile vision systems. However, this trick comes with a significant decline in accuracy. In this paper, dual-resolution dual-path Convolutional Neural Networks (CNNs), named DualNets, are proposed to bump up the accuracy of those detection applications. In contrast to previous methods that simply downsample the input images, DualNets explicitly take dual inputs in different resolutions and extract complementary visual features from these using dual CNN paths. The two paths in a DualNet are a backbone path and an auxiliary path that accepts larger inputs and then rapidly downsamples them to relatively small feature maps. With the help of the carefully designed auxiliary CNN paths in DualNets, auxiliary features are extracted from the larger input with controllable computation. Auxiliary features are then fused with the backbone features using a proposed progressive residual fusion strategy to enrich feature representation. This architecture, as the feature extractor, is further integrated with the Single Shot Detector (SSD) to accomplish latency-sensitive visual object-detection tasks. We evaluate the resulting detection pipeline on Pascal VOC and MS COCO benchmarks. Results show that the proposed DualNets can raise the accuracy of those CNN detection applications that are sensitive to computation payloads.


Jet Single Shot Detection

EPJ Web of Conferences ◽

10.1051/epjconf/202125104027 ◽

2021 ◽

Vol 251 ◽

pp. 04027

Author(s):

Adrian Alan Pol ◽

Thea Aarrestad ◽

Katya Govorkova ◽

Roi Halily ◽

Tal Kopetz ◽

...

Keyword(s):

Neural Networks ◽

Large Hadron Collider ◽

Object Detection ◽

Convolutional Neural Networks ◽

Hadron Collider ◽

Single Shot ◽

Cern Large Hadron Collider ◽

Detection Techniques ◽

Shot Detection ◽

Channel Dependent

We apply object detection techniques based on Convolutional Neural Networks to jet reconstruction and identification at the CERN Large Hadron Collider. In particular, we focus on CaloJet reconstruction, representing each event as an image composed of calorimeter cells and using a Single Shot Detection network, called Jet-SSD. The model performs simultaneous localization and classification, along with additional regression tasks to measure jet features. We investigate Ternary Weight Networks, with weights constrained to {-1, 0, 1} times layer- and channel-dependent scaling factors. We show that the quantized version of the network closely matches the performance of its full-precision equivalent.
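The weight constraint described above can be sketched as follows. The abstract does not say how the threshold or the scaling factor is chosen, so this example assumes a commonly used heuristic: zero out weights whose magnitude is below 0.7 times the mean magnitude, and set the scale to the mean magnitude of the surviving weights. The function name `ternarize` and the sample weights are hypothetical.

```python
def ternarize(weights, threshold_ratio=0.7):
    """Quantize a list of floats to scale * code, with code in {-1, 0, 1}.

    Assumed heuristic (not stated in the abstract): the threshold is
    threshold_ratio times the mean absolute weight, and the scale is the
    mean absolute value of the weights that survive the threshold.
    """
    if not weights:
        raise ValueError("weights must be non-empty")
    mean_abs = sum(abs(w) for w in weights) / len(weights)
    delta = threshold_ratio * mean_abs
    codes = [0 if abs(w) <= delta else (1 if w > 0 else -1) for w in weights]
    kept = [abs(w) for w, c in zip(weights, codes) if c != 0]
    scale = sum(kept) / len(kept) if kept else 0.0
    return scale, codes


if __name__ == "__main__":
    # Hypothetical weights for one output channel.
    channel = [0.9, -0.8, 0.05, -0.1, 0.6]
    scale, codes = ternarize(channel)
    print(codes)                       # [1, -1, 0, 0, 1]
    print([scale * c for c in codes])  # dequantized approximation
```

Calling the function once per output channel gives a channel-dependent scaling factor; calling it once on all weights of a layer gives a layer-dependent one.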


Identification of Tomato Disease Types and Detection of Infected Areas Based on Deep Convolutional Neural Networks and Object Detection Techniques

Computational Intelligence and Neuroscience ◽

10.1155/2019/9142753 ◽

2019 ◽

Vol 2019 ◽

pp. 1-15 ◽

Cited By ~ 6

Author(s):

Qimei Wang ◽

Feng Qi ◽

Minghe Sun ◽

Jianhua Qu ◽

Jie Xue

Keyword(s):

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Detection Methods ◽

Disease Detection ◽

Deep Convolutional Neural Networks ◽

Detection Techniques ◽

Tomato Diseases ◽

Validation Set ◽

Tomato Disease

This study develops tomato disease detection methods based on deep convolutional neural networks and object detection models. Two different models, Faster R-CNN and Mask R-CNN, are used in these methods, where Faster R-CNN is used to identify the types of tomato diseases and Mask R-CNN is used to detect and segment the locations and shapes of the infected areas. To select the model that best fits the tomato disease detection task, four different deep convolutional neural networks are combined with the two object detection models. Data are collected from the Internet and the dataset is divided into a training set, a validation set, and a test set used in the experiments. The experimental results show that the proposed models can accurately and quickly identify the eleven tomato disease types and segment the locations and shapes of the infected areas.


Convolutional neural networks for near real-time object detection from UAV imagery in avalanche search and rescue operations

2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) ◽

10.1109/igarss.2016.7729174 ◽

2016 ◽

Cited By ~ 16

Author(s):

Mesay Belete Bejiga ◽

Abdallah Zeggada ◽

Farid Melgani

Keyword(s):

Neural Networks ◽

Object Detection ◽

Real Time ◽

Convolutional Neural Networks ◽

Search And Rescue


Road Object Detection: A Comparative Study of Deep Learning-Based Algorithms

Electronics ◽

10.3390/electronics10161932 ◽

2021 ◽

Vol 10 (16) ◽

pp. 1932

Author(s):

Malik Haris ◽

Adam Glowacz

Keyword(s):

Image Processing ◽

Deep Learning ◽

Object Detection ◽

Real Time ◽

Large Scale ◽

Single Shot ◽

Automated Driving ◽

Convolutional Network ◽

Image Processing Algorithms ◽

Processing Algorithms

Automated driving and vehicle safety systems need object detection. Object detection must be accurate overall, robust to weather and environmental conditions, and able to run in real time. These systems therefore require image processing algorithms to inspect the contents of images. This article compares the accuracy of five major image processing algorithms: Region-based Fully Convolutional Network (R-FCN), Mask Region-based Convolutional Neural Networks (Mask R-CNN), Single Shot Multi-Box Detector (SSD), RetinaNet, and You Only Look Once v4 (YOLOv4). The comparative analysis uses the large-scale Berkeley Deep Drive (BDD100K) dataset. The algorithms' strengths and limitations are analyzed based on parameters such as accuracy (with and without occlusion and truncation), computation time, and the precision-recall curve. The comparison given in this article is helpful in understanding the pros and cons of standard deep learning-based algorithms operating under real-time deployment restrictions. We conclude that YOLOv4 is the most accurate at detecting difficult road target objects under complex road scenarios and weather conditions in an identical testing environment.
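Accuracy figures and precision-recall curves for detectors are usually built on the intersection over union (IoU) between predicted and ground-truth boxes. The abstract does not describe the evaluation code, so the following is only a minimal sketch of those two building blocks, assuming boxes given as `(x1, y1, x2, y2)` corner coordinates; the function names and sample values are hypothetical.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap; zero if the boxes do not intersect.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def precision_recall(tp, fp, fn):
    """Precision and recall from true-positive, false-positive, false-negative counts."""
    precision = tp / (tp + fp) if (tp + fp) > 0 else 0.0
    recall = tp / (tp + fn) if (tp + fn) > 0 else 0.0
    return precision, recall


if __name__ == "__main__":
    # Two 10 x 10 boxes overlapping in a 5 x 5 square: 25 / 175.
    print(iou((0, 0, 10, 10), (5, 5, 15, 15)))
    # Hypothetical counts at one confidence threshold.
    print(precision_recall(tp=8, fp=2, fn=4))
```

A detection typically counts as a true positive when its IoU with a ground-truth box exceeds a chosen threshold; sweeping the detector's confidence threshold then traces out the precision-recall curve.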


Rich feature hierarchies for accurate object detection and semantic segmentation

Radioengineering ◽

10.18127/j00338486-202109-11 ◽

2021 ◽

pp. 115-126

Author(s):

A.Y. Virasova ◽

D.I. Klimov ◽

O.E. Khromov ◽

I.R. Gubaidullin ◽

V.V. Oreshko

Keyword(s):

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Hardware Implementation ◽

Training Data ◽

Fine Tuning ◽

Preliminary Training ◽

Detection Techniques ◽

Foreign Production ◽

Element Base

Formulation of the problem. Over the past few years, there has been little progress in object detection techniques. The most efficient are complex computational ensemble methods, which usually combine several low-level image properties with high-level properties. However, interest in artificial intelligence is growing every day, and the idea of using neural networks on board a spacecraft, with the possibility of making decisions and issuing one-time commands, is very promising, since it makes it possible to analyze a large data stream in real time without resorting to a ground station, and thus without losing information when transmitting a packet. The purpose of the work is to study whether modern neural network models can be used effectively, to develop a methodology for their use in object detection, and to analyze the element base for hardware implementation, with a view to using convolutional neural networks for thermovideotelemetry on board a spacecraft. Results of work. An approach has been formulated that combines two key ideas: 1) application of high-throughput convolutional neural networks for downward processing of image regions in order to localize and segment objects; 2) preliminary training on an auxiliary task, followed by fine tuning for the target domain, which gives a significant increase in performance when the training data is insufficient. The element base is analyzed for the possibility of hardware implementation of neural networks on board a spacecraft using electrical radio products of domestic and foreign production. Practical significance. The efficiency of preliminary network training on an auxiliary task, followed by fine tuning for the subject area, is shown. A technique is described that makes it possible to increase the average accuracy of detecting objects in an image by more than 30%. Also analyzed are the existing element base, the possibility of hardware implementation of neural networks on board the spacecraft using electrical radio products of domestic and foreign production, and the criteria for selecting key elements.


Real-Time Object Detection for Millimeter-Wave Images Based on Improved Faster Regions with Convolutional Neural Networks

Laser & Optoelectronics Progress ◽

10.3788/lop56.131009 ◽

2019 ◽

Vol 56 (13) ◽

pp. 131009

Author(s):

侯冰基 Bingji Hou ◽

杨明辉 Minghui Yang ◽

孙晓玮 Xiaowei Sun

Keyword(s):

Neural Networks ◽

Object Detection ◽

Real Time ◽

Millimeter Wave ◽

Convolutional Neural Networks


SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) ◽

10.1109/cvprw.2017.60 ◽

2017 ◽

Cited By ~ 93

Author(s):

Bichen Wu ◽

Alvin Wan ◽

Forrest Iandola ◽

Peter H. Jin ◽

Kurt Keutzer

Keyword(s):

Neural Networks ◽

Low Power ◽

Object Detection ◽

Real Time ◽

Convolutional Neural Networks ◽

Autonomous Driving ◽

Fully Convolutional Neural Networks


Corrigendum to “Identification of Tomato Disease Types and Detection of Infected Areas Based on Deep Convolutional Neural Networks and Object Detection Techniques”

Computational Intelligence and Neuroscience ◽

10.1155/2021/3751479 ◽

2021 ◽

Vol 2021 ◽

pp. 1-1

Author(s):

Qimei Wang ◽

Feng Qi ◽

Minghe Sun ◽

Jianhua Qu ◽

Jie Xue

Keyword(s):

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Deep Convolutional Neural Networks ◽

Detection Techniques ◽

Tomato Disease


Evaluation and Evolution of Object Detection Techniques YOLO and R-CNN

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.b1154.0782s319 ◽

2019 ◽

Vol 8 (2S3) ◽

pp. 824-829 ◽

Cited By ~ 2

Keyword(s):

Neural Networks ◽

Image Processing ◽

Problem Solving ◽

Object Detection ◽

Convolutional Neural Networks ◽

Mean Average Precision ◽

Average Precision ◽

Trade Off ◽

Detection Techniques

Object detection has boomed in areas like image processing, in step with the unparalleled development of convolutional neural networks (CNNs) over the last decade. The CNN family, which includes R-CNN, has advanced to much faster versions like Fast R-CNN, which reach a mean average precision (mAP) of up to 76.4, but their frame rates still remain between 5 and 18 frames per second (fps), which is only moderate relative to problem-solving time. There is therefore an urgent need to increase the speed of object detection. Against this background, this paper discusses YOLO (You Only Look Once), a strong representative of the CNN family that takes an entirely different approach to the task of detecting objects. YOLO has attained speeds of 155 fps and an mAP of about 78.6, thereby appreciably surpassing the performance of other CNN versions. Furthermore, in comparison with the latest advancements, YOLOv2 attains an outstanding trade-off between accuracy and speed and, as a detector, possesses powerful generalization capabilities for representing an entire image.
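To make the frame rates quoted above easier to compare, the short sketch below converts frames per second into the time budget available per frame. The helper name `latency_ms` is an assumption for illustration; the frame rates are the ones given in the abstract.

```python
def latency_ms(fps):
    """Average time per frame in milliseconds for a given frame rate."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return 1000.0 / fps


if __name__ == "__main__":
    # Frame rates quoted in the abstract.
    for fps in (5, 18, 155):
        print(f"{fps} fps -> {latency_ms(fps):.2f} ms per frame")
    # 5 fps -> 200.00 ms per frame
    # 18 fps -> 55.56 ms per frame
    # 155 fps -> 6.45 ms per frame
```

At 5 to 18 fps a detector has roughly 56 to 200 ms per frame, while 155 fps leaves about 6.5 ms, which is the gap the abstract refers to when it calls for faster detection.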
