Optimized convolutional neural network architectures for efficient on-device vision-based object detection

Neural Computing and Applications ◽

10.1007/s00521-021-06830-w ◽

2021 ◽

Author(s):

Ivan Rodriguez-Conde ◽

Celso Campos ◽

Florentino Fdez-Riverola

Keyword(s):

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Detection Accuracy ◽

Network Architectures ◽

Structural Level ◽

Embedded Devices ◽

Remote Services ◽

The Cost ◽

Accuracy Speed

AbstractConvolutional neural networks have pushed forward image analysis research and computer vision over the last decade, constituting a state-of-the-art approach in object detection today. The design of increasingly deeper and wider architectures has made it possible to achieve unprecedented levels of detection accuracy, albeit at the cost of both a dramatic computational burden and a large memory footprint. In such a context, cloud systems have become a mainstream technological solution due to their tremendous scalability, providing researchers and practitioners with virtually unlimited resources. However, these resources are typically made available as remote services, requiring communication over the network to be accessed, thus compromising the speed of response, availability, and security of the implemented solution. In view of these limitations, the on-device paradigm has emerged as a recent yet widely explored alternative, pursuing more compact and efficient networks to ultimately enable the execution of the derived models directly on resource-constrained client devices. This study provides an up-to-date review of the more relevant scientific research carried out in this vein, circumscribed to the object detection problem. In particular, the paper contributes to the field with a comprehensive architectural overview of both the existing lightweight object detection frameworks targeted to mobile and embedded devices, and the underlying convolutional neural networks that make up their internal structure. More specifically, it addresses the main structural-level strategies used for conceiving the various components of a detection pipeline (i.e., backbone, neck, and head), as well as the most salient techniques proposed for adapting such structures and the resulting architectures to more austere deployment environments. Finally, the study concludes with a discussion of the specific challenges and next steps to be taken to move toward a more convenient accuracy–speed trade-off.

Download Full-text

Multiple-Oriented and Small Object Detection with Convolutional Neural Networks for Aerial Image

Remote Sensing ◽

10.3390/rs11182176 ◽

2019 ◽

Vol 11 (18) ◽

pp. 2176 ◽

Cited By ~ 3

Author(s):

Chen ◽

Zhong ◽

Tan

Keyword(s):

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Aerial Images ◽

Superior Performance ◽

Aerial Image ◽

Detection Accuracy ◽

Small Object ◽

Data Set ◽

Orientation Information

Detecting objects in aerial images is a challenging task due to multiple orientations and relatively small size of the objects. Although many traditional detection models have demonstrated an acceptable performance by using the imagery pyramid and multiple templates in a sliding-window manner, such techniques are inefficient and costly. Recently, convolutional neural networks (CNNs) have successfully been used for object detection, and they have demonstrated considerably superior performance than that of traditional detection methods; however, this success has not been expanded to aerial images. To overcome such problems, we propose a detection model based on two CNNs. One of the CNNs is designed to propose many object-like regions that are generated from the feature maps of multi scales and hierarchies with the orientation information. Based on such a design, the positioning of small size objects becomes more accurate, and the generated regions with orientation information are more suitable for the objects arranged with arbitrary orientations. Furthermore, another CNN is designed for object recognition; it first extracts the features of each generated region and subsequently makes the final decisions. The results of the extensive experiments performed on the vehicle detection in aerial imagery (VEDAI) and overhead imagery research data set (OIRDS) datasets indicate that the proposed model performs well in terms of not only the detection accuracy but also the detection speed.

Download Full-text

Impact of Light Flickering on Object Detection Accuracy using Convolutional Neural Networks

2021 Telecoms Conference (ConfTELE) ◽

10.1109/conftele50222.2021.9435506 ◽

2021 ◽

Author(s):

Samuel Carvalho ◽

Jacqueline Humphries ◽

Nathan Dunne ◽

Shauna Leahy

Keyword(s):

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Detection Accuracy

Download Full-text

Road Characteristics Detection Based on Joint Convolutional Neural Networks with Adaptive Squares

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10060377 ◽

2021 ◽

Vol 10 (6) ◽

pp. 377

Author(s):

Chiao-Ling Kuo ◽

Ming-Hua Tsai

Keyword(s):

Neural Networks ◽

Convolutional Neural Networks ◽

Autonomous Vehicles ◽

Detection Accuracy ◽

Geospatial Information ◽

Combination Rules ◽

And Performance ◽

Road Characteristics ◽

Machine Readable ◽

Background Image

The importance of road characteristics has been highlighted, as road characteristics are fundamental structures established to support many transportation-relevant services. However, there is still huge room for improvement in terms of types and performance of road characteristics detection. With the advantage of geographically tiled maps with high update rates, remarkable accessibility, and increasing availability, this paper proposes a novel simple deep-learning-based approach, namely joint convolutional neural networks (CNNs) adopting adaptive squares with combination rules to detect road characteristics from roadmap tiles. The proposed joint CNNs are responsible for the foreground and background image classification and various types of road characteristics classification from previous foreground images, raising detection accuracy. The adaptive squares with combination rules help efficiently focus road characteristics, augmenting the ability to detect them and provide optimal detection results. Five types of road characteristics—crossroads, T-junctions, Y-junctions, corners, and curves—are exploited, and experimental results demonstrate successful outcomes with outstanding performance in reality. The information of exploited road characteristics with location and type is, thus, converted from human-readable to machine-readable, the results will benefit many applications like feature point reminders, road condition reports, or alert detection for users, drivers, and even autonomous vehicles. We believe this approach will also enable a new path for object detection and geospatial information extraction from valuable map tiles.

Download Full-text

Data Augmentation Methods Applying Grayscale Images for Convolutional Neural Networks in Machine Vision

Applied Sciences ◽

10.3390/app11156721 ◽

2021 ◽

Vol 11 (15) ◽

pp. 6721

Author(s):

Jinyeong Wang ◽

Sanghwan Lee

Keyword(s):

Neural Networks ◽

Machine Vision ◽

Object Detection ◽

Image Classification ◽

Convolutional Neural Networks ◽

Data Augmentation ◽

Image Data ◽

Manufacturing Productivity ◽

Smart Factories ◽

Grayscale Images

In increasing manufacturing productivity with automated surface inspection in smart factories, the demand for machine vision is rising. Recently, convolutional neural networks (CNNs) have demonstrated outstanding performance and solved many problems in the field of computer vision. With that, many machine vision systems adopt CNNs to surface defect inspection. In this study, we developed an effective data augmentation method for grayscale images in CNN-based machine vision with mono cameras. Our method can apply to grayscale industrial images, and we demonstrated outstanding performance in the image classification and the object detection tasks. The main contributions of this study are as follows: (1) We propose a data augmentation method that can be performed when training CNNs with industrial images taken with mono cameras. (2) We demonstrate that image classification or object detection performance is better when training with the industrial image data augmented by the proposed method. Through the proposed method, many machine-vision-related problems using mono cameras can be effectively solved by using CNNs.

Download Full-text

Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images

IEEE Transactions on Geoscience and Remote Sensing ◽

10.1109/tgrs.2016.2601622 ◽

2016 ◽

Vol 54 (12) ◽

pp. 7405-7415 ◽

Cited By ~ 612

Author(s):

Gong Cheng ◽

Peicheng Zhou ◽

Junwei Han

Keyword(s):

Remote Sensing ◽

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Optical Remote Sensing ◽

Remote Sensing Images ◽

Rotation Invariant

Download Full-text

A shape preserving approach for salient object detection using convolutional neural networks

2016 23rd International Conference on Pattern Recognition (ICPR) ◽

10.1109/icpr.2016.7899701 ◽

2016 ◽

Cited By ~ 4

Author(s):

Jongpil Kim ◽

Vladimir Pavlovic

Keyword(s):

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Salient Object Detection ◽

Salient Object ◽

Shape Preserving

Download Full-text

End-to-end object detection and recognition in forward-looking sonar images with convolutional neural networks

2016 IEEE/OES Autonomous Underwater Vehicles (AUV) ◽

10.1109/auv.2016.7778662 ◽

2016 ◽

Cited By ~ 8

Author(s):

Matias Valdenegro-Toro

Keyword(s):

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Sonar Images ◽

End To End ◽

Detection And Recognition ◽

Forward Looking

Download Full-text

Object Detection and Mapping with Unmanned Aerial Vehicles Using Convolutional Neural Networks

Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering - Future Access Enablers for Ubiquitous and Intelligent Infrastructures ◽

10.1007/978-3-030-78459-1_19 ◽

2021 ◽

pp. 254-267

Author(s):

Stefan Hensel ◽

Marin B. Marinov ◽

Max Schmitt

Keyword(s):

Neural Networks ◽

Object Detection ◽

Unmanned Aerial Vehicles ◽

Convolutional Neural Networks ◽

Aerial Vehicles

Download Full-text

ObjectFusion: An object detection and segmentation framework with RGB-D SLAM and convolutional neural networks

Neurocomputing ◽

10.1016/j.neucom.2019.01.088 ◽

2019 ◽

Vol 345 ◽

pp. 3-14 ◽

Cited By ~ 7

Author(s):

Guanzhong Tian ◽

Liang Liu ◽

JongHyok Ri ◽

Yong Liu ◽

Yiran Sun

Keyword(s):

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Segmentation Framework

Download Full-text

Object Detection by a Super-Resolution Method and a Convolutional Neural Networks

2018 IEEE International Conference on Big Data (Big Data) ◽

10.1109/bigdata.2018.8622135 ◽

2018 ◽

Cited By ~ 1

Author(s):

Bokyoon Na ◽

Geoffrey C Fox

Keyword(s):

Neural Networks ◽

Object Detection ◽

Convolutional Neural Networks ◽

Super Resolution ◽

Resolution Method

Download Full-text