Design of a Scalable and Fast YOLO for Edge-Computing Devices

Byung-Gil Han; Joon-Goo Lee; Kil-Taek Lim; Doo-Hyun Choi

doi:10.3390/s20236779

Design of a Scalable and Fast YOLO for Edge-Computing Devices

Sensors ◽

10.3390/s20236779 ◽

2020 ◽

Vol 20 (23) ◽

pp. 6779

Author(s):

Byung-Gil Han ◽

Joon-Goo Lee ◽

Kil-Taek Lim ◽

Doo-Hyun Choi

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Real Time ◽

Processing Speed ◽

Edge Computing ◽

Light Weight ◽

Detection Technology ◽

Accuracy Performance ◽

Speed And Accuracy

With the increase in research cases of the application of a convolutional neural network (CNN)-based object detection technology, studies on the light-weight CNN models that can be performed in real time on the edge-computing devices are also increasing. This paper proposed scalable convolutional blocks that can be easily designed CNN networks of You Only Look Once (YOLO) detector which have the balanced processing speed and accuracy of the target edge-computing devices considering different performances by exchanging the proposed blocks simply. The maximum number of kernels of the convolutional layer was determined through simple but intuitive speed comparison tests for three edge-computing devices to be considered. The scalable convolutional blocks were designed in consideration of the limited maximum number of kernels to detect objects in real time on these edge-computing devices. Three scalable and fast YOLO detectors (SF-YOLO) which designed using the proposed scalable convolutional blocks compared the processing speed and accuracy with several conventional light-weight YOLO detectors on the edge-computing devices. When compared with YOLOv3-tiny, SF-YOLO was seen to be 2 times faster than the previous processing speed but with the same accuracy as YOLOv3-tiny, and also, a 48% improved processing speed than the YOLOv3-tiny-PRN which is the processing speed improvement model. Also, even in the large SF-YOLO model that focuses on the accuracy performance, it achieved a 10% faster processing speed with better accuracy of 40.4% [email protected] in the MS COCO dataset than YOLOv4-tiny model.

Download Full-text

Research on Optimization of Object Detection Technology Based on Convolutional Neural Network

2020 13th International Symposium on Computational Intelligence and Design (ISCID) ◽

10.1109/iscid51228.2020.00010 ◽

2020 ◽

Author(s):

Yang Xue ◽

Huang Wanjun ◽

Yu Hongyang

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Detection Technology

Download Full-text

The Design of Preventive Automated Driving Systems Based on Convolutional Neural Network

Electronics ◽

10.3390/electronics10141737 ◽

2021 ◽

Vol 10 (14) ◽

pp. 1737

Author(s):

Wooseop Lee ◽

Min-Hee Kang ◽

Jaein Song ◽

Keeyeon Hwang

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Processing Speed ◽

Model Comparison ◽

Distance Estimation ◽

Visual Object ◽

Suitable Model ◽

Automated Vehicles ◽

Automated Driving

As automated vehicles have been considered one of the important trends in intelligent transportation systems, various research is being conducted to enhance their safety. In particular, the importance of technologies for the design of preventive automated driving systems, such as detection of surrounding objects and estimation of distance between vehicles. Object detection is mainly performed through cameras and LiDAR, but due to the cost and limits of LiDAR’s recognition distance, the need to improve Camera recognition technique, which is relatively convenient for commercialization, is increasing. This study learned convolutional neural network (CNN)-based faster regions with CNN (Faster R-CNN) and You Only Look Once (YOLO) V2 to improve the recognition techniques of vehicle-mounted monocular cameras for the design of preventive automated driving systems, recognizing surrounding vehicles in black box highway driving videos and estimating distances from surrounding vehicles through more suitable models for automated driving systems. Moreover, we learned the PASCAL visual object classes (VOC) dataset for model comparison. Faster R-CNN showed similar accuracy, with a mean average precision (mAP) of 76.4 to YOLO with a mAP of 78.6, but with a Frame Per Second (FPS) of 5, showing slower processing speed than YOLO V2 with an FPS of 40, and a Faster R-CNN, which we had difficulty detecting. As a result, YOLO V2, which shows better performance in accuracy and processing speed, was determined to be a more suitable model for automated driving systems, further progressing in estimating the distance between vehicles. For distance estimation, we conducted coordinate value conversion through camera calibration and perspective transform, set the threshold to 0.7, and performed object detection and distance estimation, showing more than 80% accuracy for near-distance vehicles. Through this study, it is believed that it will be able to help prevent accidents in automated vehicles, and it is expected that additional research will provide various accident prevention alternatives such as calculating and securing appropriate safety distances, depending on the vehicle types.

Download Full-text

Real-Time 3D object detection using improved convolutional neural network based on image-driven point cloud

Recent Advances in Electrical & Electronic Engineering (Formerly Recent Patents on Electrical & Electronic Engineering) ◽

10.2174/2352096514666211026142721 ◽

2021 ◽

Vol 14 ◽

Author(s):

Zhiyong Gao ◽

Jianhong Xiang

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Real Time ◽

Point Cloud ◽

Point Clouds ◽

3D Point Cloud ◽

3D Object ◽

3D Object Detection ◽

Instance Segmentation

Background: While detecting the object directly from the 3D point cloud, the natural 3D patterns and invariance of 3D data are often obscure. Objective: In this work, we aimed at studying the 3D object detection from discrete, disordered and sparse 3D point clouds. Methods: The CNN is composed of the frustum sequence module, 3D instance segmentation module S-NET, 3D point cloud transformation module T-NET, and 3D boundary box estimation module E-NET. The search space of the object is determined by the frustum sequence module. The instance segmentation of the point cloud is performed by the 3D instance segmentation module. The 3D coordinates of the object are confirmed by the transformation module and the 3D bounding box estimation module. Results: Evaluated on KITTI benchmark dataset, our method outperforms the state of the art by remarkable margins while having real-time capability. Conclusion: We achieve real-time 3D object detection by proposing an improved convolutional neural network (CNN) based on image-driven point clouds.

Download Full-text

Tiny SSD: A Tiny Single-Shot Detection Deep Convolutional Neural Network for Real-Time Embedded Object Detection

2018 15th Conference on Computer and Robot Vision (CRV) ◽

10.1109/crv.2018.00023 ◽

2018 ◽

Cited By ~ 23

Author(s):

Alexander Womg ◽

Mohammad Javad Shafiee ◽

Francis Li ◽

Brendan Chwyl

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Real Time ◽

Deep Convolutional Neural Network ◽

Single Shot ◽

Shot Detection

Download Full-text

A 3D Convolutional Neural Network Towards Real-Time Amodal 3D Object Detection

2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) ◽

10.1109/iros.2018.8593837 ◽

2018 ◽

Author(s):

Hao Sun ◽

Zehui Meng ◽

Xinxin Du ◽

Marcelo H. Ang

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Real Time ◽

3D Object ◽

3D Object Detection

Download Full-text

A Deep Lightweight Convolutional Neural Network Method for Real-Time Small Object Detection in Optical Remote Sensing Images

Sensing and Imaging ◽

10.1007/s11220-021-00348-0 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Yanyong Han ◽

Yandong Han

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Object Detection ◽

Convolutional Neural Network ◽

Real Time ◽

Optical Remote Sensing ◽

Small Object ◽

Remote Sensing Images ◽

Network Method ◽

Small Object Detection

Download Full-text

Real-Time Retail Smart Space Optimization and Personalized Store Assortment with Two-Stage Object Detection Using Faster Regional Convolutional Neural Network

Lecture Notes in Electrical Engineering - Advances in Computing and Network Communications ◽

10.1007/978-981-33-6987-0_33 ◽

2021 ◽

pp. 397-408

Author(s):

Nitin Vamsi Dantu ◽

Shriram K. Vasudevan

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Real Time ◽

Smart Space ◽

Two Stage

Download Full-text

Edge computing-based real-time passenger counting using a compact convolutional neural network

Neural Computing and Applications ◽

10.1007/s00521-018-3894-2 ◽

2018 ◽

Vol 32 (9) ◽

pp. 4919-4931 ◽

Cited By ~ 1

Author(s):

Biao Yang ◽

Jinmeng Cao ◽

Xiaofeng Liu ◽

Nan Wang ◽

Jidong Lv

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Real Time ◽

Edge Computing

Download Full-text

Deep convolutional neural network for real time object detection using tensor flow

Materials Today Proceedings ◽

10.1016/j.matpr.2021.02.671 ◽

2021 ◽

Author(s):

G. Padmapriya ◽

B. Santhosh Kumar ◽

M.N. Kavitha ◽

V. Vennila

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Real Time ◽

Deep Convolutional Neural Network

Download Full-text

Investigasi Pengaruh Step Training pada Metode Single Shot Multibox Detector untuk Marker dalam Teknologi Augmented Reality

Jurnal Ilmiah FIFO ◽

10.22441/fifo.2020.v12i1.001 ◽

2020 ◽

Vol 12 (1) ◽

pp. 1

Author(s):

Vivian Alfionita Sutama ◽

Suryo Adhi Wibowo ◽

Rissa Rahmania

Keyword(s):

Neural Network ◽

Artificial Intelligence ◽

Augmented Reality ◽

Object Detection ◽

Convolutional Neural Network ◽

Real Time ◽

Transfer Learning ◽

Learning Rate ◽

Batch Size ◽

Single Shot

Nowadays, Artificial Intelligence is one of the most developing technology, especially on Augmented Reality (AR). AR is a technology which connected between real world and virtual in a real time that allows user to interact directly and display it in 3D. AR technology has two methods, that are AR based on marker and AR based on markerless. However, AR based on marker need an object detection system which has high performance as an interaction tools between user and the device. Single shot multibox detector (SSD) is an object detection algorithm that has fast learning computation and good performance. This method is affected by some parameters like number of epoch, learning rate, batch size, step training, etc. However, to create a good system it took a long process such as taking dataset, labelling process, then training and testing models to gain the best performance. In this experiment, we analyze SSD method in AR technology using inception architecture as pre-trained Convolutional neural network (CNN), and then do transfer learning to minimize amount training time. The configuration that used is the number of step training. The result of this experiment gets the best accuracy in 70.17%. Then, the best performance is used as an object detection model for marker’s AR technology.Abstrak Saat ini, Artificial intelligence merupakan teknologi yang sedang berkembang pesat. Salah satunya adalah teknologi Augmented Reality (AR). AR adalah teknologi yang menggabungkan dunia nyata dengan virtual secara real-time dengan interaksi pengguna secara langsung dan menampilkannya dalam bentuk 3D. Teknologi AR ini memiliki dua metode yaitu dengan marker dan markerless. Dalam perkembangannya, AR berbasis marker membutuhkan sistem deteksi objek yang memiliki performa tinggi sebagai alat interaksi antara pengguna dengan perangkatnya. Single shot multibox detector (SSD) merupakan algoritma deteksi objek yang memiliki komputasi pembelajaran dan kinerja yang baik. Metode ini dipengaruhi oleh beberapa parameter seperti jumlah lapisan konvolusi, epoch, learning rate, jumlah batch, step training, dll. Namun, dalam mengimplementasikannya diperlukan proses yang cukup panjang seperti, pengambilan dataset, proses pelabelan, proses pelatihan menggunakan metode SSD, dan melakukan pengujian terhadap beberapa model untuk mencari perfomansi paling baik. Dalam percobaan ini, kami melakukan analisis terhadap metode SSD pada teknologi AR menggunakan arsitektur Inception sebagai pre-trained Convolutional neural network (CNN), kemudian dilakukan transfer learning untuk memperkecil jumlah kelas data pelatihan dan waktu pelatihan data. Konfigurasi yang digunakan berupa jumlah step pada pelatihan. Hasil dari penilitian ini menunjukan akurasi terbaik sebesar 70,17%. Kemudian, perfomansi terbaik digunakan sebagai model deteksi objek untuk marker pada teknologi AR.

Download Full-text