Research on Vehicle Object Detection Method Based on Convolutional Neural Network

The performance of convolutional neural network- (CNN-) based object detection has achieved incredible success. Howbeit, existing CNN-based algorithms suffer from a problem that small-scale objects are difficult to detect because it may have lost its response when the feature map has reached a certain depth, and it is common that the scale of objects (such as cars, buses, and pedestrians) contained in traffic images and videos varies greatly. In this paper, we present a 32-layer multibranch convolutional neural network named MBNet for fast detecting objects in traffic scenes. Our model utilizes three detection branches, in which feature maps with a size of 16 × 16, 32 × 32, and 64 × 64 are used, respectively, to optimize the detection for large-, medium-, and small-scale objects. By means of a multitask loss function, our model can be trained end-to-end. The experimental results show that our model achieves state-of-the-art performance in terms of precision and recall rate, and the detection speed (up to 33 fps) is fast, which can meet the real-time requirements of industry.

Download Full-text

A novel directional object detection method for piled objects using a hybrid region-based convolutional neural network

Advanced Engineering Informatics ◽

10.1016/j.aei.2021.101448 ◽

2022 ◽

Vol 51 ◽

pp. 101448

Author(s):

Ming-Chuan Chiu ◽

Ho-Yen Tsai ◽

Jing-Er Chiu

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Detection Method ◽

Hybrid Region

Download Full-text

Investigasi pengaruh Step Training pada Skema Same-Padding untuk Metode Faster R-CNN dalam Teknologi Augmented Reality

Jurnal Ilmiah FIFO ◽

10.22441//fifo.2020.v12i2.002 ◽

2021 ◽

Vol 12 (2) ◽

pp. 128

Author(s):

Anky Aditya P ◽

Suryo Adhi Wibowo ◽

Rissa Rahmania

Keyword(s):

Neural Network ◽

Augmented Reality ◽

Object Detection ◽

Convolutional Neural Network ◽

Particle Filter ◽

Real World ◽

Detection Method ◽

Environment Interaction ◽

System Configuration ◽

Network Region

Abstract Augmented Reality (AR) is a technology with the concept of combining real-world dimensions with virtual world dimensions that are displayed in realtime. In the AR environment, interaction techniques used can vary. Marker-based AR is one type of AR that allows virtual objects to be displayed in the real world by using markers as pointers. In the use of marker-based AR required object detection method used for tracking markers. In this study, a system that can detect objects in the form of fingertips will be designed. In designing the system the Faster Region-based Convolutional Neural Network (Faster R-CNN) method is used. R-CNN Faster is an object detection method which is a combination of the Fast R-CNN method and the Region Proposal Network (RPN). The results of the detection parameters will be used for tracking, namely the coordinates x, y, width, and length. This research uses the Faster R-CNN method because it has a faster computing speed compared to the previous method, namely Particle Filter. The Faster R-CNN method uses ResNet architecture as the core of CNN. The system configuration to be tested is the 25K, 50K and 75K step training with the same-padding scheme. The testing process is taken from a video consisting of 10800 training data and 3600 test data. The best system configuration based on parameter priority for AR technology is obtained in the 50K step training.Keyword: augmented reality, convolutional neural network, faster region-based convolutional neural network, region proposal network, ResNet.Abstrak Augmented Reality (AR) adalah teknologi dengan konsep menggabungkan dimensi dunia nyata dengan dimensi dunia virtual yang ditampilkan secara real-time. Dalam lingkungan AR, teknik interaksi yang digunakan dapat bermacam – macam. Marker-based AR merupakan salah satu jenis AR yang memungkinkan objek virtual ditampilkan ke dalam dunia nyata dengan digunakannya marker sebagai pointer-nya. Dalam penggunaan AR berbasis marker diperlukan metode deteksi objek yang digunakan untuk tracking marker. Dalam penelitian ini akan dirancang sebuah sistem yang dapat mendeteksi objek berupa ujung jari. Dalam perancangan sistem tersebut digunakan metode Faster Region-Based Convolutional Nueral Network (Faster R-CNN). Faster R-CNN merupakan salah satu metode deteksi objek yang merupakan gabungan dari metode Fast R-CNN dan Region Proposal Network (RPN). Hasil dari parameter deteksi akan digunakan untuk tracking, yaitu koordinat x, y, width, dan length. Penelitian ini menggunakan metode Faster R-CNN karena memiliki kecepatan komputasi yang lebih cepat dibandingkan dengan metode sebelumnya yaitu Particle Filter. Metode Faster R-CNN mengunakan arsitektur ResNet sebagai inti dari CNN. Konfigurasi sistem yang akan diuji adalah step training 25K, 50K dan 75K dengan skema same-padding. Proses pengujian diambil dari video yang terdiri dari 10800 data latih dan 3600 data uji. Konfigurasi sistem terbaik berdasarkan prioritas parameter untuk teknologi AR didapatkan pada step training 50K.Keyword: augmented reality, convolutional neural network, faster region-based convolutional neural network, region proposal network, ResNet.

Download Full-text

Research on Optimization of Object Detection Technology Based on Convolutional Neural Network

2020 13th International Symposium on Computational Intelligence and Design (ISCID) ◽

10.1109/iscid51228.2020.00010 ◽

2020 ◽

Author(s):

Yang Xue ◽

Huang Wanjun ◽

Yu Hongyang

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Detection Technology

Download Full-text

Geometric property-based convolutional neural network for indoor object detection

International Journal of Advanced Robotic Systems ◽

10.1177/1729881421993323 ◽

2021 ◽

Vol 18 (1) ◽

pp. 172988142199332

Author(s):

Xintao Ding ◽

Boquan Li ◽

Jinbao Wang

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Geometric Property ◽

Ground Truth ◽

Geometric Constraints ◽

Depth Information ◽

Training Set ◽

Object Knowledge ◽

The Mean

Indoor object detection is a very demanding and important task for robot applications. Object knowledge, such as two-dimensional (2D) shape and depth information, may be helpful for detection. In this article, we focus on region-based convolutional neural network (CNN) detector and propose a geometric property-based Faster R-CNN method (GP-Faster) for indoor object detection. GP-Faster incorporates geometric property in Faster R-CNN to improve the detection performance. In detail, we first use mesh grids that are the intersections of direct and inverse proportion functions to generate appropriate anchors for indoor objects. After the anchors are regressed to the regions of interest produced by a region proposal network (RPN-RoIs), we then use 2D geometric constraints to refine the RPN-RoIs, in which the 2D constraint of every classification is a convex hull region enclosing the width and height coordinates of the ground-truth boxes on the training set. Comparison experiments are implemented on two indoor datasets SUN2012 and NYUv2. Since the depth information is available in NYUv2, we involve depth constraints in GP-Faster and propose 3D geometric property-based Faster R-CNN (DGP-Faster) on NYUv2. The experimental results show that both GP-Faster and DGP-Faster increase the performance of the mean average precision.

Download Full-text

A Real-time Driver Fatigue Detection Method Based on Two-Stage Convolutional Neural Network

IFAC-PapersOnLine ◽

10.1016/j.ifacol.2020.12.2357 ◽

2020 ◽

Vol 53 (2) ◽

pp. 15374-15379

Author(s):

Hu He ◽

Xiaoyong Zhang ◽

Fu Jiang ◽

Chenglong Wang ◽

Yingze Yang ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Real Time ◽

Detection Method ◽

Driver Fatigue ◽

Two Stage ◽

Fatigue Detection

Download Full-text

The Design of Preventive Automated Driving Systems Based on Convolutional Neural Network

Electronics ◽

10.3390/electronics10141737 ◽

2021 ◽

Vol 10 (14) ◽

pp. 1737

Author(s):

Wooseop Lee ◽

Min-Hee Kang ◽

Jaein Song ◽

Keeyeon Hwang

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Processing Speed ◽

Model Comparison ◽

Distance Estimation ◽

Visual Object ◽

Suitable Model ◽

Automated Vehicles ◽

Automated Driving

As automated vehicles have been considered one of the important trends in intelligent transportation systems, various research is being conducted to enhance their safety. In particular, the importance of technologies for the design of preventive automated driving systems, such as detection of surrounding objects and estimation of distance between vehicles. Object detection is mainly performed through cameras and LiDAR, but due to the cost and limits of LiDAR’s recognition distance, the need to improve Camera recognition technique, which is relatively convenient for commercialization, is increasing. This study learned convolutional neural network (CNN)-based faster regions with CNN (Faster R-CNN) and You Only Look Once (YOLO) V2 to improve the recognition techniques of vehicle-mounted monocular cameras for the design of preventive automated driving systems, recognizing surrounding vehicles in black box highway driving videos and estimating distances from surrounding vehicles through more suitable models for automated driving systems. Moreover, we learned the PASCAL visual object classes (VOC) dataset for model comparison. Faster R-CNN showed similar accuracy, with a mean average precision (mAP) of 76.4 to YOLO with a mAP of 78.6, but with a Frame Per Second (FPS) of 5, showing slower processing speed than YOLO V2 with an FPS of 40, and a Faster R-CNN, which we had difficulty detecting. As a result, YOLO V2, which shows better performance in accuracy and processing speed, was determined to be a more suitable model for automated driving systems, further progressing in estimating the distance between vehicles. For distance estimation, we conducted coordinate value conversion through camera calibration and perspective transform, set the threshold to 0.7, and performed object detection and distance estimation, showing more than 80% accuracy for near-distance vehicles. Through this study, it is believed that it will be able to help prevent accidents in automated vehicles, and it is expected that additional research will provide various accident prevention alternatives such as calculating and securing appropriate safety distances, depending on the vehicle types.

Download Full-text

Spectral Convolution Feature-Based SPD Matrix Representation for Signal Detection Using a Deep Neural Network

Entropy ◽

10.3390/e22090949 ◽

2020 ◽

Vol 22 (9) ◽

pp. 949

Author(s):

Jiangyi Wang ◽

Min Liu ◽

Xinwu Zeng ◽

Xiaoqiang Hua

Keyword(s):

Neural Network ◽

Signal Detection ◽

Convolutional Neural Network ◽

Deep Neural Network ◽

Detection Method ◽

Learning Algorithm ◽

Simulated Data ◽

Data Sets ◽

Feature Maps ◽

Simulated Data Sets

Convolutional neural networks have powerful performances in many visual tasks because of their hierarchical structures and powerful feature extraction capabilities. SPD (symmetric positive definition) matrix is paid attention to in visual classification, because it has excellent ability to learn proper statistical representation and distinguish samples with different information. In this paper, a deep neural network signal detection method based on spectral convolution features is proposed. In this method, local features extracted from convolutional neural network are used to construct the SPD matrix, and a deep learning algorithm for the SPD matrix is used to detect target signals. Feature maps extracted by two kinds of convolutional neural network models are applied in this study. Based on this method, signal detection has become a binary classification problem of signals in samples. In order to prove the availability and superiority of this method, simulated and semi-physical simulated data sets are used. The results show that, under low SCR (signal-to-clutter ratio), compared with the spectral signal detection method based on the deep neural network, this method can obtain a gain of 0.5–2 dB on simulated data sets and semi-physical simulated data sets.

Download Full-text

SAR Image Change Detection Method via a Pyramid Pooling Convolutional Neural Network

IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss39084.2020.9324109 ◽

2020 ◽

Author(s):

Rongfang Wang ◽

Fan Ding ◽

Jia-Wei Chen ◽

Bo Liu ◽

Jie Zhang ◽

...

Keyword(s):

Neural Network ◽

Change Detection ◽

Convolutional Neural Network ◽

Detection Method ◽

Sar Image ◽

Image Change Detection

Download Full-text

Research on Vehicle Object Detection Method Based on Convolutional Neural Network

Abandoned Object Detection Method Using Convolutional Neural Network

A Multibranch Object Detection Method for Traffic Scenes

A novel directional object detection method for piled objects using a hybrid region-based convolutional neural network

Investigasi pengaruh Step Training pada Skema Same-Padding untuk Metode Faster R-CNN dalam Teknologi Augmented Reality

Research on Optimization of Object Detection Technology Based on Convolutional Neural Network

Geometric property-based convolutional neural network for indoor object detection

A Real-time Driver Fatigue Detection Method Based on Two-Stage Convolutional Neural Network

The Design of Preventive Automated Driving Systems Based on Convolutional Neural Network

Spectral Convolution Feature-Based SPD Matrix Representation for Signal Detection Using a Deep Neural Network

SAR Image Change Detection Method via a Pyramid Pooling Convolutional Neural Network

Export Citation Format