Research on Object Detection Algorithm Based on Deep Learning for Mobile Terminal

2021 · Vol 2083 (3) · pp. 032019
Author(s): Jie Luo, Jufeng Ye, Lan Shan, Qunying He

Abstract In order to improve the object detection ability of robots, this study introduces an object detection algorithm based on deep learning. Firstly, a neural network containing convolution, pooling and fully connected layers is designed to recognize whether the target object is present in an image. Secondly, Faster R-CNN, a typical object detection algorithm, is used to detect the position of the target object in the image. Finally, the algorithm is ported to a Raspberry Pi for deployment and testing. Results show that the algorithm works well on the Raspberry Pi, reaching a mAP (mean average precision) of 97% and 3 FPS (frames per second), and that the position of the target object is accurately marked in the image. This study therefore implies that a deep-learning-based object detection algorithm can be implemented on a mobile terminal and helps improve the object detection ability of mobile terminals.
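A minimal sketch of the deployment step described above: running a pretrained torchvision Faster R-CNN on a single image and reporting the kept boxes and the resulting frame rate. The model choice, score threshold and file name are illustrative assumptions, not the authors' exact configuration.

```python
import time
import torch
import torchvision
from torchvision.transforms.functional import to_tensor
from PIL import Image

# Pretrained detector; the paper's own network may differ (assumption).
model = torchvision.models.detection.fasterrcnn_resnet50_fpn(weights="DEFAULT")
model.eval()

def detect(image_path, score_thresh=0.5):
    img = to_tensor(Image.open(image_path).convert("RGB"))
    start = time.time()
    with torch.no_grad():
        out = model([img])[0]            # dict with "boxes", "labels", "scores"
    fps = 1.0 / (time.time() - start)
    keep = out["scores"] >= score_thresh
    return out["boxes"][keep], out["labels"][keep], fps

boxes, labels, fps = detect("target.jpg")   # hypothetical test image
print(f"{len(boxes)} objects detected, {fps:.1f} FPS")
```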

2020 · Vol 28 (S2)
Author(s): Asmida Ismail, Siti Anom Ahmad, Azura Che Soh, Mohd Khair Hassan, Hazreen Haizi Harith

An object detection system is a computer technology, related to image processing and computer vision, that detects instances of semantic objects of a certain class in digital images and videos. The system consists of two main processes, classification and detection. Once an object instance has been classified and detected, further information can be obtained, including recognizing the specific instance, tracking the object over an image sequence and extracting further information about the object and the scene. This paper presents a performance analysis of a deep learning object detector built by combining a deep learning Convolutional Neural Network (CNN) for object classification with classic object detection algorithms. MiniVGGNet is the network architecture used to train the object classifier, and the data used for this purpose were collected from a specific indoor building environment. For object detection, sliding windows and image pyramids were used to localize and detect objects at different locations and scales, and non-maxima suppression (NMS) was used to obtain the final bounding box localizing each object. Based on the experimental results, the classification accuracy of the network is 80% to 90% and the time for the system to detect an object is less than 15 s per frame. The results show that it is reasonable and efficient to combine a classic object detection method with a deep learning classification approach; the method works in some specific use cases and effectively mitigates inaccurate classification and detection of typical features.
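A sketch of the classic localization pipeline the paper combines with the CNN classifier: an image pyramid, a sliding window, and greedy non-maxima suppression. The scale factor, window step and thresholds are assumptions; the trained MiniVGGNet classifier would be applied to each window produced by `sliding_window`.

```python
import cv2
import numpy as np

def pyramid(image, scale=1.5, min_size=(64, 64)):
    """Yield progressively smaller copies of the image."""
    yield image
    while True:
        w = int(image.shape[1] / scale)
        h = int(image.shape[0] / scale)
        if h < min_size[1] or w < min_size[0]:
            break
        image = cv2.resize(image, (w, h))
        yield image

def sliding_window(image, step, size):
    """Yield (x, y, window) crops of shape `size` every `step` pixels."""
    for y in range(0, image.shape[0] - size[1] + 1, step):
        for x in range(0, image.shape[1] - size[0] + 1, step):
            yield x, y, image[y:y + size[1], x:x + size[0]]

def nms(boxes, scores, iou_thresh=0.3):
    """Greedy non-maxima suppression; boxes are (x1, y1, x2, y2) arrays."""
    order = np.argsort(scores)[::-1]
    keep = []
    while len(order) > 0:
        i = order[0]
        keep.append(i)
        xx1 = np.maximum(boxes[i, 0], boxes[order[1:], 0])
        yy1 = np.maximum(boxes[i, 1], boxes[order[1:], 1])
        xx2 = np.minimum(boxes[i, 2], boxes[order[1:], 2])
        yy2 = np.minimum(boxes[i, 3], boxes[order[1:], 3])
        inter = np.maximum(0, xx2 - xx1) * np.maximum(0, yy2 - yy1)
        area_i = (boxes[i, 2] - boxes[i, 0]) * (boxes[i, 3] - boxes[i, 1])
        area_o = (boxes[order[1:], 2] - boxes[order[1:], 0]) * \
                 (boxes[order[1:], 3] - boxes[order[1:], 1])
        iou = inter / (area_i + area_o - inter)
        order = order[1:][iou < iou_thresh]
    return keep
```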


Author(s): S Gopi Naik

Abstract: The aim is to establish an integrated system that can manage high-quality visual information and detect weapons quickly and efficiently. This is achieved by integrating ARM-based computer vision and optimization algorithms with deep neural networks able to detect the presence of a threat. The whole system is connected to a Raspberry Pi module, which captures the live broadcast and evaluates it using a deep convolutional neural network. Because object identification is tightly coupled with real-time video and image analysis, approaches that build sophisticated ensembles combining low-level image features with high-level information from object detectors and scene classifiers quickly reach a performance plateau. Deep learning models, which can learn semantic, high-level, deeper features, have been developed to overcome these limitations of optimization algorithms. This paper presents a review of deep-learning-based object detection frameworks that use convolutional neural network layers for a better understanding of object detection. The Mobile-Net SSD model differs in network design, training methods and optimization functions, among other things. Weapon detection has reduced the crime rate in monitored suspicious areas; nevertheless, security remains a major concern in human life. The Raspberry Pi module with computer vision has been extensively used in the detection and monitoring of weapons. With the growing demand for human safety, privacy and the integration of live broadcasting systems that can detect and analyse images, the monitoring of suspicious areas is becoming indispensable in surveillance. This process uses a Mobile-Net SSD algorithm to achieve automatic weapon and object detection. Keywords: Computer Vision, Weapon and Object Detection, Raspberry Pi Camera, RTSP, SMTP, Mobile-Net SSD, CNN, Artificial Intelligence.
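A hedged sketch of the Mobile-Net SSD pipeline the abstract describes: read an RTSP stream on the Raspberry Pi, run the detector with OpenCV's DNN module, and flag frames containing a weapon class. The model files, class index and RTSP URL below are illustrative assumptions, not the authors' exact artefacts.

```python
import cv2

# Publicly available MobileNet-SSD Caffe files (assumed file names).
net = cv2.dnn.readNetFromCaffe("MobileNetSSD_deploy.prototxt",
                               "MobileNetSSD_deploy.caffemodel")
WEAPON_CLASS_IDS = {20}   # hypothetical index of the weapon class

cap = cv2.VideoCapture("rtsp://camera.local/stream")   # live broadcast source
while True:
    ok, frame = cap.read()
    if not ok:
        break
    blob = cv2.dnn.blobFromImage(cv2.resize(frame, (300, 300)),
                                 0.007843, (300, 300), 127.5)
    net.setInput(blob)
    detections = net.forward()            # shape: (1, 1, N, 7)
    for i in range(detections.shape[2]):
        score = float(detections[0, 0, i, 2])
        cls = int(detections[0, 0, i, 1])
        if score > 0.5 and cls in WEAPON_CLASS_IDS:
            # The paper's system would raise an alert here, e.g. an SMTP
            # email with the offending frame attached.
            print(f"possible weapon, confidence {score:.2f}")
```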


2021 · Vol 2 (3) · pp. 213-232
Author(s): Lusiana Rahma, Hadi Syaputra, A. Haidar Mirza, Susan Dian Purnamasari

Deep learning is a part of machine learning that uses artificial neural networks (ANNs). Learning in deep learning can be supervised, semi-supervised or unsupervised [7]. CNNs and RNNs (supervised) and RBMs and autoencoders (unsupervised) are deep learning algorithms, each with uses in its respective field depending on the purpose. One of the most frequent use cases for deep learning is object detection and classification. The Convolutional Neural Network (CNN) is the most widely used algorithm for object detection, partly because it is supported by Google's TensorFlow framework; however, the You Only Look Once (YOLO) object detection algorithm offers higher accuracy and processing speed, runs on two frameworks (Darknet and Darkflow) and is GPU-supported. For this reason, the authors chose to perform object detection with the YOLO method. The research data for this study, titled Palembang Food Object Detection Using the YOLO (You Only Look Once) Algorithm, are sample food photos from Google Images. There are 31 types of Palembang specialties, each consisting of approximately 50 to 70 images, giving a total of 1955 JPEG images of the 31 Palembang food types as training data and 31 JPEG images of typical Palembang foods as test data.
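A small sketch of how such a dataset is typically prepared for Darknet-style YOLO training: each image gets a .txt label file containing normalized "class x_center y_center width height" lines, and train.txt lists the image paths. The directory names and the annotation source are assumptions, not the authors' actual tooling.

```python
from pathlib import Path

def to_yolo_line(class_id, box, img_w, img_h):
    """box = (xmin, ymin, xmax, ymax) in pixels -> normalized YOLO label line."""
    x_c = (box[0] + box[2]) / 2.0 / img_w
    y_c = (box[1] + box[3]) / 2.0 / img_h
    w = (box[2] - box[0]) / img_w
    h = (box[3] - box[1]) / img_h
    return f"{class_id} {x_c:.6f} {y_c:.6f} {w:.6f} {h:.6f}"

def write_labels(annotations, out_dir="labels"):
    """annotations: {image_path: [(class_id, (xmin, ymin, xmax, ymax), w, h), ...]}"""
    Path(out_dir).mkdir(exist_ok=True)
    with open("train.txt", "w") as train_list:
        for img_path, boxes in annotations.items():
            lines = [to_yolo_line(c, b, w, h) for c, b, w, h in boxes]
            label_file = Path(out_dir) / (Path(img_path).stem + ".txt")
            label_file.write_text("\n".join(lines))
            train_list.write(img_path + "\n")
```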


2021 · Vol 11 (13) · pp. 6085
Author(s): Jesus Salido, Vanesa Lomas, Jesus Ruiz-Santaquiteria, Oscar Deniz

There is a great need to implement preventive mechanisms against shootings and terrorist acts in public spaces with a large influx of people. While surveillance cameras have become common, the need for monitoring 24/7 and real-time response requires automatic detection methods. This paper presents a study based on three convolutional neural network (CNN) models applied to the automatic detection of handguns in video surveillance images. It aims to investigate the reduction of false positives by including pose information associated with the way the handguns are held in the images belonging to the training dataset. The results highlighted the best average precision (96.36%) and recall (97.23%) obtained by RetinaNet fine-tuned with the unfrozen ResNet-50 backbone and the best precision (96.23%) and F1 score values (93.36%) obtained by YOLOv3 when it was trained on the dataset including pose information. This last architecture was the only one that showed a consistent improvement—around 2%—when pose information was expressly considered during training.
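A minimal sketch of the fine-tuning setup the best AP/recall figures refer to: a torchvision RetinaNet whose ImageNet-pretrained ResNet-50 backbone is left fully trainable ("unfrozen"). The class count, learning rate and data loader are placeholders, not the authors' exact configuration.

```python
import torch
from torchvision.models.detection import retinanet_resnet50_fpn

model = retinanet_resnet50_fpn(
    weights=None,                  # detection head trained from scratch
    weights_backbone="DEFAULT",    # ImageNet-pretrained ResNet-50
    num_classes=2,                 # background + handgun (assumed)
    trainable_backbone_layers=5,   # all backbone layers unfrozen
)
optimizer = torch.optim.SGD(model.parameters(), lr=0.005, momentum=0.9)

def train_one_epoch(loader):
    """loader yields (images, targets) with targets = list of {"boxes", "labels"}."""
    model.train()
    for images, targets in loader:
        losses = model(images, targets)      # dict of classification/box losses
        loss = sum(losses.values())
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```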


Author(s): Hongguo Su, Mingyuan Zhang, Shengyuan Li, Xuefeng Zhao

In the last couple of years, advancements in deep learning, especially in convolutional neural networks, have proved to be a boon for image classification and recognition tasks. One important practical application of object detection and image classification is security enhancement: if dangerous objects or scenes can be identified automatically, many accidents can be prevented. For this purpose, this paper uses a state-of-the-art implementation of the Faster Region-based Convolutional Neural Network (Faster R-CNN), trained on monitoring video of hoisting sites, to detect dangerous objects and workers. By extracting their locations, object-human interactions during hoisting, mainly changes in their spatial relationship, can be understood, and it can thereby be estimated whether a scene is safe or dangerous. Experimental results show that the pre-trained model achieved good performance, with a high mean average precision of 97.66% for object detection, and that the proposed method fulfilled the goal of dangerous scene recognition.
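An illustrative sketch of the safety-estimation step described above: after the detector returns boxes for workers and the hoisted (dangerous) object, the scene is flagged as dangerous when a worker's box overlaps, or lies within a margin of, the object's box. The margin and the overlap rule are assumptions, not the paper's exact criterion.

```python
def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    x1, y1 = max(a[0], b[0]), max(a[1], b[1])
    x2, y2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area = lambda r: (r[2] - r[0]) * (r[3] - r[1])
    return inter / (area(a) + area(b) - inter + 1e-9)

def scene_is_dangerous(worker_boxes, object_boxes, iou_thresh=0.05, margin=50):
    """Flag the frame if any worker box intersects an expanded object box."""
    for w in worker_boxes:
        for o in object_boxes:
            expanded = (o[0] - margin, o[1] - margin, o[2] + margin, o[3] + margin)
            if iou(w, expanded) > iou_thresh:
                return True
    return False
```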


IEEE Access · 2019 · Vol 7 · pp. 171461-171470
Author(s): Dianwei Wang, Yanhui He, Ying Liu, Daxiang Li, Shiqian Wu, ...

Electronics · 2020 · Vol 9 (8) · pp. 1235
Author(s): Yang Yang, Hongmin Deng

In order to make the classification and regression of single-stage detectors more accurate, an object detection algorithm named Global Context You-Only-Look-Once v3 (GC-YOLOv3) is proposed in this paper, based on You-Only-Look-Once (YOLO). Firstly, a cascading model with learnable semantic fusion between the feature extraction network and the feature pyramid network is designed to improve detection accuracy using a global context block. Secondly, the information to be retained is screened by combining feature maps at three different scales. Finally, a global self-attention mechanism is used to highlight the useful information in feature maps while suppressing irrelevant information. Experiments show that GC-YOLOv3 reaches a maximum object detection mean Average Precision (mAP)@0.5 of 55.5 on the Common Objects in Context (COCO) 2017 test-dev set and that its mAP is 5.1% higher than that of the YOLOv3 algorithm on the Pascal Visual Object Classes (PASCAL VOC) 2007 test set. These experiments indicate that the proposed GC-YOLOv3 model performs strongly on both the PASCAL VOC and COCO datasets.
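A compact PyTorch sketch of a global context block of the kind GC-YOLOv3 inserts between the backbone and the feature pyramid: global attention pooling followed by a bottleneck transform whose output is added back to the input feature map. This follows the common GCNet-style formulation and is an assumption about the paper's exact block design.

```python
import torch
import torch.nn as nn

class GlobalContextBlock(nn.Module):
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.attn = nn.Conv2d(channels, 1, kernel_size=1)   # context mask
        hidden = max(channels // reduction, 1)
        self.transform = nn.Sequential(
            nn.Conv2d(channels, hidden, kernel_size=1),
            nn.LayerNorm([hidden, 1, 1]),
            nn.ReLU(inplace=True),
            nn.Conv2d(hidden, channels, kernel_size=1),
        )

    def forward(self, x):
        b, c, h, w = x.shape
        # Global attention pooling: one softmax weight per spatial position.
        weights = self.attn(x).view(b, 1, h * w).softmax(dim=-1)        # (B,1,HW)
        context = torch.bmm(x.view(b, c, h * w), weights.transpose(1, 2))
        context = context.view(b, c, 1, 1)                              # (B,C,1,1)
        # Transform the pooled context and broadcast-add it to every position.
        return x + self.transform(context)
```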


2020 · Vol 12 (22) · pp. 9785
Author(s): Kisu Lee, Goopyo Hong, Lee Sael, Sanghyo Lee, Ha Young Kim

Defects in residential building façades affect the structural integrity of buildings and degrade their external appearance. Defects in a building façade are typically managed manually during maintenance; this approach is time-consuming, yields subjective results, and can lead to accidents or casualties. To address this, we propose a building façade monitoring system that uses a deep-learning-based object detection method to manage defects efficiently while minimizing the involvement of manpower. The dataset used to train the deep-learning-based network contains actual residential building façade images. The varied building designs in these raw images make defects difficult to detect because of their various types and complex backgrounds. We employed the Faster Region-based Convolutional Neural Network (Faster R-CNN) structure for more accurate defect detection in such environments, achieving an average precision (intersection over union (IoU) = 0.5) of 62.7% over all trained defect types. Because detecting defects in such a training environment remains difficult, the performance of the network still needs to be improved. Nevertheless, the object detection network employed in this study performs well on complex real-world images, indicating the possibility of developing a system that detects defects in more types of building façades.
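A hedged sketch of adapting a pretrained torchvision Faster R-CNN to a custom set of façade defect classes by replacing its box predictor head, which is the standard way to fine-tune this architecture on new labels; the number and names of defect classes are placeholders, not the paper's actual label set.

```python
import torchvision
from torchvision.models.detection.faster_rcnn import FastRCNNPredictor

def build_defect_detector(num_defect_classes):
    model = torchvision.models.detection.fasterrcnn_resnet50_fpn(weights="DEFAULT")
    num_classes = num_defect_classes + 1            # +1 for the background class
    in_features = model.roi_heads.box_predictor.cls_score.in_features
    model.roi_heads.box_predictor = FastRCNNPredictor(in_features, num_classes)
    return model

# Hypothetical defect taxonomy, e.g. crack, spalling, stain, peeling, exposure.
model = build_defect_detector(num_defect_classes=5)
```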


2020 · Vol 10 (14) · pp. 4744
Author(s): Hyukzae Lee, Jonghee Kim, Chanho Jung, Yongchan Park, Woong Park, ...

The arena fragmentation test (AFT) is one of the tests used to design an effective warhead. Conventionally, complex and expensive measuring equipment is used to test a warhead and measure important factors such as the size, velocity, and spatial distribution of fragments where they penetrate steel target plates. In this paper, instead of using specific sensors and equipment, we propose a deep-learning-based object detection algorithm to detect fragments in the AFT. To this end, we acquired many high-speed videos and built an AFT image dataset with bounding boxes of warhead fragments. Our method fine-tunes an existing object detection network, the Faster R-CNN (region-based convolutional neural network), on this dataset, with modified anchor boxes. We also employ a temporal filtering method, previously demonstrated as an effective non-fragment filtering scheme in our earlier image-processing-based fragment detection approach, to capture only the first penetrating fragments among all detected fragments. We show that the performance of the proposed method is comparable to that of a sensor-based system under the same experimental conditions, and a quantitative comparison shows that the use of deep learning significantly enhances performance: the proposed method outperforms our earlier image-processing-based method and produces outstanding results in finding exact fragment positions.
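A sketch of the temporal filtering idea described above: across consecutive high-speed frames, a detection is kept only if no fragment was already recorded near the same position in an earlier frame, so only first-penetrating fragments survive. The matching radius and the centre-distance rule are assumptions, not the authors' exact scheme.

```python
import math

def filter_first_penetrations(frames, radius=20.0):
    """frames: list of per-frame detections, each a list of (x1, y1, x2, y2) boxes."""
    seen_centres = []      # centres of fragments already counted
    first_hits = []        # (frame_index, box) of first penetrations
    for t, boxes in enumerate(frames):
        for box in boxes:
            cx = (box[0] + box[2]) / 2.0
            cy = (box[1] + box[3]) / 2.0
            # Keep the detection only if it is far from every earlier fragment.
            if all(math.hypot(cx - px, cy - py) > radius
                   for px, py in seen_centres):
                first_hits.append((t, box))
                seen_centres.append((cx, cy))
    return first_hits
```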

