Convolutional neural network and its pretrained models for image classification and object detection: A survey

In the last couple of years, advancements in the deep learning, especially in convolutional neural networks, proved to be a boon for the image classification and recognition tasks. One of the important practical applications of object detection and image classification can be for security enhancement. If dangerous objects or scenes can be identified automatically, then a lot of accidents can be prevented. For this purpose, in this paper we made use of state-of-the-art implementation of Faster Region-based Convolutional Neural Network (Faster R-CNN) based on the monitoring video of hoisting sites to train a model to detect the dangerous object and the worker. By extracting the locations of them, object-human interactions during hoisting, mainly for changes in their spatial location relationship, can be understood whereby estimating whether the scene is safe or dangerous. Experimental results showed that the pre-trained model achieved good performance with a high mean average precision of 97.66% on object detection and the proposed method fulfilled the goal of dangerous scenes recognition perfectly.

Download Full-text

Machine Learning for Detection of Palm Oil Leaf Disease Visually using Convolutional Neural Network Algorithm

JOURNAL OF INFORMATICS AND TELECOMMUNICATION ENGINEERING ◽

10.31289/jite.v4i2.4185 ◽

2021 ◽

Vol 4 (2) ◽

pp. 286-293

Author(s):

Asrianda Asrianda ◽

Hafizh Al Kautsar Aidilof ◽

Yoga Pangestu

Keyword(s):

Neural Network ◽

Artificial Intelligence ◽

Machine Learning ◽

Computer Vision ◽

Object Detection ◽

Convolutional Neural Network ◽

Image Classification ◽

Palm Oil ◽

Leaf Disease ◽

Neural Network Algorithm

Artificial intelligence (AI) merupakan bidang ilmu pengetahuan yang saat ini menjadi isu yang menarik dan masih diteliti secara luas. Salah satu cabang dari pengembangan AI adalah computer vision yang di dalamnya terdapat topik pembahasan image classification dan object detection. Machine learning dapat dimanfaatkan di dalam bidang computer vision untuk melakukan object detection dan image classification, yaitu dengan menggunakan algoritma Convolutional Neural Network (CNN). CNN banyak digunakan pada penelitian terdahulu karena akurasinya yang tinggi. Pada penelitian ini, CNN digunakan untuk mendeteksi jenis penyakit daun tanaman kelapa sawit, dengan dataset sebanyak 60 gambar, dimana 50 diantaranya merupakan daun dengan 5 jenis penyakit berbeda, yaitu Curvularia sp, Cochliobolus carbonus, Capnodium sp, Drecshlera, dan defisiensi unsur hara. Sedangkan 10 sisanya merupakan gambar daun sehat. Hasilnya, CNN dapat mendeteksi penyakit daun kelapa sawit dengan akurasi yang dihasilkan mencapai 99%.

Download Full-text

Image Classification and Object Detection Algorithm Based on Convolutional Neural Network

Science Insights ◽

10.15354/si.19.re117 ◽

2019 ◽

Vol 31 (1) ◽

pp. 85-100

Author(s):

Juan K. Leonard ◽

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Image Classification ◽

Detection Algorithm

Download Full-text

Random Erasing Data Augmentation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.7000 ◽

2020 ◽

Vol 34 (07) ◽

pp. 13001-13008 ◽

Cited By ~ 32

Author(s):

Zhun Zhong ◽

Liang Zheng ◽

Guoliang Kang ◽

Shaozi Li ◽

Yi Yang

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Image Classification ◽

Data Augmentation ◽

Parameter Learning ◽

Identification Code ◽

Training Images ◽

Augmentation Techniques

In this paper, we introduce Random Erasing, a new data augmentation method for training the convolutional neural network (CNN). In training, Random Erasing randomly selects a rectangle region in an image and erases its pixels with random values. In this process, training images with various levels of occlusion are generated, which reduces the risk of over-fitting and makes the model robust to occlusion. Random Erasing is parameter learning free, easy to implement, and can be integrated with most of the CNN-based recognition models. Albeit simple, Random Erasing is complementary to commonly used data augmentation techniques such as random cropping and flipping, and yields consistent improvement over strong baselines in image classification, object detection and person re-identification. Code is available at: https://github.com/zhunzhong07/Random-Erasing.

Download Full-text

Fused Random Pooling in Convolutional Neural Network for Herbal Plants Image Classification

International Journal of Advanced Trends in Computer Science and Engineering ◽

10.30534/ijatcse/2019/87862019 ◽

2019 ◽

Vol 8 (6) ◽

pp. 3208-3214

Author(s):

Ian Val P. Delos Reyes ◽

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

Herbal Plants

Download Full-text

Improving Convolutional Neural Network (CNN) Architecture (miniVGGNet) with Batch Normalization and Learning Rate Decay Factor for Image Classification

International Journal of Integrated Engineering ◽

10.30880/ijie.2019.11.04.006 ◽

2019 ◽

Vol 11 (4) ◽

Author(s):

Asmida Ismail ◽

◽

Siti Anom Ahmad ◽

Azura Che Soh ◽

Khair Hassan ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

Learning Rate ◽

Decay Factor ◽

Batch Normalization ◽

Rate Decay

Download Full-text

Image Classification Method Based on Supplement Convolutional Neural Network

Journal of Computer-Aided Design & Computer Graphics ◽

10.3724/sp.j.1089.2018.16322 ◽

2018 ◽

Vol 30 (3) ◽

pp. 385 ◽

Cited By ~ 3

Author(s):

Qiang Wang ◽

Xiaojie Li ◽

Jun Chen

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

Classification Method

Download Full-text

Research on Optimization of Object Detection Technology Based on Convolutional Neural Network

2020 13th International Symposium on Computational Intelligence and Design (ISCID) ◽

10.1109/iscid51228.2020.00010 ◽

2020 ◽

Author(s):

Yang Xue ◽

Huang Wanjun ◽

Yu Hongyang

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Detection Technology

Download Full-text

Geometric property-based convolutional neural network for indoor object detection

International Journal of Advanced Robotic Systems ◽

10.1177/1729881421993323 ◽

2021 ◽

Vol 18 (1) ◽

pp. 172988142199332

Author(s):

Xintao Ding ◽

Boquan Li ◽

Jinbao Wang

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Geometric Property ◽

Ground Truth ◽

Geometric Constraints ◽

Depth Information ◽

Training Set ◽

Object Knowledge ◽

The Mean

Indoor object detection is a very demanding and important task for robot applications. Object knowledge, such as two-dimensional (2D) shape and depth information, may be helpful for detection. In this article, we focus on region-based convolutional neural network (CNN) detector and propose a geometric property-based Faster R-CNN method (GP-Faster) for indoor object detection. GP-Faster incorporates geometric property in Faster R-CNN to improve the detection performance. In detail, we first use mesh grids that are the intersections of direct and inverse proportion functions to generate appropriate anchors for indoor objects. After the anchors are regressed to the regions of interest produced by a region proposal network (RPN-RoIs), we then use 2D geometric constraints to refine the RPN-RoIs, in which the 2D constraint of every classification is a convex hull region enclosing the width and height coordinates of the ground-truth boxes on the training set. Comparison experiments are implemented on two indoor datasets SUN2012 and NYUv2. Since the depth information is available in NYUv2, we involve depth constraints in GP-Faster and propose 3D geometric property-based Faster R-CNN (DGP-Faster) on NYUv2. The experimental results show that both GP-Faster and DGP-Faster increase the performance of the mean average precision.

Download Full-text