Convolutional neural network and its pretrained models for image classification and object detection: A survey

Author(s):  
Biswajit Jena ◽  
Gopal Krishna Nayak ◽  
Sanjay Saxena
Author(s):  
Hongguo Su ◽  
Mingyuan Zhang ◽  
Shengyuan Li ◽  
Xuefeng Zhao

In the last couple of years, advancements in the deep learning, especially in convolutional neural networks, proved to be a boon for the image classification and recognition tasks. One of the important practical applications of object detection and image classification can be for security enhancement. If dangerous objects or scenes can be identified automatically, then a lot of accidents can be prevented. For this purpose, in this paper we made use of state-of-the-art implementation of Faster Region-based Convolutional Neural Network (Faster R-CNN) based on the monitoring video of hoisting sites to train a model to detect the dangerous object and the worker. By extracting the locations of them, object-human interactions during hoisting, mainly for changes in their spatial location relationship, can be understood whereby estimating whether the scene is safe or dangerous. Experimental results showed that the pre-trained model achieved good performance with a high mean average precision of 97.66% on object detection and the proposed method fulfilled the goal of dangerous scenes recognition perfectly.


2021 ◽  
Vol 4 (2) ◽  
pp. 286-293
Author(s):  
Asrianda Asrianda ◽  
Hafizh Al Kautsar Aidilof ◽  
Yoga Pangestu

Artificial intelligence (AI) merupakan bidang ilmu pengetahuan yang saat ini menjadi isu yang menarik dan masih diteliti secara luas. Salah satu cabang dari pengembangan AI adalah computer vision yang di dalamnya terdapat topik pembahasan image classification dan object detection. Machine learning dapat dimanfaatkan di dalam bidang computer vision untuk melakukan object detection dan image classification, yaitu dengan menggunakan algoritma Convolutional Neural Network (CNN). CNN banyak digunakan pada penelitian terdahulu karena akurasinya yang tinggi. Pada penelitian ini, CNN digunakan untuk mendeteksi jenis penyakit daun tanaman kelapa sawit, dengan dataset sebanyak 60 gambar, dimana 50 diantaranya merupakan daun dengan 5 jenis penyakit berbeda, yaitu Curvularia sp, Cochliobolus carbonus, Capnodium sp, Drecshlera, dan defisiensi unsur hara. Sedangkan 10 sisanya merupakan gambar daun sehat. Hasilnya, CNN dapat mendeteksi penyakit daun kelapa sawit dengan akurasi yang dihasilkan mencapai 99%.


2020 ◽  
Vol 34 (07) ◽  
pp. 13001-13008 ◽  
Author(s):  
Zhun Zhong ◽  
Liang Zheng ◽  
Guoliang Kang ◽  
Shaozi Li ◽  
Yi Yang

In this paper, we introduce Random Erasing, a new data augmentation method for training the convolutional neural network (CNN). In training, Random Erasing randomly selects a rectangle region in an image and erases its pixels with random values. In this process, training images with various levels of occlusion are generated, which reduces the risk of over-fitting and makes the model robust to occlusion. Random Erasing is parameter learning free, easy to implement, and can be integrated with most of the CNN-based recognition models. Albeit simple, Random Erasing is complementary to commonly used data augmentation techniques such as random cropping and flipping, and yields consistent improvement over strong baselines in image classification, object detection and person re-identification. Code is available at: https://github.com/zhunzhong07/Random-Erasing.


2021 ◽  
Vol 18 (1) ◽  
pp. 172988142199332
Author(s):  
Xintao Ding ◽  
Boquan Li ◽  
Jinbao Wang

Indoor object detection is a very demanding and important task for robot applications. Object knowledge, such as two-dimensional (2D) shape and depth information, may be helpful for detection. In this article, we focus on region-based convolutional neural network (CNN) detector and propose a geometric property-based Faster R-CNN method (GP-Faster) for indoor object detection. GP-Faster incorporates geometric property in Faster R-CNN to improve the detection performance. In detail, we first use mesh grids that are the intersections of direct and inverse proportion functions to generate appropriate anchors for indoor objects. After the anchors are regressed to the regions of interest produced by a region proposal network (RPN-RoIs), we then use 2D geometric constraints to refine the RPN-RoIs, in which the 2D constraint of every classification is a convex hull region enclosing the width and height coordinates of the ground-truth boxes on the training set. Comparison experiments are implemented on two indoor datasets SUN2012 and NYUv2. Since the depth information is available in NYUv2, we involve depth constraints in GP-Faster and propose 3D geometric property-based Faster R-CNN (DGP-Faster) on NYUv2. The experimental results show that both GP-Faster and DGP-Faster increase the performance of the mean average precision.


Sign in / Sign up

Export Citation Format

Share Document