Dangerous Scenes Recognition During Hoisting Based on Faster Region-Based Convolutional Neural Network

In recent years, advances in deep learning, especially in convolutional neural networks, have proved to be a boon for image classification and recognition tasks. One important practical application of object detection and image classification is security enhancement: if dangerous objects or scenes can be identified automatically, many accidents can be prevented. For this purpose, in this paper we use a state-of-the-art implementation of the Faster Region-based Convolutional Neural Network (Faster R-CNN), trained on monitoring video of hoisting sites, to detect the dangerous object and the worker. By extracting their locations, object-human interactions during hoisting, mainly changes in their spatial relationship, can be understood, and the scene can thereby be estimated as safe or dangerous. Experimental results showed that the pre-trained model achieved good performance, with a high mean average precision of 97.66% on object detection, and that the proposed method fulfilled the goal of dangerous scene recognition perfectly.
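
As a rough illustration of the spatial-relationship check described in this abstract, the sketch below flags a scene as dangerous when a detected worker box lies below the detected load and overlaps its horizontal footprint. The box format, the `margin` parameter, and the decision rule itself are assumptions made for illustration; the abstract does not state the rule the authors actually use.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Axis-aligned box in image pixels; y grows downward."""
    x1: float
    y1: float
    x2: float
    y2: float


def is_dangerous(load: Box, worker: Box, margin: float = 0.0) -> bool:
    """Hypothetical rule: the worker is below the load and overlaps
    the load's horizontal extent widened by `margin` on each side."""
    zone_left = load.x1 - margin
    zone_right = load.x2 + margin
    overlaps_horizontally = worker.x1 < zone_right and worker.x2 > zone_left
    worker_below_load = worker.y2 > load.y2
    return overlaps_horizontally and worker_below_load


load = Box(100, 50, 200, 120)
print(is_dangerous(load, Box(150, 200, 180, 300)))  # True: worker under the load
print(is_dangerous(load, Box(400, 200, 430, 300)))  # False: worker far to the side
```

In practice the boxes would come from the detector's output for each video frame, and the rule would be evaluated frame by frame to follow how the relationship changes.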

Robust place recognition based on salient landmarks screening and convolutional neural network features

International Journal of Advanced Robotic Systems ◽

10.1177/1729881420966966 ◽

2020 ◽

Vol 17 (6) ◽

pp. 172988142096696

Author(s):

Jie Niu ◽

Kun Qian

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

State Of The Art ◽

Environmental Modeling ◽

Superior Performance ◽

Similarity Measurement ◽

Natural Environments ◽

Place Recognition ◽

Average Precision ◽

Specific Object

In this work, we propose a robust place recognition method for natural environments based on salient landmark screening and convolutional neural network (CNN) features. First, the salient objects in the image are segmented as candidate landmarks. Then, a category screening network is designed to remove specific object types that are not suitable for environmental modeling. Finally, a three-layer CNN is used to extract highly representative features of the salient landmarks. For similarity measurement, a Siamese network is chosen to calculate the similarity between images. Experiments were conducted on three challenging benchmark place recognition datasets, and superior performance was achieved compared to other state-of-the-art methods, including FABMAP, SeqSLAM, SeqCNNSLAM, and PlaceCNN. Our method obtains the best results on the precision–recall curves, and the average precision reaches 78.43%, the best among the compared methods. This demonstrates that CNN features on the screened salient landmarks are robust against strong viewpoint and condition variations.

Investigation of optimal configurations of a convolutional neural network for the identification of objects in real-time

Information Technology and Nanotechnology ◽

10.18287/1613-0073-2019-2416-417-423 ◽

2019 ◽

pp. 417-423

Author(s):

M A Isayev ◽

D A Savelyev

Keyword(s):

Neural Network ◽

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Network ◽

Real Time ◽

State Of The Art ◽

Average Precision ◽

The Core ◽

Particular Solution ◽

Optimal Configurations

This paper compares different convolutional neural networks that form the core of the most current solutions in computer vision. The study benchmarks these state-of-the-art solutions by criteria such as mAP (mean average precision) and FPS (frames per second) to assess their suitability for real-time use. Conclusions are drawn on the best convolutional neural network model and on the deep learning methods used in each particular solution.
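
The two benchmark criteria named in this abstract can be sketched as follows. This is a minimal illustration only: the function names and the example numbers are hypothetical, and the per-class AP values are assumed to have been computed already by an evaluation tool.

```python
def mean_average_precision(ap_per_class):
    """mAP: the mean of per-class average precision values in [0, 1]."""
    if not ap_per_class:
        raise ValueError("need at least one class")
    return sum(ap_per_class.values()) / len(ap_per_class)


def frames_per_second(num_frames, elapsed_seconds):
    """FPS: frames processed divided by wall-clock seconds spent."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return num_frames / elapsed_seconds


# Made-up numbers for illustration, not results from the paper.
print(mean_average_precision({"car": 0.5, "person": 0.75}))  # 0.625
print(frames_per_second(300, 10.0))                          # 30.0
```

Reporting both values side by side makes the accuracy versus speed trade-off between models explicit.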

Image Classification for Vehicle Type Dataset Using State-of-the-art Convolutional Neural Network Architecture

Proceedings of the 2018 Artificial Intelligence and Cloud Computing Conference - AICCC '18 ◽

10.1145/3299819.3299822 ◽

2018 ◽

Author(s):

Yian Seo ◽

Kyung-shik Shin

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

Network Architecture ◽

State Of The Art ◽

Neural Network Architecture ◽

Vehicle Type

IoU-Adaptive Deformable R-CNN: Make Full Use of IoU for Multi-Class Object Detection in Remote Sensing Imagery

Remote Sensing ◽

10.3390/rs11030286 ◽

2019 ◽

Vol 11 (3) ◽

pp. 286 ◽

Cited By ~ 24

Author(s):

Jiangqiao Yan ◽

Hongqi Wang ◽

Menglong Yan ◽

Wenhui Diao ◽

Xian Sun ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Object Detection ◽

Convolutional Neural Network ◽

State Of The Art ◽

Ground Truth ◽

Detection Performance ◽

Candidate Region ◽

Detection Accuracy ◽

Remote Sensing Images

Recently, methods based on Faster region-based convolutional neural network (R-CNN) have been popular in multi-class object detection in remote sensing images due to their outstanding detection performance. These methods generally propose candidate regions of interest (ROIs) through a region proposal network (RPN), and the regions with high enough intersection-over-union (IoU) values against ground truth are treated as positive samples for training. In this paper, we find that the detection result of such methods is sensitive to the choice of IoU threshold. Specifically, detection performance on small objects is poor when choosing a normal, higher threshold, while a lower threshold results in poor localization accuracy caused by a large quantity of false positives. To address these issues, we propose a novel IoU-Adaptive Deformable R-CNN framework for multi-class object detection. Specifically, by analyzing the different roles that IoU can play in different parts of the network, we propose an IoU-guided detection framework to reduce the loss of small object information during training. In addition, an IoU-based weighted loss is designed, which can learn the IoU information of positive ROIs to improve the detection accuracy effectively. Finally, class aspect ratio constrained non-maximum suppression (CARC-NMS) is proposed, which further improves the precision of the results. Extensive experiments validate the effectiveness of our approach, and we achieve state-of-the-art detection performance on the DOTA dataset.
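
To make the threshold sensitivity described in this abstract concrete, here is a minimal sketch of IoU and of IoU-based positive sample labeling. The `(x1, y1, x2, y2)` box format, the function names, and the thresholds 0.5 and 0.3 are assumptions for illustration; this is the standard fixed-threshold scheme the paper starts from, not the proposed IoU-adaptive framework.

```python
def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def label_rois(rois, gt_boxes, threshold=0.5):
    """Mark an ROI positive if its best IoU with any ground-truth box
    reaches the threshold."""
    return [
        max((iou(roi, gt) for gt in gt_boxes), default=0.0) >= threshold
        for roi in rois
    ]


# A 10x10 ground-truth object and a proposal shifted by 3 pixels: IoU = 49/151.
small_gt = (0, 0, 10, 10)
shifted_roi = (3, 3, 13, 13)
print(label_rois([shifted_roi], [small_gt], threshold=0.5))  # [False]
print(label_rois([shifted_roi], [small_gt], threshold=0.3))  # [True]
```

A 3-pixel shift is enough to push this small object's proposal below a 0.5 threshold, which is the kind of effect that leaves small objects with few positive samples.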

A Multibranch Object Detection Method for Traffic Scenes

Computational Intelligence and Neuroscience ◽

10.1155/2019/3679203 ◽

2019 ◽

Vol 2019 ◽

pp. 1-16

Author(s):

Jiangfan Feng ◽

Fanjie Wang ◽

Siqin Feng ◽

Yongrong Peng

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Detection Method ◽

State Of The Art ◽

Recall Rate ◽

Small Scale ◽

Feature Maps ◽

Time Requirements ◽

Speed Up

Convolutional neural network (CNN)-based object detection has achieved remarkable success. However, existing CNN-based algorithms struggle to detect small-scale objects, because such objects may lose their response once the feature map reaches a certain depth, and the scale of objects in traffic images and videos (such as cars, buses, and pedestrians) commonly varies greatly. In this paper, we present a 32-layer multibranch convolutional neural network named MBNet for fast detection of objects in traffic scenes. Our model uses three detection branches, in which feature maps of size 16 × 16, 32 × 32, and 64 × 64 are used, respectively, to optimize detection for large-, medium-, and small-scale objects. By means of a multitask loss function, our model can be trained end-to-end. The experimental results show that our model achieves state-of-the-art performance in terms of precision and recall rate, and that its detection speed (up to 33 fps) is fast enough to meet the real-time requirements of industry.
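
The relationship between the three grid sizes and object scale can be sketched as follows. The 512-pixel input size and the helper names are assumptions for illustration, since the abstract gives only the feature-map sizes; the sketch shows why a finer grid suits smaller objects, not how MBNet itself is built.

```python
# Grid sizes per branch as stated in the abstract.
BRANCHES = {"large": 16, "medium": 32, "small": 64}


def cell_size(input_size, grid_size):
    """Input pixels covered by one feature-map cell along each axis."""
    return input_size / grid_size


def cell_index(x, y, input_size, grid_size):
    """(col, row) of the grid cell that contains the image point (x, y)."""
    stride = cell_size(input_size, grid_size)
    col = min(int(x // stride), grid_size - 1)
    row = min(int(y // stride), grid_size - 1)
    return col, row


INPUT_SIZE = 512  # assumed; not given in the abstract
for scale, grid in BRANCHES.items():
    print(scale, cell_size(INPUT_SIZE, grid))
print(cell_index(100, 300, INPUT_SIZE, BRANCHES["small"]))  # (12, 37)
```

Under this assumption each cell of the 64 × 64 map covers 8 × 8 input pixels, against 32 × 32 pixels for the 16 × 16 map, so the finer map keeps more spatial detail for small objects.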

Research on abnormal object detection in specific region based on Mask R-CNN

International Journal of Advanced Robotic Systems ◽

10.1177/1729881420925287 ◽

2020 ◽

Vol 17 (3) ◽

pp. 172988142092528

Author(s):

Haitao Xiong ◽

Jiaqing Wu ◽

Qing Liu ◽

Yuanyuan Cai

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

State Of The Art ◽

Specific Region ◽

Logistics Management ◽

Original Image ◽

Network Method ◽

Exceptional Object ◽

Instance Segmentation

As an information carrier with rich semantics, the image plays an increasingly important role in real-time monitoring for logistics management. Abnormal objects are typically closely related to a specific region. Detecting abnormal objects in the specific region helps improve the accuracy of detection and analysis, thereby improving the level of logistics management. Motivated by these observations, we design a method based on the Mask region-based convolutional neural network (Mask R-CNN) called Abnormal Object Detection in Specific Region. In this method, the initial instance segmentation model is obtained by the traditional Mask R-CNN method; then the region overlap of the specific region is calculated and the overlap ratio of each instance is determined, and these two pieces of information are fused to predict the exceptional object. Finally, the abnormal object is restored and detected in the original image. Experimental results demonstrate that the proposed Abnormal Object Detection in Specific Region method can effectively identify abnormal objects in a specific region and significantly outperforms state-of-the-art methods.
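
The per-instance overlap ratio mentioned in this abstract can be sketched as below. Masks are represented as sets of `(row, col)` pixels for simplicity; the helper names, the `min_ratio` cutoff, and the idea of filtering instances by that cutoff are assumptions, because the abstract does not describe how the overlap information is fused.

```python
def rect_pixels(r0, c0, r1, c1):
    """Pixels of a rectangle covering rows r0..r1-1 and cols c0..c1-1."""
    return {(r, c) for r in range(r0, r1) for c in range(c0, c1)}


def overlap_ratio(instance_mask, region_mask):
    """Fraction of the instance's pixels that fall inside the region."""
    if not instance_mask:
        return 0.0
    return len(instance_mask & region_mask) / len(instance_mask)


def instances_in_region(instances, region_mask, min_ratio=0.5):
    """Names of instances whose overlap ratio reaches min_ratio."""
    return [
        name for name, mask in instances.items()
        if overlap_ratio(mask, region_mask) >= min_ratio
    ]


region = rect_pixels(0, 0, 10, 10)
instances = {
    "box_a": rect_pixels(2, 2, 6, 6),    # fully inside the region
    "box_b": rect_pixels(8, 8, 12, 12),  # only a corner inside
}
print(overlap_ratio(instances["box_a"], region))  # 1.0
print(overlap_ratio(instances["box_b"], region))  # 0.25
print(instances_in_region(instances, region))     # ['box_a']
```

With real segmentation output the masks would be boolean arrays rather than pixel sets, but the ratio is computed the same way.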

Machine Learning for Detection of Palm Oil Leaf Disease Visually using Convolutional Neural Network Algorithm

JOURNAL OF INFORMATICS AND TELECOMMUNICATION ENGINEERING ◽

10.31289/jite.v4i2.4185 ◽

2021 ◽

Vol 4 (2) ◽

pp. 286-293

Author(s):

Asrianda Asrianda ◽

Hafizh Al Kautsar Aidilof ◽

Yoga Pangestu

Keyword(s):

Neural Network ◽

Artificial Intelligence ◽

Machine Learning ◽

Computer Vision ◽

Object Detection ◽

Convolutional Neural Network ◽

Image Classification ◽

Palm Oil ◽

Leaf Disease ◽

Neural Network Algorithm

Artificial intelligence (AI) is a field of science that is currently a topic of great interest and is still widely researched. One branch of AI development is computer vision, which includes the topics of image classification and object detection. Machine learning can be used in computer vision to perform object detection and image classification, namely by using the Convolutional Neural Network (CNN) algorithm. CNN has been widely used in previous research because of its high accuracy. In this study, CNN is used to detect types of oil palm leaf disease, with a dataset of 60 images, 50 of which show leaves with 5 different types of disease, namely Curvularia sp, Cochliobolus carbonus, Capnodium sp, Drecshlera, and nutrient deficiency. The remaining 10 are images of healthy leaves. As a result, CNN can detect oil palm leaf disease with an accuracy reaching 99%.

Convolutional neural network and its pretrained models for image classification and object detection: A survey

Concurrency and Computation Practice and Experience ◽

10.1002/cpe.6767 ◽

2021 ◽

Author(s):

Biswajit Jena ◽

Gopal Krishna Nayak ◽

Sanjay Saxena

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Image Classification

Image Classification and Object Detection Algorithm Based on Convolutional Neural Network

Science Insights ◽

10.15354/si.19.re117 ◽

2019 ◽

Vol 31 (1) ◽

pp. 85-100

Author(s):

Juan K. Leonard

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Image Classification ◽

Detection Algorithm

Random Erasing Data Augmentation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.7000 ◽

2020 ◽

Vol 34 (07) ◽

pp. 13001-13008 ◽

Cited By ~ 32

Author(s):

Zhun Zhong ◽

Liang Zheng ◽

Guoliang Kang ◽

Shaozi Li ◽

Yi Yang

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Image Classification ◽

Data Augmentation ◽

Parameter Learning ◽

Identification Code ◽

Training Images ◽

Augmentation Techniques

In this paper, we introduce Random Erasing, a new data augmentation method for training the convolutional neural network (CNN). In training, Random Erasing randomly selects a rectangle region in an image and erases its pixels with random values. In this process, training images with various levels of occlusion are generated, which reduces the risk of over-fitting and makes the model robust to occlusion. Random Erasing is parameter learning free, easy to implement, and can be integrated with most of the CNN-based recognition models. Albeit simple, Random Erasing is complementary to commonly used data augmentation techniques such as random cropping and flipping, and yields consistent improvement over strong baselines in image classification, object detection and person re-identification. Code is available at: https://github.com/zhunzhong07/Random-Erasing.
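
The procedure described in this abstract can be sketched in a few lines. This is a minimal grayscale version on nested lists, not the authors' released code: the parameter names, their default values, the retry loop, and the pixel range 0 to 255 are assumptions chosen for illustration.

```python
import math
import random


def random_erase(image, p=0.5, area_range=(0.02, 0.4), min_aspect=0.3,
                 max_tries=100, rng=random):
    """Return a copy of `image` (a list of rows of pixel values) in which,
    with probability p, one random rectangle is filled with random values."""
    height, width = len(image), len(image[0])
    out = [row[:] for row in image]
    if rng.random() >= p:
        return out  # leave the image unchanged this time
    for _ in range(max_tries):
        # Sample the rectangle's area fraction and aspect ratio (assumed ranges).
        target_area = rng.uniform(*area_range) * height * width
        aspect = rng.uniform(min_aspect, 1 / min_aspect)
        h = int(round(math.sqrt(target_area * aspect)))
        w = int(round(math.sqrt(target_area / aspect)))
        if 0 < h <= height and 0 < w <= width:
            top = rng.randint(0, height - h)
            left = rng.randint(0, width - w)
            for r in range(top, top + h):
                for c in range(left, left + w):
                    out[r][c] = rng.randint(0, 255)
            return out
    return out  # no rectangle fitted within max_tries


image = [[0] * 32 for _ in range(32)]
augmented = random_erase(image, p=1.0, rng=random.Random(0))
print(len(augmented), len(augmented[0]))  # 32 32
```

Because the function only rewrites pixel values, it can be applied alongside random cropping and flipping in the same augmentation pipeline, which matches the abstract's point that the techniques are complementary.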