Object Identification and Localization Using Grad-CAM++ with Mask Regional Convolution Neural Network

Electronics ◽  
2021 ◽  
Vol 10 (13) ◽  
pp. 1541
Author(s):  
Xavier Alphonse Inbaraj ◽  
Charlyn Villavicencio ◽  
Julio Jerison Macrohon ◽  
Jyh-Horng Jeng ◽  
Jer-Guang Hsieh

A fundamental step toward deploying object detectors in real-time applications is improving object recognition under obstruction, obscurity, and noise in images. In addition, object detection is a challenging task because it requires objects to be detected correctly in images. Semantic segmentation and localization are important modules for recognizing an object in an image. Grad-CAM++ is the object localization method most commonly used by researchers; it uses the gradients at a convolution layer to build a localization map of the important regions in the image. This paper proposes a method called Combined Grad-CAM++ with the Mask Regional Convolution Neural Network (GC-MRCNN) to both detect and localize objects in an image. The major advantage of the proposed method is that it outperforms all counterpart methods in the domain and can also be used in unsupervised environments. The proposed detector based on GC-MRCNN provides a robust and feasible ability to detect and classify the objects present, along with their shapes, in real time. The proposed method performs effectively and efficiently on a wide range of images and provides a higher-resolution visual representation than existing methods (Grad-CAM, Grad-CAM++), as shown by comparing various algorithms.
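The abstract describes building a localization map from the gradients at a convolution layer. A minimal sketch of that idea follows. It uses the simpler Grad-CAM weighting (the spatial mean of each channel's gradient), not the Grad-CAM++ weighting or the authors' GC-MRCNN pipeline, and the function name and nested-list inputs are assumptions made for illustration.

```python
def grad_cam_map(activations, gradients):
    """Sketch of a Grad-CAM style localization map (not Grad-CAM++).

    activations, gradients: lists of K channels, each an H x W nested
    list of floats, taken from the same convolution layer.
    """
    height = len(activations[0])
    width = len(activations[0][0])

    # Channel weight = spatial mean of that channel's gradient.
    weights = [
        sum(sum(row) for row in grad) / (height * width)
        for grad in gradients
    ]

    # Weighted sum of the activation maps over channels.
    cam = [[0.0] * width for _ in range(height)]
    for weight, act in zip(weights, activations):
        for i in range(height):
            for j in range(width):
                cam[i][j] += weight * act[i][j]

    # ReLU keeps only regions with positive evidence for the class.
    return [[max(0.0, value) for value in row] for row in cam]


# Toy input: two 2x2 channels with hand-picked gradients.
toy_activations = [[[1.0, 2.0], [3.0, 4.0]], [[1.0, 1.0], [1.0, 1.0]]]
toy_gradients = [[[1.0, 1.0], [1.0, 1.0]], [[-2.0, -2.0], [-2.0, -2.0]]]
toy_map = grad_cam_map(toy_activations, toy_gradients)
```

In practice the activations and gradients would come from a trained network, and the map would be upsampled to the input image size before being overlaid.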

Author(s):  
Bontle Goitsemedi ◽  
Kedibonye Keletso

Improving object detection under occlusion, blur, and noise is a critical step toward deploying detectors in real-time applications. Since it is impossible to exhaust all image defects through data collection, many researchers seek to generate hard samples during training. The generated hard samples are either images or feature maps with coarse patches dropped out in the spatial dimensions. Significant overheads are required to train on the extra hard samples and to estimate the drop-out patches using additional network branches. In this paper we propose object localization and detection based on Grad-CAM++ with the Mask Regional Convolution Neural Network (Mask R-CNN). The major advantage of using Mask R-CNN is that it outperforms all counterpart methods in the domain and can also be used in unsupervised environments. The proposed detector based on Grad-CAM++ with Mask R-CNN provides a robust and feasible ability to detect and classify the objects present, along with their shapes, in real time. The proposed method performs effectively and efficiently on a wide range of images and provides a higher-resolution visual representation.


2021 ◽  
Vol 13 (12) ◽  
pp. 307
Author(s):  
Vijayakumar Varadarajan ◽  
Dweepna Garg ◽  
Ketan Kotecha

Deep learning is a relatively new branch of machine learning in which computers are taught to recognize patterns in massive volumes of data. It primarily describes learning at various levels of representation, which aids in understanding data that includes text, voice, and visuals. Convolutional neural networks have been used to solve challenges in computer vision, including object identification, image classification, semantic segmentation, and more. Object detection in videos involves confirming the presence of the object in the image or video and then locating it accurately for recognition. In video, modelling techniques suffer from high computation and memory costs, which may reduce the accuracy and efficiency with which objects are identified in real time. Current object detection techniques based on deep convolutional neural networks require multilevel convolution and pooling operations on the entire image to extract deep semantic properties from it. Detection models can provide superior results for large objects; however, they fail to detect objects of varying size that have low resolution and are strongly affected by noise, because the features produced by the repeated convolution operations of existing models do not fully represent the essential characteristics of the objects in real time. Using multi-scale anchor boxes, the approach proposed in this paper enhances detection accuracy by extracting features of the object at multiple convolution levels. The major contribution of this paper is the design of a model that gives a better understanding of the parameters and hyper-parameters that affect the detection and recognition of objects of varying sizes and shapes, and that achieves real-time object detection and recognition speeds while improving accuracy. The proposed model achieves 84.49 mAP on the test set of the Pascal VOC-2007 dataset at 11 FPS, which is comparatively better than other real-time object detection models.
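The abstract mentions multi-scale anchor boxes without giving their configuration. As a rough sketch of the general idea, the snippet below generates anchors of several scales and aspect ratios around one feature-map location. The function name, the scales, and the ratios are hypothetical and are not taken from the paper.

```python
import math


def make_anchors(center_x, center_y, scales, ratios):
    """Return (x_min, y_min, x_max, y_max) anchors centred on one point.

    Each scale is the side length of the square anchor; each ratio is
    width / height. Area is kept at scale ** 2 for every ratio.
    """
    anchors = []
    for scale in scales:
        for ratio in ratios:
            width = scale * math.sqrt(ratio)
            height = scale / math.sqrt(ratio)
            anchors.append((
                center_x - width / 2,
                center_y - height / 2,
                center_x + width / 2,
                center_y + height / 2,
            ))
    return anchors


# Hypothetical settings: two scales and two aspect ratios.
example_anchors = make_anchors(50, 50, scales=[32, 64], ratios=[1.0, 4.0])
```

Smaller scales are typically paired with shallower, higher-resolution feature maps and larger scales with deeper ones, which is one common way to cover objects of varying size.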


2021 ◽  
Vol 1827 (1) ◽  
pp. 012130
Author(s):  
Qi Li ◽  
Yun Qing Liu ◽  
Yue Qi Peng ◽  
Cong Liu ◽  
Jun Shi ◽  
...  

2021 ◽  
pp. 1-10
Author(s):  
Lipeng Si ◽  
Baolong Liu ◽  
Yanfang Fu

The important strategic position of military UAVs and the wide application of civil UAVs in many fields mark the arrival of the era of unmanned aerial vehicles. In image research, the recognition and real-time tracking of specific objects in images is a technology that many scholars continue to study in depth and that still needs further work. Image recognition and real-time tracking technology has been widely used in UAV aerial photography. By analyzing convolutional neural network algorithms and comparing image recognition techniques, the convolutional neural network algorithm is improved to achieve better image recognition. In this paper, a target detection technique based on an improved Faster R-CNN is proposed. The algorithm model is implemented, and the classification accuracy is improved through optimization of the Faster R-CNN network. To address small-target detection errors and scale differences in aerial data sets, this paper designs the network structure of the RPN and an optimization scheme for the related algorithms. The structure of Faster R-CNN is adjusted by improving the embedding of the CNN and the OHEM algorithm, which improves the accuracy of small-target and multi-target detection overall. The experimental results show that the recognition accuracy of the proposed algorithm is significantly improved compared with LeNet-5. As the number of samples increases, the accuracy of the algorithm reaches 98.9%.
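The abstract names OHEM but does not describe how it is embedded in the network. Assuming OHEM refers to online hard example mining, where only the highest-loss examples in a batch are kept for the backward pass, the selection step can be sketched as below. The function name and the toy loss values are hypothetical.

```python
def select_hard_examples(losses, keep):
    """Return the indices of the `keep` highest-loss examples, in index order."""
    # Rank example indices from highest loss to lowest.
    ranked = sorted(range(len(losses)), key=lambda i: losses[i], reverse=True)
    # Keep the hardest ones; sort so the result follows the original order.
    return sorted(ranked[:keep])


# Toy per-region losses for one batch.
toy_losses = [0.1, 2.0, 0.5, 1.5]
hard_indices = select_hard_examples(toy_losses, keep=2)
```

Only the selected regions would then contribute to the gradient update, which focuses training on examples the detector currently gets wrong.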


2021 ◽  
Author(s):  
Yiming Lou ◽  
Zelin Hu ◽  
Miao Li ◽  
Hualong Li ◽  
Xuanjiang Yang ◽  
...  
