Insulator detection and recognition of explosion based on convolutional neural networks

Author(s):  
Bin Yan ◽  
Qing Chen ◽  
Run Ye ◽  
Xiaojia Zhou

Unmanned aerial vehicles (UAVs) equipped with high-definition (HD) cameras can capture a large number of detailed inspection images. The insulator is an indispensable component of transmission lines. Detecting insulators in images and video quickly and accurately provides a reliable basis for ranging and obstacle-avoidance flight when a UAV operates close to towers and transmission lines. At the same time, insulators are prone to multiple kinds of faults, which pose a serious threat to the safety of the power grid, so computer technology should be fully utilized to diagnose them. Insulators are detected in images with complex aerial backgrounds by constructing a convolutional neural network (CNN) with the classic architecture of five convolution-and-pooling modules followed by two fully connected modules. In this paper, we propose a recognition algorithm for explosion faults based on saliency detection, which uses the trained network model to extract features. We then feed the saliency maps into a self-organizing feature map (SOM) network and build the mathematical model via superpixel segmentation, contour detection and other image processing methods. Tests show that the algorithm can reduce the errors that manual analysis may introduce. They also demonstrate that insulator detection and explosion fault recognition can effectively improve inspection efficiency and the level of intelligence.

2021 ◽  
Author(s):  
Yusuf Saber

In this work, three novel approaches to detecting visual attention in images are presented. The idea behind detecting areas within images or video that naturally attract a viewer’s attention is based on the concept of generating pre-attentive saliency maps. Saliency, in and of itself, relates to some measure of “conspicuity” in the visual field and is believed to be an important precursor for many tasks in computer vision. One of the proposed methods in this thesis detects salient regions, while the other two detect salient edges. The classical approach to saliency detection proposed by Itti is extended by introducing wavelets as a lossless resizing tool while maintaining the aspect of biological inspiration. In addition, the spectral residual method and the frequency-tuned method are modified using wavelets to allow for salient edge detection. Tests show that the proposed methods yield results that are not only comparable to leading, cutting-edge methods, but also exceed them in terms of correct and complete object detection as well as noise reduction.


2013 ◽  
Vol 765-767 ◽  
pp. 1401-1405
Author(s):  
Chi Zhang ◽  
Wei Qiang Wang

Object-level saliency detection is an important branch of visual saliency. In this paper, we propose a novel method that performs object-level saliency detection in both images and videos in a unified way. We measure saliency with a more effective spatial compactness assumption instead of the popular contrast assumption. In addition, we present a combination framework that integrates multiple saliency maps generated from different feature maps. The proposed algorithm automatically selects high-quality saliency maps according to a quality evaluation score that we define. The experimental results demonstrate that the proposed method outperforms all state-of-the-art methods on datasets of both still images and video sequences.
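The spatial compactness assumption can be illustrated with a minimal sketch: a group of pixels that share a feature and sit close together in the image is scored as more salient than a group spread across the frame. The function name `compactness_saliency`, the per-pixel cluster labels and the `1 / (1 + variance)` scoring are all illustrative assumptions, not the formulation used in the paper.

```python
from collections import defaultdict


def compactness_saliency(labels):
    """Score each label by how spatially compact its pixels are.

    labels: 2D list of cluster ids, one per pixel (a hypothetical input,
    e.g. the result of color quantization).
    Returns a dict {label: score}, with scores in (0, 1].
    """
    coords = defaultdict(list)
    for y, row in enumerate(labels):
        for x, lab in enumerate(row):
            coords[lab].append((x, y))

    scores = {}
    for lab, pts in coords.items():
        n = len(pts)
        mean_x = sum(p[0] for p in pts) / n
        mean_y = sum(p[1] for p in pts) / n
        # Spatial variance: mean squared distance to the cluster centroid.
        var = sum((x - mean_x) ** 2 + (y - mean_y) ** 2 for x, y in pts) / n
        # Assumed mapping: smaller spatial spread -> higher saliency.
        scores[lab] = 1.0 / (1.0 + var)
    return scores


# Label 1 forms a tight 2x2 blob; label 0 is the surrounding background.
labels = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 0, 0, 0, 0],
]
scores = compactness_saliency(labels)
```

In this toy input the compact blob receives a higher score than the background, which is the behavior the compactness assumption is meant to capture.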


Author(s):  
Urvish Trivedi ◽  
Jonielle McDonnough ◽  
Muhaimen Shamsi ◽  
Andrez Izurieta Ochoa ◽  
Alec Braynen ◽  
...  

Detecting humans and objects while walking is a very difficult problem for people with visual impairment. To safely avoid collisions with objects or people and to navigate from one location to another, it is important to know how far away an obstacle is and what kind of obstacle the user is facing. In recent years, many studies have shown that different vibration stimuli can be very useful for conveying important information to the user. In this paper, we present our stereovision system with a high-definition camera, which detects and identifies humans and obstacles in real time, and compare it with a modified version of an existing wearable haptic belt that uses high-performance ultrasonic sensors. The aim of this paper is to demonstrate the practicability of a stereovision system over the cane and assistive technology such as the vibrotactile belt. The study is based on two assistive technologies. The first consists of the vibrotactile belt connected to ultrasonic sensors and an accelerometer, which returns user movement and speed information to the microcontroller. The microcontroller initiates expressive vibrotactile stimulation based on the sensor data. Data gathered from this technology will be used as the baseline for comparison with our stereovision system. Second, we present a novel approach that detects the type of obstacle using an object recognition algorithm and determines the best way to avoid it using the stereovision feedback. Data gathered from this technology will be compared against the baseline data from the vibrotactile belt. In addition, we present the results of the comparative study, which show that the stereovision system has many advantages over the vibrotactile belt.


2018 ◽  
Vol 61 (5) ◽  
pp. 1461-1474 ◽  
Author(s):  
Zhongqi Lin ◽  
Shaomin Mu ◽  
Aiju Shi ◽  
Chao Pang ◽  
Xiaoxiao Sun

Abstract. Traditional methods for detecting maize leaf diseases (such as leaf blight, sooty blotch, brown spot, rust, and purple leaf sheaf) are typically labor-intensive and strongly subjective. With the aim of achieving high accuracy and efficiency in the identification of maize leaf diseases from digital imagery, this article proposes a novel multichannel convolutional neural network (MCNN). The MCNN is composed of an input layer, five convolutional layers, three subsampling layers, three fully connected layers, and an output layer. Using a method that imitates human visual behavior in video saliency detection, the first and second subsampling layers are connected directly with the first fully connected layer. In addition, the mixed modes of pooling and normalization methods, rectified linear units (ReLU), and dropout are introduced to prevent overfitting and gradient diffusion. The learning process corresponding to the network structure is also illustrated. At present, there are no large-scale images of maize leaf disease for use as experimental samples. To test the proposed MCNN, 10,820 RGB images containing five types of disease were collected from maize planting areas in Shandong Province, China. The original images could not be used directly in identification experiments because of noise and irrelevant regions. They were therefore denoised and segmented by homomorphic filtering and region of interest (ROI) segmentation to construct a standard database. A series of experiments on 8 GB graphics processing units (GPUs) showed that the MCNN could achieve an average accuracy of 92.31% and a high efficiency in the identification of maize leaf diseases. The multichannel design and the integration of different innovations proved to be helpful methods for boosting performance. Keywords: Artificial intelligence, Convolutional neural network, Deep learning, Image classification, Machine learning algorithms, Maize leaf disease.


Entropy ◽  
2019 ◽  
Vol 21 (2) ◽  
pp. 165 ◽  
Author(s):  
Xiantao Jiang ◽  
Tian Song ◽  
Daqi Zhu ◽  
Takafumi Katayama ◽  
Lu Wang

Perceptual video coding (PVC) can provide a lower bitrate at the same visual quality compared with traditional H.265/high efficiency video coding (HEVC). In this work, a novel H.265/HEVC-compliant PVC framework is proposed based on a video saliency model. Firstly, an effective and efficient spatiotemporal saliency model is used to generate a video saliency map. Secondly, a perceptual coding scheme is developed based on the saliency map, in which a saliency-based quantization control algorithm is proposed to reduce the bitrate. Finally, the simulation results demonstrate that the proposed perceptual coding scheme is superior in objective and subjective tests, achieving up to a 9.46% bitrate reduction with negligible subjective and objective quality loss. The advantage of the proposed method is that it maintains high quality in high-definition video applications.
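As a rough illustration of saliency-based quantization control, the sketch below lowers the quantization parameter (QP) for salient blocks and raises it for non-salient ones, so that bits are spent where viewers are likely to look. The function name `saliency_to_qp`, the linear mapping and the `max_offset` parameter are assumptions made for this example; the abstract does not specify the actual control rule.

```python
def saliency_to_qp(base_qp, saliency, max_offset=4):
    """Map a block's saliency in [0, 1] to a quantization parameter.

    Salient blocks (saliency near 1) get a lower QP (finer quantization);
    non-salient blocks get a higher QP (coarser quantization, fewer bits).
    The linear mapping and max_offset are illustrative assumptions.
    """
    if not 0.0 <= saliency <= 1.0:
        raise ValueError("saliency must be in [0, 1]")
    # saliency 0 -> +max_offset, saliency 0.5 -> 0, saliency 1 -> -max_offset
    offset = round(max_offset * (1.0 - 2.0 * saliency))
    # Clamp to the 0..51 QP range used by HEVC for 8-bit video.
    return max(0, min(51, base_qp + offset))


background_qp = saliency_to_qp(32, 0.0)   # non-salient block
neutral_qp = saliency_to_qp(32, 0.5)      # average block
salient_qp = saliency_to_qp(32, 1.0)      # highly salient block
```

A real encoder would apply such an offset per coding unit and combine it with rate control; the sketch only shows the direction of the adjustment.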


Sensors ◽  
2020 ◽  
Vol 20 (2) ◽  
pp. 459
Author(s):  
Shaosheng Dai ◽  
Dongyang Li

To address incomplete saliency detection and unclear boundaries in infrared multi-target images with varying target sizes and a low signal-to-noise ratio against a sky background, this paper proposes a saliency detection method for multiple targets based on multi-saliency detection. The target areas of the infrared image are mainly bright and the background areas are dark. Using the multi-scale top-hat transformation, the image is first eroded and dilated to extract the difference between the bright and dark parts, and is then reconstructed to reduce the interference of blurred sky background noise. Next, the image obtained by the multi-scale top-hat transformation is transformed from the spatial domain to the frequency domain, and the spectral residual and the phase spectrum are extracted directly to obtain two kinds of image saliency maps by multi-scale Gaussian filtering reconstruction. In addition, quaternion features are extracted directly to transform the phase spectrum, and the phase spectrum is then reconstructed to obtain a third kind of image saliency map by Gaussian filtering. Finally, the three saliency maps are fused to complete the saliency detection of infrared images. Experimental analysis of infrared video frames, together with a comparative analysis of the Receiver Operating Characteristic (ROC) curve and the Area Under the Curve (AUC) index, shows that the infrared image saliency map generated by this method has clear target details and good background suppression, with an AUC above 99%. The method effectively improves multi-target saliency detection in infrared images against a sky background and benefits subsequent detection and tracking of image targets.
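The top-hat step can be sketched in a few lines: the white top-hat of an image is the image minus its morphological opening (erosion followed by dilation), which keeps small bright structures and suppresses the smooth background. The pure-Python helpers below, the flat square structuring element and the border handling (only in-bounds neighbors are considered) are assumptions for illustration and not the authors' implementation; a multi-scale variant would repeat the operation with several radii.

```python
def _morph(img, radius, pick):
    """Apply min (erosion) or max (dilation) over a square window."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            window = [
                img[j][i]
                for j in range(max(0, y - radius), min(h, y + radius + 1))
                for i in range(max(0, x - radius), min(w, x + radius + 1))
            ]
            out[y][x] = pick(window)
    return out


def erode(img, radius=1):
    return _morph(img, radius, min)


def dilate(img, radius=1):
    return _morph(img, radius, max)


def white_top_hat(img, radius=1):
    """Image minus its opening: keeps bright details smaller than the window."""
    opened = dilate(erode(img, radius), radius)
    return [[p - o for p, o in zip(row, orow)] for row, orow in zip(img, opened)]


# Dark background (10) with one small bright target (200) in the center.
image = [[10] * 5 for _ in range(5)]
image[2][2] = 200
result = white_top_hat(image, radius=1)
```

On this toy image the uniform background is removed and only the bright target remains in `result`.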


2020 ◽  
Vol 2020 ◽  
pp. 1-9 ◽  
Author(s):  
Liang Huang ◽  
Qiuzhi Peng ◽  
Xueqin Yu

To improve the change detection accuracy of multitemporal high spatial resolution remote-sensing (HSRRS) images, a change detection method based on saliency detection and spatial intuitionistic fuzzy C-means (SIFCM) clustering is proposed. Firstly, the cluster-based saliency cue method is used to obtain the saliency maps of the two temporal remote-sensing images; then, the saliency difference image is obtained by subtracting the two saliency maps; finally, the SIFCM clustering algorithm is used to classify the saliency difference image into changed and unchanged regions. Two data sets of multitemporal high spatial resolution remote-sensing images are selected as the experimental data. The detection accuracies of the proposed method are 96.17% and 97.89%. The results show that the proposed method is feasible and performs well for multitemporal remote-sensing image change detection.
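The pipeline of differencing two saliency maps and then clustering the result into two classes can be sketched as follows. This example deliberately swaps in a plain two-cluster (2-means) threshold on the difference values in place of SIFCM, and uses the absolute difference of the maps; both choices, and the function names, are simplifying assumptions rather than the method described above.

```python
def saliency_difference(map_a, map_b):
    """Pixel-wise absolute difference of two equally sized saliency maps."""
    return [[abs(a - b) for a, b in zip(ra, rb)] for ra, rb in zip(map_a, map_b)]


def two_means_threshold(values, iters=20):
    """Split 1D values into two clusters and return the midpoint threshold.

    A simple stand-in for SIFCM: no fuzzy memberships, no spatial term.
    """
    lo, hi = min(values), max(values)
    for _ in range(iters):
        t = (lo + hi) / 2.0
        low = [v for v in values if v <= t]
        high = [v for v in values if v > t]
        if not high:  # all values fall into one cluster
            break
        new_lo = sum(low) / len(low)
        new_hi = sum(high) / len(high)
        if new_lo == lo and new_hi == hi:  # centroids have converged
            break
        lo, hi = new_lo, new_hi
    return (lo + hi) / 2.0


# Toy saliency maps for two acquisition dates.
map_t1 = [[0.1, 0.1, 0.1], [0.1, 0.9, 0.1], [0.1, 0.1, 0.1]]
map_t2 = [[0.1, 0.1, 0.1], [0.1, 0.1, 0.1], [0.1, 0.1, 0.8]]

diff = saliency_difference(map_t1, map_t2)
threshold = two_means_threshold([v for row in diff for v in row])
change_mask = [[v > threshold for v in row] for row in diff]
```

Here `change_mask` marks the two pixels whose saliency differs between the dates; the remaining pixels are labeled as unchanged.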


Author(s):  
Liming Li ◽  
Xiaodong Chai ◽  
Shuguang Zhao ◽  
Shubin Zheng ◽  
Shengchao Su

This paper proposes an effective method to improve the performance of saliency detection via iterative bootstrap learning, which consists of two tasks: saliency optimization and saliency integration. First, multiscale segmentation and feature extraction are performed on the input image in succession. Second, prior saliency maps are generated using existing saliency models and are used to produce the initial saliency map. Third, the prior maps are fed together into the saliency regressor, where training samples are collected from the prior maps at multiple scales and a random forest regressor is learned from this training data. The initial saliency map and the output of the saliency regressor are integrated to generate the coarse saliency map. Finally, to further improve the quality of the saliency map, both the initial and coarse saliency maps are fed together into the saliency regressor, and the output of the regressor, the initial saliency map and the coarse saliency map are integrated into the final saliency map. Experimental results on three public data sets demonstrate that the proposed method consistently achieves the best performance and that significant improvement can be obtained when applying our method to existing saliency models.

