MTI-YOLO: A Light-Weight and Real-Time Deep Neural Network for Insulator Detection in Complex Aerial Images

Insulator detection is an essential task for the safety and reliable operation of intelligent grids. Owing to insulator images including various background interferences, most traditional image-processing methods cannot achieve good performance. Some You Only Look Once (YOLO) networks are employed to meet the requirements of actual applications for insulator detection. To achieve a good trade-off among accuracy, running time, and memory storage, this work proposes the modified YOLO-tiny for insulator (MTI-YOLO) network for insulator detection in complex aerial images. First of all, composite insulator images are collected in common scenes and the “CCIN_detection” (Chinese Composite INsulator) dataset is constructed. Secondly, to improve the detection accuracy of different sizes of insulator, multi-scale feature detection headers, a structure of multi-scale feature fusion, and the spatial pyramid pooling (SPP) model are adopted to the MTI-YOLO network. Finally, the proposed MTI-YOLO network and the compared networks are trained and tested on the “CCIN_detection” dataset. The average precision (AP) of our proposed network is 17% and 9% higher than YOLO-tiny and YOLO-v2. Compared with YOLO-tiny and YOLO-v2, the running time of the proposed network is slightly higher. Furthermore, the memory usage of the proposed network is 25.6% and 38.9% lower than YOLO-v2 and YOLO-v3, respectively. Experimental results and analysis validate that the proposed network achieves good performance in both complex backgrounds and bright illumination conditions.

Download Full-text

Attentional single-shot network with multi-scale feature fusion for object detection in aerial images

2020 Chinese Automation Congress (CAC) ◽

10.1109/cac51589.2020.9326692 ◽

2020 ◽

Author(s):

Yusheng Wang ◽

Hongzhang Wang ◽

Eryong Tang ◽

Ye Liu

Keyword(s):

Object Detection ◽

Feature Fusion ◽

Aerial Images ◽

Single Shot ◽

Scale Feature ◽

Multi Scale

Download Full-text

An Image Multi-scale Feature Recognition Method Based on Image Saliency

International Journal of Circuits, Systems and Signal Processing ◽

10.46300/9106.2021.15.32 ◽

2021 ◽

Vol 15 ◽

pp. 280-287

Author(s):

Chengzhi Yang

Keyword(s):

Image Recognition ◽

Feature Recognition ◽

Feature Fusion ◽

Image Feature ◽

Detection Accuracy ◽

Recognition Method ◽

Salient Region Detection ◽

Adaptive Combination ◽

Scale Feature ◽

Multi Scale

Image recognition refers to the technology which processes, analyzes and understands images with computer so as to recognize various targets and objects of different patterns. To effectively combine image recognition and intelligent algorithm can enhance the efficiency of image feature analysis, improve the detection accuracy and guarantee real-time detection. In image feature recognition, the following problems exist: the description of accurate object features, object blockage, complex and changeable scenes. Whether these problems can be effectively solved has great significance in improving the stability and robustness of object recognition algorithm. This paper takes image salience as the fundamental framework, and makes in-depth study of the problems of effective object appearance description, multi-feature fusion and multi-feature adaptive combination. Then it proposes an image multi-scale feature recognition method based on image salience and it can better locate the saliency object in the image, and more evenly highlight the salient object and significantly suppress background noises. The experiment results prove that salient region detection algorithm can better stress the entire salient image.

Download Full-text

A Single Shot Framework with Multi-Scale Feature Fusion for Geospatial Object Detection

Remote Sensing ◽

10.3390/rs11050594 ◽

2019 ◽

Vol 11 (5) ◽

pp. 594 ◽

Cited By ~ 11

Author(s):

Shuo Zhuang ◽

Ping Wang ◽

Boran Jiang ◽

Gang Wang ◽

Cong Wang

Keyword(s):

Remote Sensing ◽

Object Detection ◽

Large Scale ◽

Feature Fusion ◽

Aerial Images ◽

Detection Methods ◽

Single Shot ◽

Feature Maps ◽

Scale Feature ◽

Multi Scale

With the rapid advances in remote-sensing technologies and the larger number of satellite images, fast and effective object detection plays an important role in understanding and analyzing image information, which could be further applied to civilian and military fields. Recently object detection methods with region-based convolutional neural network have shown excellent performance. However, these two-stage methods contain region proposal generation and object detection procedures, resulting in low computation speed. Because of the expensive manual costs, the quantity of well-annotated aerial images is scarce, which also limits the progress of geospatial object detection in remote sensing. In this paper, on the one hand, we construct and release a large-scale remote-sensing dataset for geospatial object detection (RSD-GOD) that consists of 5 different categories with 18,187 annotated images and 40,990 instances. On the other hand, we design a single shot detection framework with multi-scale feature fusion. The feature maps from different layers are fused together through the up-sampling and concatenation blocks to predict the detection results. High-level features with semantic information and low-level features with fine details are fully explored for detection tasks, especially for small objects. Meanwhile, a soft non-maximum suppression strategy is put into practice to select the final detection results. Extensive experiments have been conducted on two datasets to evaluate the designed network. Results show that the proposed approach achieves a good detection performance and obtains the mean average precision value of 89.0% on a newly constructed RSD-GOD dataset and 83.8% on the Northwestern Polytechnical University very high spatial resolution-10 (NWPU VHR-10) dataset at 18 frames per second (FPS) on a NVIDIA GTX-1080Ti GPU.

Download Full-text

Controllably Deep Supervision and Multi-Scale Feature Fusion Network for Cloud and Snow Detection Based on Medium- and High-Resolution Imagery Dataset

Remote Sensing ◽

10.3390/rs13234805 ◽

2021 ◽

Vol 13 (23) ◽

pp. 4805

Author(s):

Guangbin Zhang ◽

Xianjun Gao ◽

Yuanwei Yang ◽

Mingwei Wang ◽

Shuhao Ran

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

High Resolution ◽

Feature Fusion ◽

Detection Accuracy ◽

Learning Methods ◽

Training Requirements ◽

Scale Feature ◽

Multi Scale ◽

Hinge Loss

Clouds and snow in remote sensing imageries cover underlying surface information, reducing image availability. Moreover, they interact with each other, decreasing the cloud and snow detection accuracy. In this study, we propose a convolutional neural network for cloud and snow detection, named the cloud and snow detection network (CSD-Net). It incorporates the multi-scale feature fusion module (MFF) and the controllably deep supervision and feature fusion structure (CDSFF). MFF can capture and aggregate features at various scales, ensuring that the extracted high-level semantic features of clouds and snow are more distinctive. CDSFF can provide a deeply supervised mechanism with hinge loss and combine information from adjacent layers to gain more representative features. It ensures the gradient flow is more oriented and error-less, while retaining more effective information. Additionally, a high-resolution cloud and snow dataset based on WorldView2 (CSWV) was created and released. This dataset meets the training requirements of deep learning methods for clouds and snow in high-resolution remote sensing images. Based on the datasets with varied resolutions, CSD-Net is compared to eight state-of-the-art deep learning methods. The experiment results indicate that CSD-Net has an excellent detection accuracy and efficiency. Specifically, the mean intersection over the union (MIoU) of CSD-Net is the highest in the corresponding experiment. Furthermore, the number of parameters in our proposed network is just 7.61 million, which is the lowest of the tested methods. It only has 88.06 GFLOPs of floating point operations, which is less than the U-Net, DeepLabV3+, PSPNet, SegNet-Modified, MSCFF, and GeoInfoNet. Meanwhile, CSWV has a higher annotation quality since the same method can obtain a greater accuracy on it.

Download Full-text

Edge Intelligent Perception Method for Power Grid Icing Condition Based on Multi-Scale Feature Fusion Target Detection and Model Quantization

Frontiers in Energy Research ◽

10.3389/fenrg.2021.754335 ◽

2021 ◽

Vol 9 ◽

Author(s):

Fuqi Ma ◽

Bo Wang ◽

Min Li ◽

Xuzhu Dong ◽

Yifan Mao ◽

...

Keyword(s):

Transmission Line ◽

Target Detection ◽

Power Transmission ◽

Feature Fusion ◽

Classical Method ◽

Power Transmission Line ◽

Stable Operation ◽

Detection Accuracy ◽

Scale Feature ◽

Multi Scale

Insulator is an important equipment of power transmission line. Insulator icing can seriously affect the stable operation of power transmission line. So insulator icing condition monitoring has great significance of the safety and stability of power system. Therefore, this paper proposes a lightweight intelligent recognition method of insulator icing thickness for front-end ice monitoring device. In this method, the residual network (ResNet) and feature pyramid network (FPN) are fused to construct a multi-scale feature extraction network framework, so that the shallow features and deep features are fused to reduce the information loss and improve the target detection accuracy. Then, the full convolution neural network (FCN) is used to classify and regress the iced insulator, so as to realize the high-precision identification of icing thickness. Finally, the proposed method is compressed by model quantization to reduce the size and parameters of the model for adapting the icing monitoring terminal with limited computing resources, and the performance of the method is verified and compared with other classical method on the edge intelligent chip.

Download Full-text

Bearing fault diagnosis based on DNN using multi-scale feature fusion

2020 35th Youth Academic Annual Conference of Chinese Association of Automation (YAC) ◽

10.1109/yac51587.2020.9337689 ◽

2020 ◽

Author(s):

Funa Zhou ◽

Zhiqiang Zhang ◽

Danmin Chen

Keyword(s):

Fault Diagnosis ◽

Feature Fusion ◽

Scale Feature ◽

Bearing Fault ◽

Bearing Fault Diagnosis ◽

Multi Scale

Download Full-text

Real-Time Robotic Grasp Detection with Multi-Scale Feature Fusion

2020 IEEE International Conference on Real-time Computing and Robotics (RCAR) ◽

10.1109/rcar49640.2020.9303319 ◽

2020 ◽

Author(s):

Hao Ma ◽

Ding Yuan ◽

Zhe Cao ◽

Jihao Yin

Keyword(s):

Real Time ◽

Feature Fusion ◽

Scale Feature ◽

Multi Scale ◽

Robotic Grasp

Download Full-text

Deep learning-based tool wear prediction and its application for machining process using multi-scale feature fusion and channel attention mechanism

Measurement ◽

10.1016/j.measurement.2021.109254 ◽

2021 ◽

Vol 177 ◽

pp. 109254

Author(s):

Xingwei Xu ◽

Jianwen Wang ◽

Bingfu Zhong ◽

Weiwei Ming ◽

Ming Chen

Keyword(s):

Deep Learning ◽

Tool Wear ◽

Feature Fusion ◽

Attention Mechanism ◽

Machining Process ◽

Wear Prediction ◽

Scale Feature ◽

Multi Scale ◽

Tool Wear Prediction

Download Full-text

Pyramid-attention based multi-scale feature fusion network for multispectral pan-sharpening

Applied Intelligence ◽

10.1007/s10489-021-02732-5 ◽

2021 ◽

Author(s):

Yang Chi ◽

Jinjiang Li ◽

Hui Fan

Keyword(s):

Feature Fusion ◽

Scale Feature ◽

Multi Scale

Download Full-text

Multi-Scale Feature Pyramid Network: A Heavily Occluded Pedestrian Detection Network Based on ResNet

Sensors ◽

10.3390/s21051820 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1820

Author(s):

Xiaotao Shao ◽

Qing Wang ◽

Wei Yang ◽

Yun Chen ◽

Yi Xie ◽

...

Keyword(s):

Semantic Information ◽

Detection System ◽

Pedestrian Detection ◽

Detection Accuracy ◽

The Public ◽

Scale Feature ◽

Detection Algorithms ◽

Multi Scale ◽

Art Works ◽

Feature Pyramid

The existing pedestrian detection algorithms cannot effectively extract features of heavily occluded targets which results in lower detection accuracy. To solve the heavy occlusion in crowds, we propose a multi-scale feature pyramid network based on ResNet (MFPN) to enhance the features of occluded targets and improve the detection accuracy. MFPN includes two modules, namely double feature pyramid network (FPN) integrated with ResNet (DFR) and repulsion loss of minimum (RLM). We propose the double FPN which improves the architecture to further enhance the semantic information and contours of occluded pedestrians, and provide a new way for feature extraction of occluded targets. The features extracted by our network can be more separated and clearer, especially those heavily occluded pedestrians. Repulsion loss is introduced to improve the loss function which can keep predicted boxes away from the ground truths of the unrelated targets. Experiments carried out on the public CrowdHuman dataset, we obtain 90.96% AP which yields the best performance, 5.16% AP gains compared to the FPN-ResNet50 baseline. Compared with the state-of-the-art works, the performance of the pedestrian detection system has been boosted with our method.

Download Full-text