Improved Damage Characteristics Identification Method of Concrete CT Images Based on Region Convolutional Neural Network

The detection of internal damage characteristics of concrete is an important aspect of damage evolution mechanism in concrete meso-structure. In this paper, the improved Faster R-CNN is used to detect the porosity and cracks in concrete CT images. Based on the Faster R-CNN, ResNet-101 and ResNet-50 are used as the main framework. Feature pyramid network (FPN) and ROI Align are introduced to improve the performance of the model. FPN can generate high quality feature maps. ROI Align solves the region mismatch caused by the quantization operation. Experiments show that the detection accuracy of ResNet-101[Formula: see text]+[Formula: see text]FPN[Formula: see text]+[Formula: see text]ROI Align reaches 87.08%, which is 4.74 higher than that of ResNet-101. The detection accuracy of ResNet-50 [Formula: see text]+[Formula: see text] FPN [Formula: see text]+[Formula: see text] ROI Align reached 81.36%, which is 3.12% points higher than ResNet-50. These two improved algorithms are slower than the original algorithm for the detection time of a single picture. An effective method is provided to analyze concrete meso-damage evolution through the research.

Download Full-text

GC-YOLOv3: You Only Look Once with Global Context Block

Electronics ◽

10.3390/electronics9081235 ◽

2020 ◽

Vol 9 (8) ◽

pp. 1235

Author(s):

Yang Yang ◽

Hongmin Deng

Keyword(s):

Object Detection ◽

Irrelevant Information ◽

Detection Algorithm ◽

Visual Object ◽

Detection Accuracy ◽

Feature Maps ◽

Average Precision ◽

Global Context ◽

Pascal Voc ◽

Feature Pyramid

In order to make the classification and regression of single-stage detectors more accurate, an object detection algorithm named Global Context You-Only-Look-Once v3 (GC-YOLOv3) is proposed based on the You-Only-Look-Once (YOLO) in this paper. Firstly, a better cascading model with learnable semantic fusion between a feature extraction network and a feature pyramid network is designed to improve detection accuracy using a global context block. Secondly, the information to be retained is screened by combining three different scaling feature maps together. Finally, a global self-attention mechanism is used to highlight the useful information of feature maps while suppressing irrelevant information. Experiments show that our GC-YOLOv3 reaches a maximum of 55.5 object detection mean Average Precision (mAP)@0.5 on Common Objects in Context (COCO) 2017 test-dev and that the mAP is 5.1% higher than that of the YOLOv3 algorithm on Pascal Visual Object Classes (PASCAL VOC) 2007 test set. Therefore, experiments indicate that the proposed GC-YOLOv3 model exhibits optimal performance on the PASCAL VOC and COCO datasets.

Download Full-text

Foreign Body Detection in the Electrified Area of Urban Rail Trains Using Improved Yolov3 Algorithm

Tobacco Regulatory Science ◽

10.18001/trs.7.5.23 ◽

2021 ◽

Vol 7 (5) ◽

pp. 1059-1066

Author(s):

Chensong Wang ◽

Wei Cui ◽

Xingguang Li ◽

Xinrou Liu

Keyword(s):

Foreign Body ◽

Network Model ◽

Feature Fusion ◽

Normal Operation ◽

Detection Accuracy ◽

Feature Maps ◽

Spatial Feature ◽

Foreign Objects ◽

Urban Rail ◽

Feature Pyramid

Foreign body invade the electric receiving area of urban rail train, interfere with the operation of electric equipment on the roof, and affect the normal operation of urban rail traffic. Aiming at the problems of the traditional non-contact foreign body detection in the electric area of urban rail train, such as slow detection speed and poor detection accuracy of small target foreign body, An improved YOLOV3 (You Only Look Once) network model based on PAN feature pyramid structure and adaptive spatial feature fusion is proposed. By improving the main body of the YOLOv3 network model, it can alleviate the problem that the network prediction size map is too large and the experience field is too small. The features of different levels of foreign objects are initially fused with PAN’s feature pyramid to extract strong location information and strong semantic information of the foreign objects, then the method of adaptive spatial feature fusion was used to learn the spatial weights of the fusion of feature maps at various scales, obtaining more effective prediction feature maps at different scales after fusion and improving the detection ability of small targets. The improved k-means clustering algorithm is used to obtain the size of anchor and match it to the corresponding feature layer, which can mark the position of foreign body more accurately. Experimental results show that the detection accuracy of the improved YOLOV3 reaches 95.7%, which is 5.1% higher than the detection effect of the original network. It can accurately and quickly identify the different size of intrusive foreign body in the electric area of the roof of the urban rail train.

Download Full-text

Ablation studies on YOLOFruit detection algorithm for fruit harvesting robot using deep learning

IOP Conference Series Earth and Environmental Science ◽

10.1088/1755-1315/922/1/012001 ◽

2021 ◽

Vol 922 (1) ◽

pp. 012001

Author(s):

O M Lawal ◽

Z Huamin ◽

Z Fan

Keyword(s):

Detection Efficiency ◽

Activation Function ◽

Detection Algorithm ◽

Detection Accuracy ◽

Detection Time ◽

Detection Algorithms ◽

Spatial Pyramid Pooling ◽

Feature Pyramid ◽

Harvesting Robot ◽

Spatial Pyramid

Abstract Fruit detection algorithm as an integral part of harvesting robot is expected to be robust, accurate, and fast against environmental factors such as occlusion by stem and leaves, uneven illumination, overlapping fruit and many more. For this reason, this paper explored and compared ablation studies on proposed YOLOFruit, YOLOv4, and YOLOv5 detection algorithms. The final selected YOLOFruit algorithm used ResNet43 backbone with Combined activation function for feature extraction, Spatial Pyramid Pooling Network (SPPNet) for detection accuracies, Feature Pyramid Network (FPN) for feature pyramids, Distance Intersection Over Union-Non Maximum Suppression (DIoU-NMS) for detection efficiency and accuracy, and Complete Intersection Over Union (CIoU) loss for faster and better performance. The obtained results showed that the average detection accuracy of YOLOFruit at 86.2% is 1% greater than YOLOv4 at 85.2% and 4.3% higher than YOLOv5 at 81.9%, while the detection time of YOLOFruit at 11.9ms is faster than YOLOv4 at 16.6ms, but not with YOLOv5 at 2.7ms. Hence, the YOLOFruit detection algorithm is highly prospective for better generalization and real-time fruit detection.

Download Full-text

COVID-19 pneumonia diagnosis using a simple 2D deep learning framework with a single chest CT image (Preprint)

10.2196/preprints.19407 ◽

2020 ◽

Author(s):

Jinseok Lee

Keyword(s):

Deep Learning ◽

Diagnostic Performance ◽

Ct Images ◽

Chest Ct ◽

University Hospital ◽

Detection Accuracy ◽

Ct Image ◽

Test Dataset ◽

Learning Framework ◽

Testing Dataset

BACKGROUND The coronavirus disease (COVID-19) has explosively spread worldwide since the beginning of 2020. According to a multinational consensus statement from the Fleischner Society, computed tomography (CT) can be used as a relevant screening tool owing to its higher sensitivity for detecting early pneumonic changes. However, physicians are extremely busy fighting COVID-19 in this era of worldwide crisis. Thus, it is crucial to accelerate the development of an artificial intelligence (AI) diagnostic tool to support physicians. OBJECTIVE We aimed to quickly develop an AI technique to diagnose COVID-19 pneumonia and differentiate it from non-COVID pneumonia and non-pneumonia diseases on CT. METHODS A simple 2D deep learning framework, named fast-track COVID-19 classification network (FCONet), was developed to diagnose COVID-19 pneumonia based on a single chest CT image. FCONet was developed by transfer learning, using one of the four state-of-art pre-trained deep learning models (VGG16, ResNet50, InceptionV3, or Xception) as a backbone. For training and testing of FCONet, we collected 3,993 chest CT images of patients with COVID-19 pneumonia, other pneumonia, and non-pneumonia diseases from Wonkwang University Hospital, Chonnam National University Hospital, and the Italian Society of Medical and Interventional Radiology public database. These CT images were split into a training and a testing set at a ratio of 8:2. For the test dataset, the diagnostic performance to diagnose COVID-19 pneumonia was compared among the four pre-trained FCONet models. In addition, we tested the FCONet models on an additional external testing dataset extracted from the embedded low-quality chest CT images of COVID-19 pneumonia in recently published papers. RESULTS Of the four pre-trained models of FCONet, the ResNet50 showed excellent diagnostic performance (sensitivity 99.58%, specificity 100%, and accuracy 99.87%) and outperformed the other three pre-trained models in testing dataset. In additional external test dataset using low-quality CT images, the detection accuracy of the ResNet50 model was the highest (96.97%), followed by Xception, InceptionV3, and VGG16 (90.71%, 89.38%, and 87.12%, respectively). CONCLUSIONS The FCONet, a simple 2D deep learning framework based on a single chest CT image, provides excellent diagnostic performance in detecting COVID-19 pneumonia. Based on our testing dataset, the ResNet50-based FCONet might be the best model, as it outperformed other FCONet models based on VGG16, Xception, and InceptionV3.

Download Full-text

A Research on Landslides Automatic Extraction Model Based on the Improved Mask R-CNN

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10030168 ◽

2021 ◽

Vol 10 (3) ◽

pp. 168

Author(s):

Peng Liu ◽

Yongming Wei ◽

Qinjun Wang ◽

Jingjing Xie ◽

Yu Chen ◽

...

Keyword(s):

Remote Sensing Data ◽

Extraction Methods ◽

New Method ◽

Detection Accuracy ◽

Sufficient Information ◽

Automatic Extraction ◽

Emergency Rescue ◽

Feature Pyramid ◽

High Level ◽

Extraction Model

Landslides are the most common and destructive secondary geological hazards caused by earthquakes. It is difficult to extract landslides automatically based on remote sensing data, which is import for the scenario of disaster emergency rescue. The literature review showed that the current landslides extraction methods mostly depend on expert interpretation which was low automation and thus was unable to provide sufficient information for earthquake rescue in time. To solve the above problem, an end-to-end improved Mask R-CNN model was proposed. The main innovations of this paper were (1) replacing the feature extraction layer with an effective ResNeXt module to extract the landslides. (2) Increasing the bottom-up channel in the feature pyramid network to make full use of low-level positioning and high-level semantic information. (3) Adding edge losses to the loss function to improve the accuracy of the landslide boundary detection accuracy. At the end of this paper, Jiuzhaigou County, Sichuan Province, was used as the study area to evaluate the new model. Results showed that the new method had a precision of 95.8%, a recall of 93.1%, and an overall accuracy (OA) of 94.7%. Compared with the traditional Mask R-CNN model, they have been significantly improved by 13.9%, 13.4%, and 9.9%, respectively. It was proved that the new method was effective in the landslides automatic extraction.

Download Full-text

Multiview deep learning based on tensor decomposition and its application in fault detection of overhead contact systems

The Visual Computer ◽

10.1007/s00371-021-02080-y ◽

2021 ◽

Author(s):

Xuewu Zhang ◽

Yansheng Gong ◽

Chen Qiao ◽

Wenfeng Jing

Keyword(s):

High Speed ◽

Tensor Decomposition ◽

Detection Methods ◽

Detection Accuracy ◽

Feature Maps ◽

Training Time ◽

Detection Model ◽

Railway Line ◽

Result Show ◽

Deep Layers

AbstractThis article mainly focuses on the most common types of high-speed railways malfunctions in overhead contact systems, namely, unstressed droppers, foreign-body invasions, and pole number-plate malfunctions, to establish a deep-network detection model. By fusing the feature maps of the shallow and deep layers in the pretraining network, global and local features of the malfunction area are combined to enhance the network's ability of identifying small objects. Further, in order to share the fully connected layers of the pretraining network and reduce the complexity of the model, Tucker tensor decomposition is used to extract features from the fused-feature map. The operation greatly reduces training time. Through the detection of images collected on the Lanxin railway line, experiments result show that the proposed multiview Faster R-CNN based on tensor decomposition had lower miss probability and higher detection accuracy for the three types faults. Compared with object-detection methods YOLOv3, SSD, and the original Faster R-CNN, the average miss probability of the improved Faster R-CNN model in this paper is decreased by 37.83%, 51.27%, and 43.79%, respectively, and average detection accuracy is increased by 3.6%, 9.75%, and 5.9%, respectively.

Download Full-text

Bi-directional skip connection feature pyramid network and Sub-pixel convolution for high-quality object detection

Neurocomputing ◽

10.1016/j.neucom.2021.01.021 ◽

2021 ◽

Author(s):

Shuqi Xiong ◽

Xiaohong Wu ◽

Honggang Chen ◽

Linbo Qing ◽

Tong Chen ◽

...

Keyword(s):

Object Detection ◽

High Quality ◽

Feature Pyramid

Download Full-text

Improved YOLO Network for Free-Angle Remote Sensing Target Detection

Remote Sensing ◽

10.3390/rs13112171 ◽

2021 ◽

Vol 13 (11) ◽

pp. 2171

Author(s):

Yuhao Qing ◽

Wenyi Liu ◽

Liuyan Feng ◽

Wanjia Gao

Keyword(s):

Remote Sensing ◽

Feature Extraction ◽

Target Detection ◽

Multiple Scales ◽

Classification Problem ◽

Input Image ◽

Detection Accuracy ◽

Feature Maps ◽

Regression Problem ◽

Public Datasets

Despite significant progress in object detection tasks, remote sensing image target detection is still challenging owing to complex backgrounds, large differences in target sizes, and uneven distribution of rotating objects. In this study, we consider model accuracy, inference speed, and detection of objects at any angle. We also propose a RepVGG-YOLO network using an improved RepVGG model as the backbone feature extraction network, which performs the initial feature extraction from the input image and considers network training accuracy and inference speed. We use an improved feature pyramid network (FPN) and path aggregation network (PANet) to reprocess feature output by the backbone network. The FPN and PANet module integrates feature maps of different layers, combines context information on multiple scales, accumulates multiple features, and strengthens feature information extraction. Finally, to maximize the detection accuracy of objects of all sizes, we use four target detection scales at the network output to enhance feature extraction from small remote sensing target pixels. To solve the angle problem of any object, we improved the loss function for classification using circular smooth label technology, turning the angle regression problem into a classification problem, and increasing the detection accuracy of objects at any angle. We conducted experiments on two public datasets, DOTA and HRSC2016. Our results show the proposed method performs better than previous methods.

Download Full-text

Improved SSD-assisted algorithm for surface defect detection of electromagnetic luminescence

Proceedings of the Institution of Mechanical Engineers Part O Journal of Risk and Reliability ◽

10.1177/1748006x21995388 ◽

2021 ◽

pp. 1748006X2199538

Author(s):

Zhenying Xu ◽

Ziqian Wu ◽

Wei Fan

Keyword(s):

Defect Detection ◽

Feature Fusion ◽

Recognition Rate ◽

Detection Methods ◽

Small Scale ◽

Detection Accuracy ◽

Single Shot ◽

Surface Defect Detection ◽

Feature Pyramid ◽

Small Feature

Defect detection of electromagnetic luminescence (EL) cells is the core step in the production and preparation of solar cell modules to ensure conversion efficiency and long service life of batteries. However, due to the lack of feature extraction capability for small feature defects, the traditional single shot multibox detector (SSD) algorithm performs not well in EL defect detection with high accuracy. Consequently, an improved SSD algorithm with modification in feature fusion in the framework of deep learning is proposed to improve the recognition rate of EL multi-class defects. A dataset containing images with four different types of defects through rotation, denoising, and binarization is established for the EL. The proposed algorithm can greatly improve the detection accuracy of the small-scale defect with the idea of feature pyramid networks. An experimental study on the detection of the EL defects shows the effectiveness of the proposed algorithm. Moreover, a comparison study shows the proposed method outperforms other traditional detection methods, such as the SIFT, Faster R-CNN, and YOLOv3, in detecting the EL defect.

Download Full-text

Multi-Scale Feature Pyramid Network: A Heavily Occluded Pedestrian Detection Network Based on ResNet

Sensors ◽

10.3390/s21051820 ◽

2021 ◽

Vol 21 (5) ◽

pp. 1820

Author(s):

Xiaotao Shao ◽

Qing Wang ◽

Wei Yang ◽

Yun Chen ◽

Yi Xie ◽

...

Keyword(s):

Semantic Information ◽

Detection System ◽

Pedestrian Detection ◽

Detection Accuracy ◽

The Public ◽

Scale Feature ◽

Detection Algorithms ◽

Multi Scale ◽

Art Works ◽

Feature Pyramid

The existing pedestrian detection algorithms cannot effectively extract features of heavily occluded targets which results in lower detection accuracy. To solve the heavy occlusion in crowds, we propose a multi-scale feature pyramid network based on ResNet (MFPN) to enhance the features of occluded targets and improve the detection accuracy. MFPN includes two modules, namely double feature pyramid network (FPN) integrated with ResNet (DFR) and repulsion loss of minimum (RLM). We propose the double FPN which improves the architecture to further enhance the semantic information and contours of occluded pedestrians, and provide a new way for feature extraction of occluded targets. The features extracted by our network can be more separated and clearer, especially those heavily occluded pedestrians. Repulsion loss is introduced to improve the loss function which can keep predicted boxes away from the ground truths of the unrelated targets. Experiments carried out on the public CrowdHuman dataset, we obtain 90.96% AP which yields the best performance, 5.16% AP gains compared to the FPN-ResNet50 baseline. Compared with the state-of-the-art works, the performance of the pedestrian detection system has been boosted with our method.

Download Full-text