Semantic Segmentation of Remote Sensing Image Based on Convolutional Neural Network and Mask Generation

Mathematical Problems in Engineering ◽

10.1155/2021/2472726 ◽

2021 ◽

Vol 2021 ◽

pp. 1-13

Author(s):

Binglin Niu

Keyword(s):

Neural Network ◽

Remote Sensing ◽

High Resolution ◽

Convolutional Neural Network ◽

Semantic Segmentation ◽

Layer By Layer ◽

Foreground Object ◽

Remote Sensing Images ◽

Training Time ◽

High Level

High-resolution remote sensing images usually contain complex semantic information and easily confused targets, so their semantic segmentation is an important and challenging task. To address the inadequate use of multilayer features by existing methods, a semantic segmentation method for remote sensing images based on a convolutional neural network and mask generation is proposed. In this method, the bounding box is used as the initial foreground segmentation contour, and the edge information of the foreground object is obtained from the multilayer features of the convolutional neural network. To obtain a rough object segmentation mask, the general shape and position of the foreground object are estimated from the high-level features during layer-by-layer iteration. The rough mask is then updated layer by layer using the network's features to obtain a more accurate mask. To ease the difficulty of training deep neural networks and the degradation problem that appears after convergence, a framework based on residual learning is adopted, which simplifies the training of very deep networks and improves their accuracy. For comparison with other advanced algorithms, the proposed algorithm was tested on the Potsdam and Vaihingen datasets. Experimental results show that, compared with the other algorithms, the proposed algorithm improves the overall precision of semantic segmentation of high-resolution remote sensing images and shortens the overall training and segmentation times.
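The abstract does not describe the residual architecture in detail. As a minimal sketch of the general idea behind residual learning, and assuming a small two-layer fully connected branch (the names `residual_block`, `w1`, and `w2` are illustrative and not taken from the paper), a block outputs its input plus a learned correction:

```python
def relu(v):
    """Element-wise rectified linear unit on a list of floats."""
    return [max(0.0, x) for x in v]


def matvec(w, v):
    """Multiply matrix w (a list of rows) by vector v."""
    return [sum(wij * vj for wij, vj in zip(row, v)) for row in w]


def residual_block(x, w1, w2):
    """Return relu(x + F(x)) with F(x) = w2 @ relu(w1 @ x).

    The skip connection adds the input back to the branch output, so the
    branch only has to learn a correction to the identity mapping.
    This layout is an assumption for illustration, not the paper's network.
    """
    branch = matvec(w2, relu(matvec(w1, x)))
    return relu([xi + bi for xi, bi in zip(x, branch)])


# With an all-zero branch, the block passes non-negative inputs through unchanged.
zeros = [[0.0, 0.0], [0.0, 0.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]
x = [1.0, 2.0]
y_zero = residual_block(x, zeros, zeros)
y_identity = residual_block(x, identity, identity)
```

Because a block with a zero branch already behaves like the identity, stacking more blocks need not make the network worse, which is the usual argument for why residual connections ease the training of very deep networks.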

Semantic Segmentation of Remote Sensing Images Using Transfer Learning and Deep Convolutional Neural Network With Dense Connection

IEEE Access ◽

10.1109/access.2020.3003914 ◽

2020 ◽

Vol 8 ◽

pp. 116744-116755 ◽

Cited By ~ 1

Author(s):

Binge Cui ◽

Xin Chen ◽

Yan Lu

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Convolutional Neural Network ◽

Transfer Learning ◽

Semantic Segmentation ◽

Deep Convolutional Neural Network ◽

Remote Sensing Images

A multi-level context-guided classification method with object-based convolutional neural network for land cover classification using very high resolution remote sensing images

International Journal of Applied Earth Observation and Geoinformation ◽

10.1016/j.jag.2020.102086 ◽

2020 ◽

Vol 88 ◽

pp. 102086 ◽

Cited By ~ 8

Author(s):

Chenxiao Zhang ◽

Peng Yue ◽

Deodato Tapete ◽

Boyi Shangguan ◽

Mi Wang ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

High Resolution ◽

Land Cover ◽

Convolutional Neural Network ◽

Land Cover Classification ◽

Remote Sensing Images ◽

Object Based ◽

Multi Level ◽

Very High

A Multi-Scale Water Extraction Convolutional Neural Network (MWEN) Method for GaoFen-1 Remote Sensing Images

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9040189 ◽

2020 ◽

Vol 9 (4) ◽

pp. 189 ◽

Cited By ~ 3

Author(s):

Hongxiang Guo ◽

Guojin He ◽

Wei Jiang ◽

Ranyu Yin ◽

Lei Yan ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Neural Networks ◽

Convolutional Neural Network ◽

Water Body ◽

Semantic Segmentation ◽

Water Bodies ◽

Water Extraction ◽

Remote Sensing Images ◽

Multi Scale

Automatic water body extraction is important for monitoring floods, droughts, and water resources. In this study, a new semantic segmentation convolutional neural network, the multi-scale water extraction convolutional neural network (MWEN), is proposed to automatically extract water bodies from GaoFen-1 (GF-1) remote sensing images. Three convolutional neural networks for semantic segmentation (the fully convolutional network (FCN), Unet, and Deeplab V3+) serve as baselines for the water body extraction performance of MWEN. Visual comparison and five evaluation metrics are used to evaluate the performance of these convolutional neural networks (CNNs). The results show the following. (1) According to the evaluation metrics, MWEN extracts water bodies better than the comparison methods across multiple scenes. (2) MWEN can accurately extract various types of water bodies, such as urban water bodies, open ponds, and plateau lakes. (3) By fusing features extracted at different scales, MWEN can extract water bodies of different sizes and suppress noise such as building shadows and highways. Therefore, MWEN is a robust water extraction algorithm for GaoFen-1 satellite images and has the potential to support water body mapping with multisource high-resolution satellite remote sensing data.

High-Resolution Boundary Refined Convolutional Neural Network for Automatic Agricultural Greenhouses Extraction from GaoFen-2 Satellite Imageries

Remote Sensing ◽

10.3390/rs13214237 ◽

2021 ◽

Vol 13 (21) ◽

pp. 4237

Author(s):

Xiaoping Zhang ◽

Bo Cheng ◽

Jinfen Chen ◽

Chenbin Liang

Keyword(s):

Neural Network ◽

Remote Sensing ◽

High Resolution ◽

Convolutional Neural Network ◽

Morphological Characteristics ◽

Scientific Management ◽

Spatial Gradient ◽

Remote Sensing Images ◽

Feature Maps ◽

Precise Identification

Agricultural greenhouses (AGs) are an important component of modern facility agriculture, and accurately mapping and dynamically monitoring their distribution are necessary for agricultural scientific management and planning. Semantic segmentation can be adopted for AG extraction from remote sensing images. However, the feature maps obtained by traditional deep convolutional neural network (DCNN)-based segmentation algorithms blur spatial details, and insufficient attention is usually paid to contextual representation. Meanwhile, preserving the original morphological characteristics, especially the boundaries, remains a challenge for precise identification of AGs. To alleviate these problems, this paper proposes a novel network called the high-resolution boundary refined network (HBRNet). In this method, we design a new multi-path backbone based on HRNetV2 that aims to preserve high spatial resolution and improve feature extraction capability, and in which the Pyramid Cross Channel Attention (PCCA) module is embedded in the residual blocks to strengthen the interaction of multiscale information. Moreover, the Spatial Enhancement (SE) module is employed to integrate contextual information at different scales. In addition, we introduce the Spatial Gradient Variation (SGV) unit in the Boundary Refined (BR) module to couple the segmentation task and the boundary learning task, so that they can share latent high-level semantics and interact with each other, and we combine this with a joint loss to refine the boundary. In our study, GaoFen-2 remote sensing images of Shouguang City, Shandong Province, China are used to build the AG dataset. The experimental results show that HBRNet significantly improves segmentation performance, reaching an IoU score of 94.89%, which suggests that this approach has advantages and potential for precise identification of AGs.
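For readers unfamiliar with the IoU (intersection over union) score reported above, the following minimal sketch computes it for two binary masks. The function name `iou` and the toy masks are illustrative assumptions and are not data from the paper.

```python
def iou(pred, target):
    """Intersection over union of two binary masks given as nested lists of 0/1."""
    inter = 0
    union = 0
    for prow, trow in zip(pred, target):
        for p, t in zip(prow, trow):
            inter += 1 if (p and t) else 0   # pixel is foreground in both masks
            union += 1 if (p or t) else 0    # pixel is foreground in either mask
    # Convention assumed here: two empty masks count as a perfect match.
    return 1.0 if union == 0 else inter / union


# Toy 2x3 masks: 2 pixels overlap, 4 pixels are foreground in at least one mask.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
score = iou(pred, target)
```

An IoU of 1.0 means the predicted and reference masks coincide exactly, and 0.0 means they do not overlap at all.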

Deep convolutional neural network based large-scale oil palm tree detection for high-resolution remote sensing images

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) ◽

10.1109/igarss.2017.8127085 ◽

2017 ◽

Cited By ~ 2

Author(s):

Weijia Li ◽

Haohuan Fu ◽

Le Yu

Keyword(s):

Neural Network ◽

Remote Sensing ◽

High Resolution ◽

Convolutional Neural Network ◽

Oil Palm ◽

Large Scale ◽

Deep Convolutional Neural Network ◽

Palm Tree ◽

Remote Sensing Images ◽

Tree Detection

Water Identification from High-Resolution Remote Sensing Images Based on Multidimensional Densely Connected Convolutional Neural Networks

Remote Sensing ◽

10.3390/rs12050795 ◽

2020 ◽

Vol 12 (5) ◽

pp. 795 ◽

Cited By ~ 2

Author(s):

Guojie Wang ◽

Mengjuan Wu ◽

Xikun Wei ◽

Huihui Song

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Neural Networks ◽

Convolutional Neural Network ◽

Convolutional Neural Networks ◽

Lake Area ◽

Remote Sensing Images ◽

Index Method ◽

Training Time ◽

Water Index

The accurate acquisition of water information from remote sensing images has become important for water resource monitoring and protection and for flood disaster assessment. However, the water indices traditionally used for water body identification have significant limitations. In this study, we propose a deep convolutional neural network (CNN), based on the multidimensional densely connected convolutional neural network (DenseNet), for identifying water in the Poyang Lake area. The results from DenseNet were compared with those of four classical CNNs (ResNet, VGG, SegNet, and DeepLab v3+) and with the Normalized Difference Water Index (NDWI). The results indicate that the CNNs are superior to the water index method. Among the five CNNs, only DeepLab v3+ requires less training time to converge than the proposed DenseNet. Identification accuracy is evaluated with several error metrics. In terms of the precision of the identification results, DenseNet performs much better than the other CNNs and the NDWI method, and NDWI is by far the poorest. The results also suggest that DenseNet is much better than the other CNNs at distinguishing water from clouds and mountain shadows.
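The abstract does not state which NDWI formulation or threshold was used as the baseline. As a minimal sketch, assuming the common green and near-infrared formulation NDWI = (Green - NIR) / (Green + NIR) and a threshold of 0 (both assumptions, as are the function names and the toy reflectance values), a water index classifier looks like this:

```python
def ndwi(green, nir, eps=1e-12):
    """Per-pixel NDWI = (green - nir) / (green + nir) for two equally sized bands.

    eps guards against division by zero where both bands are 0.
    """
    return [
        [(g - n) / (g + n + eps) for g, n in zip(grow, nrow)]
        for grow, nrow in zip(green, nir)
    ]


def water_mask(index, threshold=0.0):
    """Label a pixel as water (1) when its index exceeds the threshold."""
    return [[1 if v > threshold else 0 for v in row] for row in index]


# Toy reflectances: water is brighter in green than in NIR, land the opposite.
green = [[0.30, 0.10],
         [0.25, 0.05]]
nir = [[0.10, 0.40],
       [0.05, 0.30]]
mask = water_mask(ndwi(green, nir))
```

A single global threshold of this kind relies only on per-pixel band ratios, which is one reason an index method can confuse water with dark surfaces such as the clouds and mountain shadows mentioned above.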

Convolutional Neural Network for Building Extraction from High-Resolution Remote Sensing Images

2020 International Conference on Machine Vision and Image Processing (MVIP) ◽

10.1109/mvip49855.2020.9187483 ◽

2020 ◽

Author(s):

Hamidreza Hosseinpoor ◽

Farhad Samadzadegan

Keyword(s):

Neural Network ◽

Remote Sensing ◽

High Resolution ◽

Convolutional Neural Network ◽

Building Extraction ◽

Remote Sensing Images

Combining Deep Semantic Segmentation Network and Graph Convolutional Neural Network for Semantic Segmentation of Remote Sensing Imagery

Remote Sensing ◽

10.3390/rs13010119 ◽

2020 ◽

Vol 13 (1) ◽

pp. 119

Author(s):

Song Ouyang ◽

Yansheng Li

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Convolutional Neural Network ◽

Spatial Information ◽

Spatial Relationship ◽

Semantic Segmentation ◽

Extraction Ability ◽

Feature Maps ◽

High Level ◽

Graph Nodes

Although the deep semantic segmentation network (DSSN) has been widely used in remote sensing (RS) image semantic segmentation, it still does not fully exploit the spatial relationship cues between objects when extracting deep visual features through convolutional filters and pooling layers. In fact, the spatial distributions of objects from different classes are strongly correlated. For example, buildings tend to be close to roads. In view of the strong appearance extraction ability of the DSSN and the powerful topological relationship modeling capability of the graph convolutional neural network (GCN), a DSSN-GCN framework that combines the advantages of both is proposed in this paper for RS image semantic segmentation. To improve the appearance extraction ability, this paper proposes a new DSSN called the attention residual U-shaped network (AttResUNet), which uses residual blocks to encode feature maps and an attention module to refine the features. For the GCN, a graph is built in which the nodes are superpixels and the edge weights are calculated from the spectral and spatial information of the nodes. The AttResUNet is trained to extract the high-level features that initialize the graph nodes. The GCN then combines these features with the spatial relationships between nodes to perform classification. Notably, the use of spatial relationship knowledge boosts the performance and robustness of the classification module. In addition, because the GCN is modeled at the superpixel level, object boundaries are restored to a certain extent and there is less pixel-level noise in the final classification result. Extensive experiments on two publicly available datasets show that the DSSN-GCN model outperforms the competitive baseline (i.e., the DSSN model) and that DSSN-GCN with AttResUNet achieves the best performance, which demonstrates the advantage of our method.
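The abstract does not give the exact edge-weight formula or the GCN layer definition. The sketch below is one plausible reading, under two loudly stated assumptions: edge weights are a Gaussian kernel over spectral and spatial distance, and propagation uses the widely used symmetric normalization D^-1/2 (A + I) D^-1/2 H without learned weights. All function names, parameters, and toy values are hypothetical.

```python
import math


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def build_adjacency(spectra, centroids, sigma_spec=1.0, sigma_pos=1.0):
    """Edge weights between superpixels from spectral and spatial distance.

    Assumed form: exp(-spectral_dist / sigma_spec^2 - spatial_dist / sigma_pos^2).
    """
    n = len(spectra)
    adj = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                adj[i][j] = math.exp(
                    -sq_dist(spectra[i], spectra[j]) / sigma_spec ** 2
                    - sq_dist(centroids[i], centroids[j]) / sigma_pos ** 2
                )
    return adj


def gcn_propagate(adj, feats):
    """One propagation step D^-1/2 (A + I) D^-1/2 H (no learned weights)."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    deg = [sum(row) for row in a_hat]
    dim = len(feats[0])
    out = []
    for i in range(n):
        row = [0.0] * dim
        for j in range(n):
            coeff = a_hat[i][j] / math.sqrt(deg[i] * deg[j])
            for k in range(dim):
                row[k] += coeff * feats[j][k]
        out.append(row)
    return out


# Three toy superpixels: 0 and 1 are similar neighbours, 2 is distant and different.
spectra = [[0.2, 0.3], [0.2, 0.3], [0.9, 0.1]]
centroids = [[0.0, 0.0], [1.0, 0.0], [5.0, 5.0]]
adj = build_adjacency(spectra, centroids)
smoothed = gcn_propagate(adj, [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]])
```

In this sketch the features of superpixels 0 and 1 are mixed strongly because their edge weight is large, while superpixel 2 is left almost unchanged, which illustrates how graph propagation can inject spatial relationship cues into per-node features.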

Research on Semantic Segmentation of High-resolution Remote Sensing Image Based on Full Convolutional Neural Network

2018 12th International Symposium on Antennas, Propagation and EM Theory (ISAPE) ◽

10.1109/isape.2018.8634106 ◽

2018 ◽

Cited By ~ 2

Author(s):

Xiaomeng Fu ◽

Huiming Qu

Keyword(s):

Neural Network ◽

Remote Sensing ◽

High Resolution ◽

Convolutional Neural Network ◽

Semantic Segmentation ◽

Remote Sensing Image

Intelligent High-Resolution Geological Mapping Based on SLIC-CNN

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9020099 ◽

2020 ◽

Vol 9 (2) ◽

pp. 99

Author(s):

Xuejia Sang ◽

Linfu Xue ◽

Xiangjin Ran ◽

Xiaoshun Li ◽

Jiwen Liu ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Rock Mass ◽

High Resolution ◽

Convolutional Neural Network ◽

Area Under The Curve ◽

Mapping Method ◽

Geological Mapping ◽

Liaoning Province ◽

Remote Sensing Images

High-resolution geological mapping is an important supporting condition for mineral and energy exploration. However, it still faces many problems. At present, high-resolution geological maps are still produced by expert interpretation of survey lines, compass measurements, and field data. Field work is constrained by weather, terrain, and personnel, and the working methods need to be improved. This paper proposes a new method for high-resolution mapping using an Unmanned Aerial Vehicle (UAV) and deep learning algorithms. The method uses the UAV to collect high-resolution remote sensing images, relies on a small amount of ground work to anchor the lithology, and then completes most of the mapping work on the high-resolution remote sensing images. It moves a large amount of field work indoors and provides an automatic mapping process based on the Simple Linear Iterative Clustering-Convolutional Neural Network (SLIC-CNN) algorithm. The convolutional neural network (CNN) identifies the image content and confirms the lithologic distribution, the simple linear iterative clustering (SLIC) algorithm outlines the boundary of each rock mass and determines the contact interfaces between rock masses, and a mode and expert decision method is used to resolve the fused results and produce the map. The mapping method was applied to the Taili waterfront in Xingcheng City, Liaoning Province, China. In this study, the Area Under the Curve (AUC) of the mapping method was 0.937, the Kappa test result was k = 0.8523, and a high-resolution geological map was obtained.
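The abstract does not spell out the fusion rule or how Kappa was computed. As a minimal sketch, assuming that "mode" means a majority vote of per-pixel CNN labels inside each SLIC segment and that Kappa is Cohen's kappa, the two steps could look like this. The function names and the lithology labels "granite" and "schist" are hypothetical.

```python
from collections import Counter


def fuse_by_mode(segments, labels):
    """Give every pixel the most common predicted label in its segment."""
    votes = {}
    for seg, lab in zip(segments, labels):
        votes.setdefault(seg, Counter())[lab] += 1
    winner = {seg: c.most_common(1)[0][0] for seg, c in votes.items()}
    return [winner[seg] for seg in segments]


def cohen_kappa(truth, pred):
    """Cohen's kappa: agreement beyond chance between two label sequences.

    Undefined (division by zero) when chance agreement equals 1,
    i.e. when both sequences contain a single identical class.
    """
    n = len(truth)
    observed = sum(1 for t, p in zip(truth, pred) if t == p) / n
    t_count, p_count = Counter(truth), Counter(pred)
    classes = set(truth) | set(pred)
    expected = sum(t_count[c] * p_count[c] for c in classes) / (n * n)
    return (observed - expected) / (1.0 - expected)


# Six pixels in two segments; the raw CNN labels contain one error per segment.
segments = [0, 0, 0, 1, 1, 1]
cnn_labels = ["granite", "granite", "schist", "schist", "schist", "granite"]
truth = ["granite", "granite", "granite", "schist", "schist", "schist"]

fused = fuse_by_mode(segments, cnn_labels)
kappa_raw = cohen_kappa(truth, cnn_labels)
kappa_fused = cohen_kappa(truth, fused)
```

In this toy case the majority vote removes the isolated errors inside each segment, so kappa rises from about 0.33 for the raw labels to 1.0 for the fused labels.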