Convolutional Neural Network for the Semantic Segmentation of Remote Sensing Images

Muhammad Alam; Jian-Feng Wang; Cong Guangpei; LV Yunrong; Yuanfang Chen

doi:10.1007/s11036-020-01703-3

Convolutional Neural Network for the Semantic Segmentation of Remote Sensing Images

Mobile Networks and Applications ◽

10.1007/s11036-020-01703-3 ◽

2021 ◽

Vol 26 (1) ◽

pp. 200-215

Author(s):

Muhammad Alam ◽

Jian-Feng Wang ◽

Cong Guangpei ◽

LV Yunrong ◽

Yuanfang Chen

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Neural Networks ◽

Image Processing ◽

Deep Learning ◽

Semantic Segmentation ◽

Natural Scene ◽

Remote Sensing Images ◽

Advantages And Disadvantages ◽

Target Segmentation

AbstractIn recent years, the success of deep learning in natural scene image processing boosted its application in the analysis of remote sensing images. In this paper, we applied Convolutional Neural Networks (CNN) on the semantic segmentation of remote sensing images. We improve the Encoder- Decoder CNN structure SegNet with index pooling and U-net to make them suitable for multi-targets semantic segmentation of remote sensing images. The results show that these two models have their own advantages and disadvantages on the segmentation of different objects. In addition, we propose an integrated algorithm that integrates these two models. Experimental results show that the presented integrated algorithm can exploite the advantages of both the models for multi-target segmentation and achieve a better segmentation compared to these two models.

Download Full-text

A Multi-Scale Water Extraction Convolutional Neural Network (MWEN) Method for GaoFen-1 Remote Sensing Images

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9040189 ◽

2020 ◽

Vol 9 (4) ◽

pp. 189 ◽

Cited By ~ 3

Author(s):

Hongxiang Guo ◽

Guojin He ◽

Wei Jiang ◽

Ranyu Yin ◽

Lei Yan ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Neural Networks ◽

Convolutional Neural Network ◽

Water Body ◽

Semantic Segmentation ◽

Water Bodies ◽

Water Extraction ◽

Remote Sensing Images ◽

Multi Scale

Automatic water body extraction method is important for monitoring floods, droughts, and water resources. In this study, a new semantic segmentation convolutional neural network named the multi-scale water extraction convolutional neural network (MWEN) is proposed to automatically extract water bodies from GaoFen-1 (GF-1) remote sensing images. Three convolutional neural networks for semantic segmentation (fully convolutional network (FCN), Unet, and Deeplab V3+) are employed to compare with the water bodies extraction performance of MWEN. Visual comparison and five evaluation metrics are used to evaluate the performance of these convolutional neural networks (CNNs). The results show the following. (1) The results of water body extraction in multiple scenes using the MWEN are better than those of the other comparison methods based on the indicators. (2) The MWEN method has the capability to accurately extract various types of water bodies, such as urban water bodies, open ponds, and plateau lakes. (3) By fusing features extracted at different scales, the MWEN has the capability to extract water bodies with different sizes and suppress noise, such as building shadows and highways. Therefore, MWEN is a robust water extraction algorithm for GaoFen-1 satellite images and has the potential to conduct water body mapping with multisource high-resolution satellite remote sensing data.

Download Full-text

Semantic Segmentation of Satellite Images using Deep Learning

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.h9186.0610821 ◽

2021 ◽

Vol 10 (8) ◽

pp. 33-37

Author(s):

Chandra Pal Kushwah ◽

Kuruna Markam

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Deep Neural Networks ◽

Semantic Segmentation ◽

Remote Sensing Image ◽

Learning Performance ◽

Test Results ◽

Natural Scene ◽

Target Segmentation ◽

Segmentation Models

Bidirectional in recent years, Deep learning performance in natural scene image processing has improved its use in remote sensing image analysis. In this paper, we used the semantic segmentation of remote sensing images for deep neural networks (DNN). To make it ideal for multi-target semantic segmentation of remote sensing image systems, we boost the Seg Net encoder-decoder CNN structures with index pooling & U-net. The findings reveal that the segmentation of various objects has its benefits and drawbacks for both models. Furthermore, we provide an integrated algorithm that incorporates two models. The test results indicate that the integrated algorithm proposed will take advantage of all multi-target segmentation models and obtain improved segmentation relative to two models.

Download Full-text

THE REASON OF THE FIREWORK TYPE OF DEPTH EDUCATION IN PROCESSING THE DATA OF REMOTE SURFACES OF THE EARTH

Проблеми створення, випробування, застосування та експлуатації складних інформаційних систем ◽

10.46972/2076-1546.2019.16.07 ◽

2019 ◽

pp. 70-79

Author(s):

M. P. Romanchuk

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Neural Networks ◽

Deep Learning ◽

Remote Sensing Data ◽

Earth Remote Sensing ◽

Technical Problems ◽

Sensing Data ◽

Advantages And Disadvantages ◽

Software Frameworks

An important task in the processing of Earth remote sensing data is the automation of the decoding process of aerospace images, in particular the detection and recognition of objects in military decoding. In the article the directions of automation of decryption of photos are considered and promising from them is selected, which is based on the use of neural networks of deep learning, and also analyzed the technical problems that arise during the creation of algorithms and the deployment of trained models on a variety of mobile devices. The important role of deep-instruction software frameworks in the process of training of neural network models is aimed at facilitating development and deployment. The changes in the popularity of software frameworks in recent years have been analyzed and the need to analyze their dynamically changing capabilities has been analyzed. The most widely used software frameworks for the implementation of deep learning approaches, their advantages and disadvantages for solving tasks of thematic decryption on accessible computational resources are explored. The types of computational graphs, which use the software of deep learning, and programming languages, with the help of which it is allowed to create and deploy models of neural networks are considered. The analysis of the frameworks according to selected criteria was performed: distributed execution, architecture optimization, reflection of the learning process, joint support and portability. As a result, the software framework to be used in conducting research is highlighted, and the conclusion is drawn about the predominant framework for industrial use in the course of in-depth training of the neural network for the processing of Earth remote sensing data.

Download Full-text

Research on Scene Classification Method of High-Resolution Remote Sensing Images Based on RFPNet

Applied Sciences ◽

10.3390/app9102028 ◽

2019 ◽

Vol 9 (10) ◽

pp. 2028

Author(s):

Xin Zhang ◽

Yongcheng Wang ◽

Ning Zhang ◽

Dongdong Xu ◽

Bo Chen

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Neural Networks ◽

Deep Learning ◽

High Resolution ◽

Remote Sensing Image ◽

Visual Features ◽

Test Accuracy ◽

Scene Classification ◽

Remote Sensing Images

One of the challenges in the field of remote sensing is how to automatically identify and classify high-resolution remote sensing images. A number of approaches have been proposed. Among them, the methods based on low-level visual features and middle-level visual features have limitations. Therefore, this paper adopts the method of deep learning to classify scenes of high-resolution remote sensing images to learn semantic information. Most of the existing methods of convolutional neural networks are based on the existing model using transfer learning, while there are relatively few articles about designing of new convolutional neural networks based on the existing high-resolution remote sensing image datasets. In this context, this paper proposes a multi-view scaling strategy, a new convolutional neural network based on residual blocks and fusing strategy of pooling layer maps, and uses optimization methods to make the convolutional neural network named RFPNet more robust. Experiments on two benchmark remote sensing image datasets have been conducted. On the UC Merced dataset, the test accuracy, precision, recall, and F1-score all exceed 93%. On the SIRI-WHU dataset, the test accuracy, precision, recall, and F1-score all exceed 91%. Compared with the existing methods, such as the most traditional methods and some deep learning methods for scene classification of high-resolution remote sensing images, the proposed method has higher accuracy and robustness.

Download Full-text

Self-Attention in Reconstruction Bias U-Net for Semantic Segmentation of Building Rooftops in Optical Remote Sensing Images

Remote Sensing ◽

10.3390/rs13132524 ◽

2021 ◽

Vol 13 (13) ◽

pp. 2524

Author(s):

Ziyi Chen ◽

Dilong Li ◽

Wentao Fan ◽

Haiyan Guan ◽

Cheng Wang ◽

...

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Semantic Segmentation ◽

Extraction Methods ◽

The Self ◽

Optical Remote Sensing ◽

Building Extraction ◽

Learning Models ◽

Remote Sensing Images ◽

Segmentation Methods

Deep learning models have brought great breakthroughs in building extraction from high-resolution optical remote-sensing images. Among recent research, the self-attention module has called up a storm in many fields, including building extraction. However, most current deep learning models loading with the self-attention module still lose sight of the reconstruction bias’s effectiveness. Through tipping the balance between the abilities of encoding and decoding, i.e., making the decoding network be much more complex than the encoding network, the semantic segmentation ability will be reinforced. To remedy the research weakness in combing self-attention and reconstruction-bias modules for building extraction, this paper presents a U-Net architecture that combines self-attention and reconstruction-bias modules. In the encoding part, a self-attention module is added to learn the attention weights of the inputs. Through the self-attention module, the network will pay more attention to positions where there may be salient regions. In the decoding part, multiple large convolutional up-sampling operations are used for increasing the reconstruction ability. We test our model on two open available datasets: the WHU and Massachusetts Building datasets. We achieve IoU scores of 89.39% and 73.49% for the WHU and Massachusetts Building datasets, respectively. Compared with several recently famous semantic segmentation methods and representative building extraction methods, our method’s results are satisfactory.

Download Full-text

Semantic Segmentation of Remote Sensing Images Using Transfer Learning and Deep Convolutional Neural Network With Dense Connection

IEEE Access ◽

10.1109/access.2020.3003914 ◽

2020 ◽

Vol 8 ◽

pp. 116744-116755 ◽

Cited By ~ 1

Author(s):

Binge Cui ◽

Xin Chen ◽

Yan Lu

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Convolutional Neural Network ◽

Transfer Learning ◽

Semantic Segmentation ◽

Deep Convolutional Neural Network ◽

Remote Sensing Images

Download Full-text

Road Extraction from Unmanned Aerial Vehicle Remote Sensing Images Based on Improved Neural Networks

Sensors ◽

10.3390/s19194115 ◽

2019 ◽

Vol 19 (19) ◽

pp. 4115 ◽

Cited By ~ 1

Author(s):

Yuxia Li ◽

Bo Peng ◽

Lei He ◽

Kunlong Fan ◽

Zhenxu Li ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Neural Networks ◽

Unmanned Aerial Vehicle ◽

Computational Efficiency ◽

Neural Nets ◽

Road Extraction ◽

Remote Sensing Images ◽

Feature Maps ◽

Aerial Vehicle

Roads are vital components of infrastructure, the extraction of which has become a topic of significant interest in the field of remote sensing. Because deep learning has been a popular method in image processing and information extraction, researchers have paid more attention to extracting road using neural networks. This article proposes the improvement of neural networks to extract roads from Unmanned Aerial Vehicle (UAV) remote sensing images. D-Linknet was first considered for its high performance; however, the huge scale of the net reduced computational efficiency. With a focus on the low computational efficiency problem of the popular D-LinkNet, this article made some improvements: (1) Replace the initial block with a stem block. (2) Rebuild the entire network based on ResNet units with a new structure, allowing for the construction of an improved neural network D-Linknetplus. (3) Add a 1 × 1 convolution layer before DBlock to reduce the input feature maps, reducing parameters and improving computational efficiency. Add another 1 × 1 convolution layer after DBlock to recover the required number of output channels. Accordingly, another improved neural network B-D-LinknetPlus was built. Comparisons were performed between the neural nets, and the verification were made with the Massachusetts Roads Dataset. The results show improved neural networks are helpful in reducing the network size and developing the precision needed for road extraction.

Download Full-text

Analysis of Encoder-Decoder Based Deep Learning Architectures for Semantic Segmentation in Remote Sensing Images

Advances in Intelligent Systems and Computing - Intelligent Systems Design and Applications ◽

10.1007/978-3-030-16660-1_33 ◽

2019 ◽

pp. 332-341

Author(s):

R. Sivagami ◽

J. Srihari ◽

K. S. Ravichandran

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Semantic Segmentation ◽

Remote Sensing Images ◽

Learning Architectures

Download Full-text

Performance Evaluation of Single-Label and Multi-Label Remote Sensing Image Retrieval Using a Dense Labeling Dataset

Remote Sensing ◽

10.3390/rs10060964 ◽

2018 ◽

Vol 10 (6) ◽

pp. 964 ◽

Cited By ~ 34

Author(s):

Zhenfeng Shao ◽

Ke Yang ◽

Weixun Zhou

Keyword(s):

Remote Sensing ◽

Performance Evaluation ◽

Deep Learning ◽

Image Retrieval ◽

Semantic Segmentation ◽

Semantic Content ◽

Remote Sensing Image ◽

Remote Sensing Images ◽

Benchmark Datasets ◽

Feature Based

Benchmark datasets are essential for developing and evaluating remote sensing image retrieval (RSIR) approaches. However, most of the existing datasets are single-labeled, with each image in these datasets being annotated by a single label representing the most significant semantic content of the image. This is sufficient for simple problems, such as distinguishing between a building and a beach, but multiple labels and sometimes even dense (pixel) labels are required for more complex problems, such as RSIR and semantic segmentation.We therefore extended the existing multi-labeled dataset collected for multi-label RSIR and presented a dense labeling remote sensing dataset termed "DLRSD". DLRSD contained a total of 17 classes, and the pixels of each image were assigned with 17 pre-defined labels. We used DLRSD to evaluate the performance of RSIR methods ranging from traditional handcrafted feature-based methods to deep learning-based ones. More specifically, we evaluated the performances of RSIR methods from both single-label and multi-label perspectives. These results demonstrated the advantages of multiple labels over single labels for interpreting complex remote sensing images. DLRSD provided the literature a benchmark for RSIR and other pixel-based problems such as semantic segmentation.

Download Full-text

Semantic segmentation of remote sensing images based on deep learning methods

10.1117/12.2615120 ◽

2021 ◽

Author(s):

Cong Huang ◽

Yao Yang ◽

Huajun Wang ◽

Yu Ma ◽

Jinquan Zhao ◽

...

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

Semantic Segmentation ◽

Remote Sensing Images ◽

Learning Methods

Download Full-text