CCT: Conditional Co-Training for Truly Unsupervised Remote Sensing Image Segmentation in Coastal Areas

As the fastest growing trend in big data analysis, deep learning technology has proven to be both an unprecedented breakthrough and a powerful tool in many fields, particularly for image segmentation tasks. Nevertheless, most achievements depend on high-quality pre-labeled training samples, which are labor-intensive and time-consuming. Furthermore, different from conventional natural images, coastal remote sensing ones generally carry far more complicated and considerable land cover information, making it difficult to produce pre-labeled references for supervised image segmentation. In our research, motivated by this observation, we take an in-depth investigation on the utilization of neural networks for unsupervised learning and propose a novel method, namely conditional co-training (CCT), specifically for truly unsupervised remote sensing image segmentation in coastal areas. In our idea, a multi-model framework consisting of two parallel data streams, which are superpixel-based over-segmentation and pixel-level semantic segmentation, is proposed to simultaneously perform the pixel-level classification. The former processes the input image into multiple over-segments, providing self-constrained guidance for model training. Meanwhile, with this guidance, the latter continuously processes the input image into multi-channel response maps until the model converges. Incentivized by multiple conditional constraints, our framework learns to extract high-level semantic knowledge and produce full-resolution segmentation maps without pre-labeled ground truths. Compared to the black-box solutions in conventional supervised learning manners, this method is of stronger explainability and transparency for its specific architecture and mechanism. The experimental results on two representative real-world coastal remote sensing datasets of image segmentation and the comparison with other state-of-the-art truly unsupervised methods validate the plausible performance and excellent efficiency of our proposed CCT.

Download Full-text

A Survey of Semantic Construction and Application of Satellite Remote Sensing Images and Data

Journal of Organizational and End User Computing ◽

10.4018/joeuc.20211101.oa6 ◽

2021 ◽

Vol 33 (6) ◽

pp. 1-20

Author(s):

Hui Lu ◽

Qi Liu ◽

Xiaodong Liu ◽

Yonghong Zhang

Keyword(s):

Remote Sensing ◽

Big Data ◽

Rapid Development ◽

Remote Sensing Data ◽

Semantic Segmentation ◽

Remote Sensing Image ◽

Semantic Knowledge ◽

Learning Technology ◽

Semantic Classification ◽

Sensing Data

With the rapid development of satellite technology, remote sensing data has entered the era of big data, and the intelligent processing of remote sensing image has been paid more and more attention. Through the semantic research of remote sensing data, the processing ability of remote sensing data is greatly improved. This paper aims to introduce and analyze the research and application progress of remote sensing image satellite data processing from the perspective of semantic. Firstly, it introduces the characteristics and semantic knowledge of remote sensing big data; Secondly, the semantic concept, semantic construction and application fields are introduced in detail; then, for remote sensing big data, the technical progress in the study field of semantic construction is analyzed from four aspects: semantic description and understanding, semantic segmentation, semantic classification and semantic search, focusing on deep learning technology; Finally, the problems and challenges in the four aspects are discussed in detail, in order to find more directions to explore.

Download Full-text

A Survey of Semantic Construction and Application of Satellite Remote Sensing Images and Data

Journal of Organizational and End User Computing ◽

10.4018/joeuc.20211101oa06 ◽

2021 ◽

Vol 33 (6) ◽

pp. 0-0

Keyword(s):

Remote Sensing ◽

Big Data ◽

Rapid Development ◽

Remote Sensing Data ◽

Semantic Segmentation ◽

Remote Sensing Image ◽

Semantic Knowledge ◽

Learning Technology ◽

Semantic Classification ◽

Sensing Data

Download Full-text

Semi-Supervised Remote Sensing Image Semantic Segmentation via Consistency Regularization and Average Update of Pseudo-Label

Remote Sensing ◽

10.3390/rs12213603 ◽

2020 ◽

Vol 12 (21) ◽

pp. 3603 ◽

Cited By ~ 1

Author(s):

Jiaxin Wang ◽

Chris H. Q. Ding ◽

Sibao Chen ◽

Chenggang He ◽

Bin Luo

Keyword(s):

Remote Sensing ◽

Image Segmentation ◽

Supervised Learning ◽

Semantic Segmentation ◽

Remote Sensing Image ◽

Unlabeled Data ◽

Training Method ◽

Remote Sensing Images ◽

Supervised Training ◽

Great Progress

Image segmentation has made great progress in recent years, but the annotation required for image segmentation is usually expensive, especially for remote sensing images. To solve this problem, we explore semi-supervised learning methods and appropriately utilize a large amount of unlabeled data to improve the performance of remote sensing image segmentation. This paper proposes a method for remote sensing image segmentation based on semi-supervised learning. We first design a Consistency Regularization (CR) training method for semi-supervised training, then employ the new learned model for Average Update of Pseudo-label (AUP), and finally combine pseudo labels and strong labels to train semantic segmentation network. We demonstrate the effectiveness of the proposed method on three remote sensing datasets, achieving better performance without more labeled data. Extensive experiments show that our semi-supervised method can learn the latent information from the unlabeled data to improve the segmentation performance.

Download Full-text

Fully Convolutional Neural Network with Augmented Atrous Spatial Pyramid Pool and Fully Connected Fusion Path for High Resolution Remote Sensing Image Segmentation

Applied Sciences ◽

10.3390/app9091816 ◽

2019 ◽

Vol 9 (9) ◽

pp. 1816 ◽

Cited By ~ 12

Author(s):

Guangsheng Chen ◽

Chao Li ◽

Wei Wei ◽

Weipeng Jing ◽

Marcin Woźniak ◽

...

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Image Segmentation ◽

High Resolution ◽

Semantic Segmentation ◽

Remote Sensing Image ◽

Dilated Convolution ◽

Segmentation Task ◽

Fully Connected ◽

Spatial Pyramid

Recent developments in Convolutional Neural Networks (CNNs) have allowed for the achievement of solid advances in semantic segmentation of high-resolution remote sensing (HRRS) images. Nevertheless, the problems of poor classification of small objects and unclear boundaries caused by the characteristics of the HRRS image data have not been fully considered by previous works. To tackle these challenging problems, we propose an improved semantic segmentation neural network, which adopts dilated convolution, a fully connected (FC) fusion path and pre-trained encoder for the semantic segmentation task of HRRS imagery. The network is built with the computationally-efficient DeepLabv3 architecture, with added Augmented Atrous Spatial Pyramid Pool and FC Fusion Path layers. Dilated convolution enlarges the receptive field of feature points without decreasing the feature map resolution. The improved neural network architecture enhances HRRS image segmentation, reaching the classification accuracy of 91%, and the precision of recognition of small objects is improved. The applicability of the improved model to the remote sensing image segmentation task is verified.

Download Full-text

Top-Down Pyramid Fusion Network for High-Resolution Remote Sensing Semantic Segmentation

Remote Sensing ◽

10.3390/rs13204159 ◽

2021 ◽

Vol 13 (20) ◽

pp. 4159

Author(s):

Yuhang Gu ◽

Jie Hao ◽

Bing Chen ◽

Hai Deng

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Feature Fusion ◽

Semantic Segmentation ◽

Semantic Knowledge ◽

Surface Model ◽

Top Down ◽

Segmentation Accuracy ◽

Fusion Methods ◽

High Level

In recent years, high-resolution remote sensing semantic segmentation based on data fusion has gradually become a research focus in the field of land classification, which is an indispensable task of a smart city. However, the existing feature fusion methods with bottom-up structures can achieve limited fusion results. Alternatively, various auxiliary fusion modules significantly increase the complexity of the models and make the training process intolerably expensive. In this paper, we propose a new lightweight model called top-down pyramid fusion network (TdPFNet) including a multi-source feature extractor, a top-down pyramid fusion module and a decoder. It can deeply fuse features from different sources in a top-down structure using high-level semantic knowledge guiding the fusion of low-level texture information. Digital surface model (DSM) data and open street map (OSM) data are used as auxiliary inputs to the Potsdam dataset for the proposed model evaluation. Experimental results show that the network proposed in this paper not only notably improves the segmentation accuracy, but also reduces the complexity of the multi-source semantic segmentation model.

Download Full-text

Efficient Transformer for Remote Sensing Image Segmentation

Remote Sensing ◽

10.3390/rs13183585 ◽

2021 ◽

Vol 13 (18) ◽

pp. 3585

Author(s):

Zhiyong Xu ◽

Weicun Zhang ◽

Tianxiang Zhang ◽

Zhifang Yang ◽

Jiangyun Li

Keyword(s):

Remote Sensing ◽

Image Segmentation ◽

Semantic Segmentation ◽

Remote Sensing Image ◽

Edge Classification ◽

Geological Surveys ◽

Object Edge ◽

Disaster Monitoring ◽

Transformer Model ◽

Computation Load

Semantic segmentation for remote sensing images (RSIs) is widely applied in geological surveys, urban resources management, and disaster monitoring. Recent solutions on remote sensing segmentation tasks are generally addressed by CNN-based models and transformer-based models. In particular, transformer-based architecture generally struggles with two main problems: a high computation load and inaccurate edge classification. Therefore, to overcome these problems, we propose a novel transformer model to realize lightweight edge classification. First, based on a Swin transformer backbone, a pure Efficient transformer with mlphead is proposed to accelerate the inference speed. Moreover, explicit and implicit edge enhancement methods are proposed to cope with object edge problems. The experimental results evaluated on the Potsdam and Vaihingen datasets present that the proposed approach significantly improved the final accuracy, achieving a trade-off between computational complexity (Flops) and accuracy (Efficient-L obtaining 3.23% mIoU improvement on Vaihingen and 2.46% mIoU improvement on Potsdam compared with HRCNet_W48). As a result, it is believed that the proposed Efficient transformer will have an advantage in dealing with remote sensing image segmentation problems.

Download Full-text