Intelligent Object Recognition of Urban Water Bodies Based on Deep Learning for Multi-Source and Multi-Temporal High Spatial Resolution Remote Sensing Imagery

Sensors ◽  
2020 ◽  
Vol 20 (2) ◽  
pp. 397 ◽  
Author(s):  
Shiran Song ◽  
Jianhua Liu ◽  
Yuan Liu ◽  
Guoqiang Feng ◽  
Hui Han ◽  
...  

High spatial resolution remote sensing image (HSRRSI) data provide rich texture, geometric structure, and spatial distribution information for surface water bodies. This rich detail better represents the internal components of each object category and better reflects the relationships between adjacent objects. In this context, recognition methods such as geographic object-based image analysis (GEOBIA) have improved significantly. However, these methods focus mainly on bottom-up classification from visual features to semantic categories, and ignore the top-down feedback that can optimize recognition results. In recent years, deep learning has been applied in the field of remote sensing because of its powerful feature extraction ability. Integrated frameworks that combine convolutional neural network (CNN) based region proposal generation with object detection have greatly improved object detection performance for HSRRSI, providing a new method for water body recognition from remote sensing data. This study uses the strong “self-learning ability” of deep learning to construct a modified Mask R-CNN structure that integrates bottom-up and top-down processes for water recognition. Compared with traditional methods, our method is completely data-driven without prior knowledge, and it can be regarded as a novel technical procedure for water body recognition in practical engineering applications. Experimental results indicate that the method produces accurate recognition results for multi-source and multi-temporal water bodies, and can effectively avoid confusion with shadows and other ground features.
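The abstract describes merging instance-level detections into a water map while rejecting shadow false alarms. The post-processing step that a Mask R-CNN-style pipeline implies can be sketched as below; the function name, mask format, and thresholds are illustrative assumptions, not the authors' code:

```python
def masks_to_water_map(masks, scores, score_thresh=0.5):
    """Merge per-instance soft masks into one binary water map,
    keeping only confident detections.

    masks  : list of H x W grids of mask values in [0, 1]
    scores : per-instance confidence from the detector
    """
    h, w = len(masks[0]), len(masks[0][0])
    water = [[False] * w for _ in range(h)]
    for mask, score in zip(masks, scores):
        if score < score_thresh:
            continue  # drop low-confidence instances (e.g. shadow false alarms)
        for i in range(h):
            for j in range(w):
                if mask[i][j] >= 0.5:
                    water[i][j] = True
    return water

# Toy 2x2 scene: one confident water instance, one low-scoring candidate.
masks = [
    [[0.9, 0.9], [0.0, 0.0]],   # instance 0 covers the top row
    [[0.0, 0.0], [0.8, 0.8]],   # instance 1, e.g. a shadow
]
scores = [0.95, 0.30]
water = masks_to_water_map(masks, scores)
```

Only the confident instance survives, so the bottom row stays non-water even though its soft mask values are high.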

2020 ◽  
Vol 12 (3) ◽  
pp. 417 ◽  
Author(s):  
Xin Zhang ◽  
Liangxiu Han ◽  
Lianghao Han ◽  
Liang Zhu

Land cover information plays an important role in mapping ecological and environmental changes across Earth’s diverse landscapes for ecosystem monitoring. Remote sensing data have been widely used in land cover studies, enabling efficient mapping of changes on the Earth’s surface from space. Although the availability of high-resolution remote sensing imagery increases significantly every year, traditional land cover analysis approaches at the pixel and object levels are not optimal. Recent advances in deep learning have achieved remarkable success in image recognition and shown potential for high spatial resolution remote sensing applications, including classification and object detection. In this paper, a comprehensive review of land cover classification and object detection approaches using high resolution imagery is provided. Through two case studies, we demonstrate the application of state-of-the-art deep learning models to high spatial resolution remote sensing data for land cover classification and object detection, and evaluate their performance against traditional approaches. For the land cover classification task, the deep-learning-based methods provide an end-to-end solution that uses both spatial and spectral information; they outperform the traditional pixel-based method, especially for the different vegetation categories. For the object detection task, the deep-learning-based method achieved more than 98% accuracy over a large area; its high accuracy and efficiency could relieve the burden of the traditional, labour-intensive method. However, considering the diversity of remote sensing data, more training datasets are required to improve the generalisation and robustness of deep-learning-based models.
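The performance comparison described above rests on standard confusion-matrix metrics for land cover maps. A minimal sketch (the matrix values are invented for illustration, not taken from the paper):

```python
def overall_accuracy(confusion):
    # confusion[i][j] = number of pixels of true class i predicted as class j
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    total = sum(sum(row) for row in confusion)
    return correct / total

def producers_accuracy(confusion, i):
    # Recall for class i: correctly mapped pixels / reference pixels of class i
    return confusion[i][i] / sum(confusion[i])

# Toy 3-class confusion matrix (e.g. water, vegetation, built-up)
cm = [[90,  5,  5],
      [10, 80, 10],
      [ 0,  2, 98]]
```

For this toy matrix the overall accuracy is (90 + 80 + 98) / 300, and per-class producer's accuracies show where the classifier confuses spectrally similar categories.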


Forests ◽  
2019 ◽  
Vol 10 (11) ◽  
pp. 1047 ◽  
Author(s):  
Ying Sun ◽  
Jianfeng Huang ◽  
Zurui Ao ◽  
Dazhao Lao ◽  
Qinchuan Xin

Monitoring tree species diversity is important for maintaining ecosystem services and managing resources in forests and wetlands. Remote sensing is an efficient alternative to traditional fieldwork for mapping tree species diversity over large areas. Previous studies have used light detection and ranging (LiDAR) and imaging spectroscopy (hyperspectral or multispectral remote sensing) for species richness prediction. The recent development of very high spatial resolution (VHR) RGB imagery has enabled detailed characterization of canopies and forest structures. In this study, we developed a three-step workflow for mapping tree species diversity, aiming to improve diversity assessment using deep learning in a tropical wetland (Haizhu Wetland) in South China based on VHR-RGB images and LiDAR points. First, individual trees were detected from a canopy height model (CHM, derived from the LiDAR points) using the local-maxima-based method in the FUSION software (Version 3.70, Seattle, USA). Then, tree species were identified at the individual tree level via a patch-based image input method, which cropped the RGB images into small patches (the individually detected trees) around the detected tree apexes. Three deep learning networks (AlexNet, VGG16, and ResNet50) were modified to classify the tree species, as they make good use of spatial context information. Finally, four diversity indices, namely the Margalef richness index, the Shannon–Wiener diversity index, the Simpson diversity index, and the Pielou evenness index, were calculated from fixed 30 × 30 m subsets for assessment. In the classification phase, VGG16 performed best, with an overall accuracy of 73.25% for 18 tree species.
Based on the classification results, the mapped tree species diversity showed reasonable agreement with field survey data (Margalef: R² = 0.4562, RMSE = 0.5629; Shannon–Wiener: R² = 0.7948, RMSE = 0.7202; Simpson: R² = 0.7907, RMSE = 0.1038; Pielou: R² = 0.5875, RMSE = 0.3053). While challenges remain for individual tree detection and species classification, the deep-learning-based solution shows potential for mapping tree species diversity.
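The four diversity indices named above have standard closed forms and can be computed directly from per-species tree counts within a plot. A minimal sketch; note the Simpson index appears in several variants in the literature, and the 1 − Σp² form is assumed here:

```python
import math

def diversity_indices(counts):
    """Compute Margalef, Shannon-Wiener, Simpson, and Pielou indices
    from individuals-per-species counts (e.g. trees in a 30 x 30 m subset)."""
    counts = [c for c in counts if c > 0]
    n = sum(counts)              # total individuals
    s = len(counts)              # species richness
    p = [c / n for c in counts]  # relative abundances
    shannon = -sum(pi * math.log(pi) for pi in p)
    return {
        "margalef": (s - 1) / math.log(n),
        "shannon": shannon,
        # Simpson diversity in its 1 - sum(p_i^2) form; some texts use sum(p_i^2)
        "simpson": 1 - sum(pi * pi for pi in p),
        # Pielou evenness: observed Shannon relative to its maximum ln(S)
        "pielou": shannon / math.log(s) if s > 1 else 0.0,
    }
```

With four equally abundant species, Pielou evenness is exactly 1 and Shannon equals ln(4), which is a quick sanity check on any implementation.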


2019 ◽  
Vol 12 (1) ◽  
pp. 44 ◽  
Author(s):  
Haojie Ma ◽  
Yalan Liu ◽  
Yuhuan Ren ◽  
Jingxian Yu

The rapid estimation of building damage via high spatial resolution remote sensing is an important and effective means of preliminary earthquake mitigation and relief. Traditional object detection methods rely only on hand-crafted shallow features of post-earthquake remote sensing images; these features are unreliable in complex background environments, feature selection is time-consuming, and satisfactory results are often difficult to obtain. This study therefore applies the convolutional neural network (CNN) based object detector You Only Look Once (YOLOv3) to locate collapsed buildings in post-earthquake remote sensing images, and improves YOLOv3 to obtain more effective detection results. First, we replaced the Darknet53 backbone in YOLOv3 with the lightweight CNN ShuffleNet v2. Second, the prediction-box center-point (XY) loss and width-height (WH) loss in the loss function were replaced with the generalized intersection over union (GIoU) loss. Experiments with the improved YOLOv3 model on 0.5 m high spatial resolution aerial remote sensing images acquired after the Yushu and Wenchuan earthquakes show a significant reduction in the number of parameters, a detection speed of up to 29.23 f/s, and a target precision of 90.89%. Compared with the original YOLOv3, the detection speed improved by 5.21 f/s and the precision by 5.24%. Moreover, the improved model had stronger noise immunity, indicating a significant improvement in generalization. Therefore, the improved YOLOv3 model is effective for detecting collapsed buildings in post-earthquake high-resolution remote sensing images.
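The GIoU term that replaces the XY/WH losses can be computed for axis-aligned boxes as below. This is a generic sketch of the published GIoU definition (the loss itself is 1 − GIoU), not the authors' implementation:

```python
def giou(box_a, box_b):
    """Generalized IoU for axis-aligned boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Intersection area (zero if the boxes do not overlap)
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    iou = inter / union
    # Smallest enclosing box C penalises far-apart, non-overlapping boxes,
    # which is what makes GIoU a useful regression loss where IoU is flat at 0.
    cw = max(ax2, bx2) - min(ax1, bx1)
    ch = max(ay2, by2) - min(ay1, by1)
    c_area = cw * ch
    return iou - (c_area - union) / c_area
```

Unlike plain IoU, GIoU stays informative for disjoint boxes: two unit boxes separated by a one-unit gap give GIoU = −1/3 rather than a gradient-free 0.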


Author(s):  
B. Liu ◽  
J. Chen ◽  
H. Xing ◽  
H. Wu ◽  
J. Zhang

The spatial detail and updating frequency of land cover data are important factors for land surface dynamic monitoring applications at high spatial resolution. However, the fragmented patches and seasonal variability of some land cover types (e.g. small crop fields, wetlands) make generating land cover data labour-intensive and difficult. Utilizing high spatial resolution multi-temporal image data is a possible solution. Unfortunately, the spatial and temporal resolutions of available remote sensing data such as Landsat or MODIS can hardly satisfy the minimum mapping unit and the update frequency of current land cover mapping at the same time. Generating high resolution time series may be a compromise to cover this shortfall in the land cover updating process. One popular approach is to downscale multi-temporal MODIS data using high spatial resolution auxiliary data such as Landsat. However, the usual manner of downscaling pixels based on a window may lead to an underdetermined problem in heterogeneous areas, resulting in uncertainty for some high spatial resolution pixels; the downscaled multi-temporal data can therefore hardly reach the spatial resolution of Landsat data.

A spiral-based method was introduced to downscale image data of low spatial and high temporal resolution to high spatial and high temporal resolution. By searching for similar pixels in the adjacent region along a spiral, a pixel set was built up pixel by pixel. Adopting this pixel set largely prevents the underdetermined problem when solving the linear system. Using ordinary least squares, the method inverted the endmember values of the linear system, and the high spatial resolution image was reconstructed band by band on the basis of a high spatial resolution class map and the endmember values. The high spatial resolution time series was then formed from these images one by one.

A simulated experiment and a remote sensing image downscaling experiment were conducted. In the simulated experiment, the 30 m class map dataset GlobeLand30 was used to investigate how well the underdetermined problem is avoided in the downscaling procedure, and the spiral was compared with a window. Further, MODIS NDVI and Landsat image data were used to generate a 30 m NDVI time series in the downscaling experiment. The simulated results showed that the proposed method performed robustly when downscaling pixels in heterogeneous regions and was superior to the traditional window-based methods. The high resolution time series generated could benefit the mapping and updating of land cover data.
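The spiral traversal at the heart of the method can be sketched as a generator of pixel offsets visited ring by ring, so that nearer neighbours are always considered before farther ones. The function below is an illustrative reconstruction, not the authors' code; the similar-pixel selection criterion and the OLS endmember inversion are omitted:

```python
def spiral_offsets(max_ring):
    """Yield (dy, dx) offsets in a square spiral around the centre pixel,
    ring by ring, out to Chebyshev distance max_ring."""
    yield (0, 0)
    for r in range(1, max_ring + 1):
        y, x = -r, -r  # start at the top-left corner of ring r
        # Walk the four sides of the ring: right, down, left, up.
        for dy, dx, steps in ((0, 1, 2 * r), (1, 0, 2 * r),
                              (0, -1, 2 * r), (-1, 0, 2 * r)):
            for _ in range(steps):
                yield (y, x)
                y, x = y + dy, x + dx
```

In the downscaling setting, one would follow this order, collecting coarse pixels of the same class-composition until the linear system for the endmember values has enough equations to be solved by ordinary least squares.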

