Deep-learning-based single-image height reconstruction from very-high-resolution SAR intensity data

Building extraction from very high resolution (VHR) imagery plays an important role in urban planning, disaster management, navigation, updating geographic databases, and several other geospatial applications. Compared with the traditional building extraction approaches, deep learning networks have recently shown outstanding performance in this task by using both high-level and low-level feature maps. However, it is difficult to utilize different level features rationally with the present deep learning networks. To tackle this problem, a novel network based on DenseNets and the attention mechanism was proposed, called the dense-attention network (DAN). The DAN contains an encoder part and a decoder part which are separately composed of lightweight DenseNets and a spatial attention fusion module. The proposed encoder–decoder architecture can strengthen feature propagation and effectively bring higher-level feature information to suppress the low-level feature and noises. Experimental results based on public international society for photogrammetry and remote sensing (ISPRS) datasets with only red–green–blue (RGB) images demonstrated that the proposed DAN achieved a higher score (96.16% overall accuracy (OA), 92.56% F1 score, 90.56% mean intersection over union (MIOU), less training and response time and higher-quality value) when compared with other deep learning methods.

Download Full-text

Road Extraction from Very-High-Resolution Remote Sensing Images via a Nested SE-Deeplab Model

Remote Sensing ◽

10.3390/rs12182985 ◽

2020 ◽

Vol 12 (18) ◽

pp. 2985 ◽

Cited By ~ 1

Author(s):

Yeneng Lin ◽

Dongyun Xu ◽

Nan Wang ◽

Zhou Shi ◽

Qiuxiao Chen

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

High Resolution ◽

Network Models ◽

Road Extraction ◽

Remote Sensing Images ◽

Proposed Model ◽

Wide Range ◽

Network Modules ◽

Very High

Automatic road extraction from very-high-resolution remote sensing images has become a popular topic in a wide range of fields. Convolutional neural networks are often used for this purpose. However, many network models do not achieve satisfactory extraction results because of the elongated nature and varying sizes of roads in images. To improve the accuracy of road extraction, this paper proposes a deep learning model based on the structure of Deeplab v3. It incorporates squeeze-and-excitation (SE) module to apply weights to different feature channels, and performs multi-scale upsampling to preserve and fuse shallow and deep information. To solve the problems associated with unbalanced road samples in images, different loss functions and backbone network modules are tested in the model’s training process. Compared with cross entropy, dice loss can improve the performance of the model during training and prediction. The SE module is superior to ResNext and ResNet in improving the integrity of the extracted roads. Experimental results obtained using the Massachusetts Roads Dataset show that the proposed model (Nested SE-Deeplab) improves F1-Score by 2.4% and Intersection over Union by 2.0% compared with FC-DenseNet. The proposed model also achieves better segmentation accuracy in road extraction compared with other mainstream deep-learning models including Deeplab v3, SegNet, and UNet.

Download Full-text

A Multi-Task Deep Learning Framework Coupling Semantic Segmentation and Image Reconstruction for Very High Resolution Imagery

IGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss.2019.8898133 ◽

2019 ◽

Author(s):

Maria Papadomanolaki ◽

Konstantinos Karantzalos ◽

Maria Vakalopoulou

Keyword(s):

Deep Learning ◽

High Resolution ◽

Image Reconstruction ◽

Semantic Segmentation ◽

Learning Framework ◽

High Resolution Imagery ◽

Very High Resolution Imagery ◽

Very High

Download Full-text

A Comparison of Deep Learning Architectures for Semantic Mapping of Very High Resolution Images

IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss.2018.8518533 ◽

2018 ◽

Cited By ~ 4

Author(s):

Qinghui Liu ◽

Arnt-Borre Salberg ◽

Robert Jenssen

Keyword(s):

Deep Learning ◽

High Resolution ◽

Semantic Mapping ◽

High Resolution Images ◽

Learning Architectures ◽

Very High

Download Full-text

Deep learning for dense labeling of hydrographic regions in very high resolution imagery

Image and Signal Processing for Remote Sensing XXV ◽

10.1117/12.2533161 ◽

2019 ◽

Cited By ~ 1

Author(s):

Vladimir V. Kniaz

Keyword(s):

Deep Learning ◽

High Resolution ◽

High Resolution Imagery ◽

Very High Resolution Imagery ◽

Hydrographic Regions ◽

Very High

Download Full-text

Building Extraction in Very High Resolution Remote Sensing Imagery Using Deep Learning and Guided Filters

Remote Sensing ◽

10.3390/rs10010144 ◽

2018 ◽

Vol 10 (1) ◽

pp. 144 ◽

Cited By ~ 114

Author(s):

Yongyang Xu ◽

Liang Wu ◽

Zhong Xie ◽

Zhanlong Chen

Keyword(s):

Remote Sensing ◽

Deep Learning ◽

High Resolution ◽

Building Extraction ◽

Remote Sensing Imagery ◽

Very High

Download Full-text

DEEP LEARNING BASED ROOF TYPE CLASSIFICATION USING VERY HIGH RESOLUTION AERIAL IMAGERY

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliii-b3-2021-55-2021 ◽

2021 ◽

Vol XLIII-B3-2021 ◽

pp. 55-60

Author(s):

M. Buyukdemircioglu ◽

R. Can ◽

S. Kocaman

Keyword(s):

Deep Learning ◽

High Resolution ◽

Urban Areas ◽

Image Features ◽

Training Data ◽

Fine Tuning ◽

Computer Hardware ◽

Geographical Information ◽

Training Dataset ◽

Very High

Abstract. Automatic detection, segmentation and reconstruction of buildings in urban areas from Earth Observation (EO) data are still challenging for many researchers. Roof is one of the most important element in a building model. The three-dimensional geographical information system (3D GIS) applications generally require the roof type and roof geometry for performing various analyses on the models, such as energy efficiency. The conventional segmentation and classification methods are often based on features like corners, edges and line segments. In parallel to the developments in computer hardware and artificial intelligence (AI) methods including deep learning (DL), image features can be extracted automatically. As a DL technique, convolutional neural networks (CNNs) can also be used for image classification tasks, but require large amount of high quality training data for obtaining accurate results. The main aim of this study was to generate a roof type dataset from very high-resolution (10 cm) orthophotos of Cesme, Turkey, and to classify the roof types using a shallow CNN architecture. The training dataset consists 10,000 roof images and their labels. Six roof type classes such as flat, hip, half-hip, gable, pyramid and complex roofs were used for the classification in the study area. The prediction performance of the shallow CNN model used here was compared with the results obtained from the fine-tuning of three well-known pre-trained networks, i.e. VGG-16, EfficientNetB4, ResNet-50. The results show that although our CNN has slightly lower performance expressed with the overall accuracy, it is still acceptable for many applications using sparse data.

Download Full-text