Mining Mid-Level Visual Elements for Object Detection in High-Resolution Remote Sensing Images

Instance segmentation in high-resolution (HR) remote sensing imagery is one of the most challenging tasks and is more difficult than object detection and semantic segmentation tasks. It aims to predict class labels and pixel-wise instance masks to locate instances in an image. However, there are rare methods currently suitable for instance segmentation in the HR remote sensing images. Meanwhile, it is more difficult to implement instance segmentation due to the complex background of remote sensing images. In this article, a novel instance segmentation approach of HR remote sensing imagery based on Cascade Mask R-CNN is proposed, which is called a high-quality instance segmentation network (HQ-ISNet). In this scheme, the HQ-ISNet exploits a HR feature pyramid network (HRFPN) to fully utilize multi-level feature maps and maintain HR feature maps for remote sensing images’ instance segmentation. Next, to refine mask information flow between mask branches, the instance segmentation network version 2 (ISNetV2) is proposed to promote further improvements in mask prediction accuracy. Then, we construct a new, more challenging dataset based on the synthetic aperture radar (SAR) ship detection dataset (SSDD) and the Northwestern Polytechnical University very-high-resolution 10-class geospatial object detection dataset (NWPU VHR-10) for remote sensing images instance segmentation which can be used as a benchmark for evaluating instance segmentation algorithms in the high-resolution remote sensing images. Finally, extensive experimental analyses and comparisons on the SSDD and the NWPU VHR-10 dataset show that (1) the HRFPN makes the predicted instance masks more accurate, which can effectively enhance the instance segmentation performance of the high-resolution remote sensing imagery; (2) the ISNetV2 is effective and promotes further improvements in mask prediction accuracy; (3) our proposed framework HQ-ISNet is effective and more accurate for instance segmentation in the remote sensing imagery than the existing algorithms.

Download Full-text

CSVM Architectures for Pixel-Wise Object Detection in High-Resolution Remote Sensing Images

IEEE Transactions on Geoscience and Remote Sensing ◽

10.1109/tgrs.2020.2972289 ◽

2020 ◽

Vol 58 (9) ◽

pp. 6059-6070

Author(s):

Youyou Li ◽

Farid Melgani ◽

Binbin He

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Object Detection ◽

Remote Sensing Images

Download Full-text

Weighted Ensemble Object Detection with Optimized Coefficients for Remote Sensing Images

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9060370 ◽

2020 ◽

Vol 9 (6) ◽

pp. 370

Author(s):

Atakan Körez ◽

Necaattin Barışçı ◽

Aydın Çetin ◽

Uçman Ergün

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Object Detection ◽

Mean Average Precision ◽

Detection Methods ◽

Remote Sensing Images ◽

Average Precision ◽

Proposed Model ◽

Detection Of Objects ◽

Very High

The detection of objects in very high-resolution (VHR) remote sensing images has become increasingly popular with the enhancement of remote sensing technologies. High-resolution images from aircrafts or satellites contain highly detailed and mixed backgrounds that decrease the success of object detection in remote sensing images. In this study, a model that performs weighted ensemble object detection using optimized coefficients is proposed. This model uses the outputs of three different object detection models trained on the same dataset. The model’s structure takes two or more object detection methods as its input and provides an output with an optimized coefficient-weighted ensemble. The Northwestern Polytechnical University Very High Resolution 10 (NWPU-VHR10) and Remote Sensing Object Detection (RSOD) datasets were used to measure the object detection success of the proposed model. Our experiments reveal that the proposed model improved the Mean Average Precision (mAP) performance by 0.78%–16.5% compared to stand-alone models and presents better mean average precision than other state-of-the-art methods (3.55% higher on the NWPU-VHR-10 dataset and 1.49% higher when using the RSOD dataset).

Download Full-text

Patch-Based Three-Stage Aggregation Network for Object Detection in High Resolution Remote Sensing Images

IEEE Access ◽

10.1109/access.2020.3027044 ◽

2020 ◽

Vol 8 ◽

pp. 184934-184944

Author(s):

Bing Sui ◽

Meng Xu ◽

Feng Gao

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Object Detection ◽

Remote Sensing Images

Download Full-text