M2-Net: A Multi-scale Multi-level Feature Enhanced Network for Object Detection in Optical Remote Sensing Images

Although remarkable progress has been made in salient object detection (SOD) in natural scene images (NSI), the SOD of optical remote sensing images (RSI) still faces significant challenges due to various spatial resolutions, cluttered backgrounds, and complex imaging conditions, mainly for two reasons: (1) accurate location of salient objects; and (2) subtle boundaries of salient objects. This paper explores the inherent properties of multi-level features to develop a novel semantic-guided attention refinement network (SARNet) for SOD of NSI. Specifically, the proposed semantic guided decoder (SGD) roughly but accurately locates the multi-scale object by aggregating multiple high-level features, and then this global semantic information guides the integration of subsequent features in a step-by-step feedback manner to make full use of deep multi-level features. Simultaneously, the proposed parallel attention fusion (PAF) module combines cross-level features and semantic-guided information to refine the object’s boundary and highlight the entire object area gradually. Finally, the proposed network architecture is trained through an end-to-end fully supervised model. Quantitative and qualitative evaluations on two public RSI datasets and additional NSI datasets across five metrics show that our SARNet is superior to 14 state-of-the-art (SOTA) methods without any post-processing.

Download Full-text

Multi-scale Object Detection in Optical Remote Sensing Images Using Atrous Feature Pyramid Network

Neural Information Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-030-63830-6_39 ◽

2020 ◽

pp. 461-472

Author(s):

Mei Yu ◽

Minyutong Cheng ◽

Han Jiang ◽

Jining Shen ◽

Ruiguo Yu ◽

...

Keyword(s):

Remote Sensing ◽

Object Detection ◽

Optical Remote Sensing ◽

Remote Sensing Images ◽

Multi Scale ◽

Feature Pyramid

Download Full-text

A Multi-Scale Spatial Attention Region Proposal Network for High-Resolution Optical Remote Sensing Imagery

Remote Sensing ◽

10.3390/rs13173362 ◽

2021 ◽

Vol 13 (17) ◽

pp. 3362

Author(s):

Ruchan Dong ◽

Licheng Jiao ◽

Yan Zhang ◽

Jin Zhao ◽

Weiyan Shen

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Object Detection ◽

Spatial Attention ◽

Recall Rate ◽

Optical Remote Sensing ◽

Remote Sensing Images ◽

Remote Sensing Imagery ◽

Multi Scale ◽

Backbone Network

Deep convolutional neural networks (DCNNs) are driving progress in object detection of high-resolution remote sensing images. Region proposal generation, as one of the key steps in object detection, has also become the focus of research. High-resolution remote sensing images usually contain various sizes of objects and complex background, small objects are easy to miss or be mis-identified in object detection. If the recall rate of region proposal of small objects and multi-scale objects can be improved, it will bring an improvement on the performance of the accuracy in object detection. Spatial attention is the ability to focus on local features in images and can improve the learning efficiency of DCNNs. This study proposes a multi-scale spatial attention region proposal network (MSA-RPN) for high-resolution optical remote sensing imagery. The MSA-RPN is an end-to-end deep learning network with a backbone network of ResNet. It deploys three novel modules to fulfill its task. First, the Scale-specific Feature Gate (SFG) focuses on features of objects by processing multi-scale features extracted from the backbone network. Second, the spatial attention-guided model (SAGM) obtains spatial information of objects from the multi-scale attention maps. Third, the Selective Strong Attention Maps Model (SSAMM) adaptively selects sliding windows according to the loss values from the system’s feedback, and sends the windowed samples to the spatial attention decoder. Finally, the candidate regions and their corresponding confidences can be obtained. We evaluate the proposed network in a public dataset LEVIR and compare with several state-of-the-art methods. The proposed MSA-RPN yields a higher recall rate of region proposal generation, especially for small targets in remote sensing images.

Download Full-text