Multiscale Semantic Fusion-Guided Fractal Convolutional Object Detection Network for Optical Remote Sensing Imagery

Author(s):  
Tong Zhang ◽  
Yin Zhuang ◽  
Guanqun Wang ◽  
Shan Dong ◽  
He Chen ◽  
...  
IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 87150-87161 ◽  
Author(s):  
Songze Bao ◽  
Xing Zhong ◽  
Ruifei Zhu ◽  
Xiaonan Zhang ◽  
Zhuqiang Li ◽  
...  

2021 ◽  
Vol 13 (17) ◽  
pp. 3362
Author(s):  
Ruchan Dong ◽  
Licheng Jiao ◽  
Yan Zhang ◽  
Jin Zhao ◽  
Weiyan Shen

Deep convolutional neural networks (DCNNs) are driving progress in object detection of high-resolution remote sensing images. Region proposal generation, as one of the key steps in object detection, has also become the focus of research. High-resolution remote sensing images usually contain various sizes of objects and complex background, small objects are easy to miss or be mis-identified in object detection. If the recall rate of region proposal of small objects and multi-scale objects can be improved, it will bring an improvement on the performance of the accuracy in object detection. Spatial attention is the ability to focus on local features in images and can improve the learning efficiency of DCNNs. This study proposes a multi-scale spatial attention region proposal network (MSA-RPN) for high-resolution optical remote sensing imagery. The MSA-RPN is an end-to-end deep learning network with a backbone network of ResNet. It deploys three novel modules to fulfill its task. First, the Scale-specific Feature Gate (SFG) focuses on features of objects by processing multi-scale features extracted from the backbone network. Second, the spatial attention-guided model (SAGM) obtains spatial information of objects from the multi-scale attention maps. Third, the Selective Strong Attention Maps Model (SSAMM) adaptively selects sliding windows according to the loss values from the system’s feedback, and sends the windowed samples to the spatial attention decoder. Finally, the candidate regions and their corresponding confidences can be obtained. We evaluate the proposed network in a public dataset LEVIR and compare with several state-of-the-art methods. The proposed MSA-RPN yields a higher recall rate of region proposal generation, especially for small targets in remote sensing images.


Author(s):  
Guanqun Wang ◽  
Yin Zhuang ◽  
He Chen ◽  
Xiang Liu ◽  
Tong Zhang ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document