scholarly journals Aircraft Detection in Remote Sensing Image based on Multi-scale Convolution Neural Network with Attention Mechanism

Author(s):  
Xianfeng Wang ◽  
Changqing Yu ◽  
Lei Huang ◽  
Shanwen Zhang

Abstract Detecting aircraft from remote sensing image (RSI) is an important but challenging task due to the variations of aircraft type, size, pose, angle, complex background and small size of aircraft in RSIs. An aircraft detection method is proposed based on multi-scale convolution neural network with attention (MSCNNA), consisting of encoder, decoder, attention and classification. In MSCNNA, the multiple convolutional and pooling kernels with different sizes are utilized to learn the multi-scale discriminant features, and the global attention mechanism (GAM) is employed to capture the spatial and channel dependencies and adaptively preserve the relationships of the entire image. Compared with the standard deep CNN, multi-scale convolution neural networks (CNN) and GAM are integrated to learn the multi-scale features for aircraft detection, especially small aircrafts. Experiment results on the aircraft image dataset of the public EORSSD dataset show that the proposed method outperforms the state-of-the-art method on the same dataset and the obtained multi-size aircraft edge is clearer.

2018 ◽  
Vol 55 (2) ◽  
pp. 022802
Author(s):  
方旭 Fang Xu ◽  
王光辉 Wang Guanghui ◽  
杨化超 Yang Huachao ◽  
刘慧杰 Liu Huijie ◽  
闫立波 Yan Libo

2021 ◽  
Vol 13 (13) ◽  
pp. 2457
Author(s):  
Xuan Wu ◽  
Zhijie Zhang ◽  
Wanchang Zhang ◽  
Yaning Yi ◽  
Chuanrong Zhang ◽  
...  

Convolutional neural network (CNN) is capable of automatically extracting image features and has been widely used in remote sensing image classifications. Feature extraction is an important and difficult problem in current research. In this paper, data augmentation for avoiding over fitting was attempted to enrich features of samples to improve the performance of a newly proposed convolutional neural network with UC-Merced and RSI-CB datasets for remotely sensed scene classifications. A multiple grouped convolutional neural network (MGCNN) for self-learning that is capable of promoting the efficiency of CNN was proposed, and the method of grouping multiple convolutional layers capable of being applied elsewhere as a plug-in model was developed. Meanwhile, a hyper-parameter C in MGCNN is introduced to probe into the influence of different grouping strategies for feature extraction. Experiments on the two selected datasets, the RSI-CB dataset and UC-Merced dataset, were carried out to verify the effectiveness of this newly proposed convolutional neural network, the accuracy obtained by MGCNN was 2% higher than the ResNet-50. An algorithm of attention mechanism was thus adopted and incorporated into grouping processes and a multiple grouped attention convolutional neural network (MGCNN-A) was therefore constructed to enhance the generalization capability of MGCNN. The additional experiments indicate that the incorporation of the attention mechanism to MGCNN slightly improved the accuracy of scene classification, but the robustness of the proposed network was enhanced considerably in remote sensing image classifications.


Sign in / Sign up

Export Citation Format

Share Document