Multi-scale attentive region adaptive aggregation learning for remote sensing scene classification

Remote sensing images contain complex backgrounds and multi-scale objects, which make scene classification challenging. Classification performance depends heavily on the capacity of the scene representation and on the discriminability of the classifier. Although multiple models can outperform a single model in both respects, the strategy used to fuse them is key to maximizing the final accuracy. In this paper, we construct a novel dual-model architecture with a grouping-attention-fusion strategy to improve the performance of scene classification. Specifically, the model employs two different convolutional neural networks (CNNs) for feature extraction, and the grouping-attention-fusion strategy fuses the features of the two CNNs in a fine-grained, multi-scale manner, which enhances the resulting feature representation of the scene. Moreover, to address the issue of similar appearances between different scenes, we develop a loss function that encourages small intra-class diversity and large inter-class distances. Extensive experiments are conducted on four scene classification datasets: the UCM land-use dataset, the WHU-RS19 dataset, the AID dataset, and the OPTIMAL-31 dataset. The experimental results demonstrate the superiority of the proposed method over state-of-the-art methods.
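
The abstract above does not give the form of its loss function. As a rough illustration of the general idea (small intra-class spread, large inter-class distance), the following NumPy sketch combines a centre-based pull term with a hinge penalty on centre distances. The function name `center_margin_loss`, the `margin` parameter, and the specific terms are assumptions made for this example, not the authors' formulation.

```python
import numpy as np

def center_margin_loss(features, labels, margin=1.0):
    """Illustrative loss (an assumption, not the paper's): pull samples
    toward their class centre and push class centres apart."""
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    classes = np.unique(labels)  # sorted unique class ids
    centers = np.stack([features[labels == c].mean(axis=0) for c in classes])

    # Intra-class term: mean squared distance of each sample to its own centre.
    idx = np.searchsorted(classes, labels)
    intra = np.mean(np.sum((features - centers[idx]) ** 2, axis=1))

    # Inter-class term: hinge penalty when two centres are closer than `margin`.
    diff = centers[:, None, :] - centers[None, :, :]
    dist = np.sqrt(np.sum(diff ** 2, axis=-1))
    rows, cols = np.triu_indices(len(classes), k=1)
    inter = np.mean(np.maximum(0.0, margin - dist[rows, cols])) if rows.size else 0.0

    return intra + inter
```

With tight, well-separated classes both terms vanish; when class centres coincide, the hinge term contributes up to `margin`.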

Multi-Scale Convolutional Neural Network for Remote Sensing Scene Classification

2018 IEEE International Conference on Electro/Information Technology (EIT) ◽

10.1109/eit.2018.8500107 ◽

2018 ◽

Cited By ~ 5

Author(s):

Haikel Alhichri ◽

Naif Alajlan ◽

Yakoub Bazi ◽

Timon Rabczuk

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Convolutional Neural Network ◽

Scene Classification ◽

Multi Scale

Multi-Scale Meta-Learning-Based Networks for High-Resolution Remote Sensing Scene Classification

10.1109/igarss47720.2021.9555134 ◽

2021 ◽

Author(s):

Xu Tang ◽

Weiquan Lin ◽

Chao Liu ◽

Xiao Han ◽

Wenjing Wang ◽

...

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Scene Classification ◽

Multi Scale ◽

Meta Learning

SEMSDNet: A Multi-Scale Dense Network with Attention for Remote Sensing Scene Classification

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing ◽

10.1109/jstars.2021.3074508 ◽

2021 ◽

pp. 1-1

Author(s):

Tian Tian ◽

Lingling Li ◽

Weitao Chen ◽

Huabing Zhou

Keyword(s):

Remote Sensing ◽

Dense Network ◽

Scene Classification ◽

Multi Scale

A Multi-Scale Feature Aggregation Network Based on Channel-Spatial Attention for Remote Sensing Scene Classification

10.1109/igarss47720.2021.9554855 ◽

2021 ◽

Author(s):

Ming Li ◽

Lin Lei ◽

Xiao Li ◽

Yuli Sun

Keyword(s):

Remote Sensing ◽

Spatial Attention ◽

Scene Classification ◽

Scale Feature ◽

Multi Scale ◽

Feature Aggregation

A Multi-Scale Approach for Remote Sensing Scene Classification Based on Feature Maps Selection and Region Representation

Remote Sensing ◽

10.3390/rs11212504 ◽

2019 ◽

Vol 11 (21) ◽

pp. 2504 ◽

Cited By ~ 3

Author(s):

Jun Zhang ◽

Min Zhang ◽

Lukui Shi ◽

Wenjie Yan ◽

Bin Pan

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Remote Sensing Image ◽

Feature Representation ◽

Features Selection ◽

Scene Classification ◽

Feature Maps ◽

Global Features ◽

Convolutional Networks ◽

Multi Scale

Scene classification is one of the foundations of automatic remote sensing image interpretation. Recently, deep convolutional neural networks have shown promising performance in high-resolution remote sensing scene classification. Most researchers directly use raw deep features extracted from the convolutional networks to classify scenes. However, this strategy considers only single-scale features, which cannot describe both the local and global features of images. In fact, dissimilar scene targets within the same category may prevent convolutional features from assigning them to the same category. In addition, similar global features across different categories may prevent fully connected layer features from distinguishing them. To address these issues, we propose a scene classification method based on multi-scale deep feature representation (MDFR), which makes two main contributions: (1) region-based feature selection and representation; and (2) multi-scale feature fusion. First, the proposed method filters the multi-scale deep features extracted from pre-trained convolutional networks. These features are then fused via two efficient fusion methods. Our method exploits the complementarity between local and global features by effectively using features of different scales and discarding redundant information. Experimental results on three benchmark high-resolution remote sensing image datasets indicate that the proposed method is comparable to some state-of-the-art algorithms.
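
The abstract does not name its two fusion methods. As a minimal sketch of what fusing features from different scales can look like, the NumPy example below pools each feature map globally, L2-normalises it, and concatenates the results. The function name `fuse_multiscale` and the choice of concatenation are assumptions for illustration only, not the MDFR procedure.

```python
import numpy as np

def fuse_multiscale(feature_maps):
    """Pool each (C, H, W) feature map to a C-vector, L2-normalise it,
    and concatenate. Concatenation is an assumed, generic fusion choice."""
    pooled = []
    for fmap in feature_maps:
        vec = np.asarray(fmap, dtype=float).mean(axis=(1, 2))  # global average pooling
        norm = np.linalg.norm(vec)
        # Normalise so that no single scale dominates by magnitude alone.
        pooled.append(vec / norm if norm > 0 else vec)
    return np.concatenate(pooled)
```

Normalising before concatenation keeps a layer with large activations from swamping the others in a downstream linear classifier.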

Remote Sensing Image Scene Classification Using Multi-Scale Completed Local Binary Patterns and Fisher Vectors

Remote Sensing ◽

10.3390/rs8060483 ◽

2016 ◽

Vol 8 (6) ◽

pp. 483 ◽

Cited By ~ 66

Author(s):

Longhui Huang ◽

Chen Chen ◽

Wei Li ◽

Qian Du

Keyword(s):

Remote Sensing ◽

Local Binary Patterns ◽

Remote Sensing Image ◽

Scene Classification ◽

Multi Scale

Multi-scale stacking attention pooling for remote sensing scene classification

Neurocomputing ◽

10.1016/j.neucom.2021.01.038 ◽

2021 ◽

Vol 436 ◽

pp. 147-161

Author(s):

Qi Bi ◽

Han Zhang ◽

Kun Qin

Keyword(s):

Remote Sensing ◽

Scene Classification ◽

Multi Scale

Band-Wise Multi-Scale CNN Architecture for Remote Sensing Image Scene Classification

IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss39084.2020.9323214 ◽

2020 ◽

Author(s):

Jian Kang ◽

Begum Demir

Keyword(s):

Remote Sensing ◽

Remote Sensing Image ◽

Scene Classification ◽

Multi Scale

SAFFNet: Self-Attention-Based Feature Fusion Network for Remote Sensing Few-Shot Scene Classification

Remote Sensing ◽

10.3390/rs13132532 ◽

2021 ◽

Vol 13 (13) ◽

pp. 2532

Author(s):

Joseph Kim ◽

Mingmin Chi

Keyword(s):

Remote Sensing ◽

Large Scale ◽

Feature Fusion ◽

Feature Weighting ◽

Training Dataset ◽

Scene Classification ◽

Remote Sensing Classification ◽

Scale Feature ◽

Multi Scale ◽

Shot Classification

In real applications, it is often necessary to classify new, unseen classes that are not represented in the training dataset. To solve this problem, few-shot learning methods are usually adopted to recognize new categories from only a few (out-of-bag) labeled samples, together with the known classes available in the (large-scale) training dataset. Unlike common scene classification images obtained by CCD (Charge-Coupled Device) cameras, remote sensing scene classification datasets tend to be rich in texture features rather than shape features. It is therefore important to extract more valuable texture semantic features from a limited number of labeled input images. In this paper, we propose SAFFNet, a multi-scale feature fusion network for few-shot remote sensing scene classification that integrates a novel self-attention feature selection module. Unlike a pyramidal feature hierarchy for object detection, the informative representations of the images with different receptive fields are automatically selected and re-weighted for feature fusion, after the refining network and global pooling operation, for the few-shot remote sensing classification task. Here, the feature weighting values can be fine-tuned on the support set of the few-shot learning task. The proposed model is evaluated on three publicly available datasets for few-shot remote sensing scene classification. Experimental results demonstrate that the proposed SAFFNet significantly improves few-shot classification accuracy compared with other few-shot methods and with a typical multi-scale feature fusion network.
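
The abstract describes re-weighting pooled features from different receptive fields before fusing them, without giving the module's details. The NumPy sketch below shows one generic way to do this: score each scale's vector against a query vector, turn the scores into softmax weights, and take the weighted sum. The names `attention_fuse` and `query` (standing in for a learnable parameter) are hypothetical and do not reflect the actual SAFFNet design.

```python
import numpy as np

def attention_fuse(scale_features, query):
    """Re-weight per-scale feature vectors with softmax attention.

    scale_features: (S, D) array, one pooled vector per scale.
    query: (D,) vector, a stand-in for a learnable parameter (assumed).
    Returns the fused (D,) vector and the (S,) attention weights.
    """
    scale_features = np.asarray(scale_features, dtype=float)
    query = np.asarray(query, dtype=float)
    # Scaled dot-product score for each scale.
    scores = scale_features @ query / np.sqrt(scale_features.shape[1])
    scores = scores - scores.max()  # subtract max for numerical stability
    weights = np.exp(scores) / np.exp(scores).sum()
    return weights @ scale_features, weights
```

In a few-shot setting, `query` would be the quantity adjusted on the support set; here it is simply passed in as a fixed vector.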
