A Multi-Scale Approach for Remote Sensing Scene Classification Based on Feature Maps Selection and Region Representation

Scene classification is one of the bases for automatic remote sensing image interpretation. Recently, deep convolutional neural networks have presented promising performance in high-resolution remote sensing scene classification research. In general, most researchers directly use raw deep features extracted from the convolutional networks to classify scenes. However, this strategy only considers single scale features, which cannot describe both the local and global features of images. In fact, the dissimilarity of scene targets in the same category may result in convolutional features being unable to classify them into the same category. Besides, the similarity of the global features in different categories may also lead to failure of fully connected layer features to distinguish them. To address these issues, we propose a scene classification method based on multi-scale deep feature representation (MDFR), which mainly includes two contributions: (1) region-based features selection and representation; and (2) multi-scale features fusion. Initially, the proposed method filters the multi-scale deep features extracted from pre-trained convolutional networks. Subsequently, these features are fused via two efficient fusion methods. Our method utilizes the complementarity between local features and global features by effectively exploiting the features of different scales and discarding the redundant information in features. Experimental results on three benchmark high-resolution remote sensing image datasets indicate that the proposed method is comparable to some state-of-the-art algorithms.

Download Full-text

Densely Based Multi-Scale and Multi-Modal Fully Convolutional Networks for High-Resolution Remote-Sensing Image Semantic Segmentation

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing ◽

10.1109/jstars.2019.2906387 ◽

2019 ◽

Vol 12 (8) ◽

pp. 2612-2626 ◽

Cited By ~ 10

Author(s):

Cheng Peng ◽

Yangyang Li ◽

Licheng Jiao ◽

Yanqiao Chen ◽

Ronghua Shang

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Semantic Segmentation ◽

Remote Sensing Image ◽

Convolutional Networks ◽

Multi Scale ◽

Fully Convolutional Networks

Download Full-text

An End-to-End Local-Global-Fusion Feature Extraction Network for Remote Sensing Image Scene Classification

Remote Sensing ◽

10.3390/rs11243006 ◽

2019 ◽

Vol 11 (24) ◽

pp. 3006 ◽

Cited By ~ 4

Author(s):

Yafei Lv ◽

Xiaohan Zhang ◽

Wei Xiong ◽

Yaqi Cui ◽

Mi Cai

Keyword(s):

Remote Sensing ◽

Feature Extraction ◽

Feature Fusion ◽

Remote Sensing Image ◽

Local Features ◽

Feature Representation ◽

Scene Classification ◽

Global Features ◽

End To End ◽

Fusion Feature

Remote sensing image scene classification (RSISC) is an active task in the remote sensing community and has attracted great attention due to its wide applications. Recently, the deep convolutional neural networks (CNNs)-based methods have witnessed a remarkable breakthrough in performance of remote sensing image scene classification. However, the problem that the feature representation is not discriminative enough still exists, which is mainly caused by the characteristic of inter-class similarity and intra-class diversity. In this paper, we propose an efficient end-to-end local-global-fusion feature extraction (LGFFE) network for a more discriminative feature representation. Specifically, global and local features are extracted from channel and spatial dimensions respectively, based on a high-level feature map from deep CNNs. For the local features, a novel recurrent neural network (RNN)-based attention module is first proposed to capture the spatial layout information and context information across different regions. Gated recurrent units (GRUs) is then exploited to generate the important weight of each region by taking a sequence of features from image patches as input. A reweighed regional feature representation can be obtained by focusing on the key region. Then, the final feature representation can be acquired by fusing the local and global features. The whole process of feature extraction and feature fusion can be trained in an end-to-end manner. Finally, extensive experiments have been conducted on four public and widely used datasets and experimental results show that our method LGFFE outperforms baseline methods and achieves state-of-the-art results.

Download Full-text

Semantic Multigranularity Feature Learning for High-Resolution Remote Sensing Image Scene Classification

Applied Sciences ◽

10.3390/app11199204 ◽

2021 ◽

Vol 11 (19) ◽

pp. 9204

Author(s):

Xinyi Ma ◽

Zhifeng Xiao ◽

Hong-sik Yun ◽

Seung-Jun Lee

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Spatial Information ◽

Feature Learning ◽

Remote Sensing Image ◽

Input Image ◽

Training Data ◽

Aerial Image ◽

Scene Classification ◽

Feature Maps

High-resolution remote sensing image scene classification is a challenging visual task due to the large intravariance and small intervariance between the categories. To accurately recognize the scene categories, it is essential to learn discriminative features from both global and local critical regions. Recent efforts focus on how to encourage the network to learn multigranularity features with the destruction of the spatial information on the input image at different scales, which leads to meaningless edges that are harmful to training. In this study, we propose a novel method named Semantic Multigranularity Feature Learning Network (SMGFL-Net) for remote sensing image scene classification. The core idea is to learn both global and multigranularity local features from rearranged intermediate feature maps, thus, eliminating the meaningless edges. These features are then fused for the final prediction. Our proposed framework is compared with a collection of state-of-the-art (SOTA) methods on two fine-grained remote sensing image scene datasets, including the NWPU-RESISC45 and Aerial Image Datasets (AID). We justify several design choices, including the branch granularities, fusion strategies, pooling operations, and necessity of feature map rearrangement through a comparative study. Moreover, the overall performance results show that SMGFL-Net consistently outperforms other peer methods in classification accuracy, and the superiority is more apparent with less training data, demonstrating the efficacy of feature learning of our approach.

Download Full-text

A Dual-Model Architecture with Grouping-Attention-Fusion for Remote Sensing Scene Classification

Remote Sensing ◽

10.3390/rs13030433 ◽

2021 ◽

Vol 13 (3) ◽

pp. 433

Author(s):

Junge Shen ◽

Tong Zhang ◽

Yichen Wang ◽

Ruxin Wang ◽

Qi Wang ◽

...

Keyword(s):

Remote Sensing ◽

Feature Representation ◽

Dual Model ◽

Scene Classification ◽

Remote Sensing Images ◽

Single Model ◽

Fusion Strategy ◽

Multi Scale ◽

The Arts ◽

Scene Representation

Remote sensing images contain complex backgrounds and multi-scale objects, which pose a challenging task for scene classification. The performance is highly dependent on the capacity of the scene representation as well as the discriminability of the classifier. Although multiple models possess better properties than a single model on these aspects, the fusion strategy for these models is a key component to maximize the final accuracy. In this paper, we construct a novel dual-model architecture with a grouping-attention-fusion strategy to improve the performance of scene classification. Specifically, the model employs two different convolutional neural networks (CNNs) for feature extraction, where the grouping-attention-fusion strategy is used to fuse the features of the CNNs in a fine and multi-scale manner. In this way, the resultant feature representation of the scene is enhanced. Moreover, to address the issue of similar appearances between different scenes, we develop a loss function which encourages small intra-class diversities and large inter-class distances. Extensive experiments are conducted on four scene classification datasets include the UCM land-use dataset, the WHU-RS19 dataset, the AID dataset, and the OPTIMAL-31 dataset. The experimental results demonstrate the superiority of the proposed method in comparison with the state-of-the-arts.

Download Full-text

Ensemble model with cascade attention mechanism for high-resolution remote sensing image scene classification

Optics Express ◽

10.1364/oe.395866 ◽

2020 ◽

Vol 28 (15) ◽

pp. 22358

Author(s):

Fengpeng Li ◽

Ruyi Feng ◽

Wei Han ◽

Lizhe Wang

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Remote Sensing Image ◽

Attention Mechanism ◽

Ensemble Model ◽

Scene Classification

Download Full-text

Multi-Scale Meta-Learning-Based Networks for High-Resolution Remote Sensing Scene Classification

10.1109/igarss47720.2021.9555134 ◽

2021 ◽

Author(s):

Xu Tang ◽

Weiquan Lin ◽

Chao Liu ◽

Xiao Han ◽

Wenjing Wang ◽

...

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Scene Classification ◽

Multi Scale ◽

Meta Learning

Download Full-text

Application of multi-scale segmentation algorithms for high resolution remote sensing image

Applications of Digital Image Processing XL ◽

10.1117/12.2271514 ◽

2017 ◽

Author(s):

Tingting Zhou ◽

Lingjia Gu ◽

Ruizhi Ren

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Remote Sensing Image ◽

Multi Scale ◽

Segmentation Algorithms

Download Full-text

Fusing Deep Local and Global Features for Remote Sensing Image Scene Classification

IGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium ◽

10.1109/igarss.2019.8898963 ◽

2019 ◽

Cited By ~ 1

Author(s):

Keli Yan ◽

Shaohui Mei ◽

Mingyang Ma ◽

Feng Yan

Keyword(s):

Remote Sensing ◽

Remote Sensing Image ◽

Scene Classification ◽

Global Features ◽

Local And Global Features

Download Full-text

A Multi-Scale Filtering Building Index for Building Extraction in Very High-Resolution Satellite Imagery

Remote Sensing ◽

10.3390/rs11050482 ◽

2019 ◽

Vol 11 (5) ◽

pp. 482 ◽

Cited By ~ 6

Author(s):

Qi Bi ◽

Kun Qin ◽

Han Zhang ◽

Ye Zhang ◽

Zhili Li ◽

...

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Common Knowledge ◽

Remote Sensing Image ◽

Morphological Operations ◽

Building Extraction ◽

Multi Scale ◽

Training Samples ◽

Image Building ◽

Very High

Building extraction plays a significant role in many high-resolution remote sensing image applications. Many current building extraction methods need training samples while it is common knowledge that different samples often lead to different generalization ability. Morphological building index (MBI), representing morphological features of building regions in an index form, can effectively extract building regions especially in Chinese urban regions without any training samples and has drawn much attention. However, some problems like the heavy computation cost of multi-scale and multi-direction morphological operations still exist. In this paper, a multi-scale filtering building index (MFBI) is proposed in the hope of overcoming these drawbacks and dealing with the increasing noise in very high-resolution remote sensing image. The profile of multi-scale average filtering is averaged and normalized to generate this index. Moreover, to fully utilize the relatively little spectral information in very high-resolution remote sensing image, two scenarios to generate the multi-channel multi-scale filtering index (MMFBI) are proposed. While no high-resolution remote sensing image building extraction dataset is open to the public now and the current very high-resolution remote sensing image building extraction datasets usually contain samples from the Northern American or European regions, we offer a very high-resolution remote sensing image building extraction datasets in which the samples contain multiple building styles from multiple Chinese regions. The proposed MFBI and MMFBI outperform MBI and the currently used object based segmentation method on the dataset, with a high recall and F-score. Meanwhile, the computation time of MFBI and MBI is compared on three large-scale very high-resolution satellite image and the sensitivity analysis demonstrates the robustness of the proposed method.

Download Full-text

Multi-scale segmentation of the high resolution remote sensing image

Proceedings. 2005 IEEE International Geoscience and Remote Sensing Symposium, 2005. IGARSS '05. ◽

10.1109/igarss.2005.1526648 ◽

2005 ◽

Cited By ~ 4

Author(s):

Chen Zhong ◽

Zhao Zhongmin ◽

Yan DongMei ◽

Chen Renxi

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Remote Sensing Image ◽

Multi Scale

Download Full-text