Rotation Invariance Regularization for Remote Sensing Image Scene Classification with Convolutional Neural Networks

Deep convolutional neural networks (DCNNs) have significantly improved remote sensing image scene classification owing to their powerful feature representations. However, because the available remote sensing datasets are limited in size and exhibit high variance, DCNNs are prone to overfitting their training data. To address this problem, this paper proposes a novel scene classification framework based on a deep Siamese convolutional network with rotation invariance regularization. Specifically, we design a data augmentation strategy for the Siamese model that learns a rotation-invariant DCNN by directly enforcing the labels of the training samples before and after rotation to be mapped close to each other. In addition to the cross-entropy cost function used by traditional CNN models, we impose a rotation invariance regularization constraint on the objective function of the proposed model. Experimental results on three publicly available scene classification datasets show that the proposed method generally improves classification performance by 2 to 3% and achieves satisfactory classification performance compared with several state-of-the-art methods.
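
The objective described above (cross-entropy plus a rotation invariance term) can be illustrated with a minimal sketch. The abstract does not give the exact form of the regularizer, so the squared distance between the two predicted class distributions, the weight `lam`, and all function names below are assumptions made for illustration only, not the authors' formulation.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(probs, label):
    """Negative log-likelihood of the true class."""
    return -math.log(max(probs[label], 1e-12))

def rotate90(image):
    """Rotate a 2-D image (list of rows) by 90 degrees clockwise.
    A rotated copy like this would feed the second Siamese branch."""
    return [list(row) for row in zip(*image[::-1])]

def siamese_loss(logits_orig, logits_rot, label, lam=0.1):
    """Cross-entropy on both branches plus an assumed invariance penalty.

    logits_orig: network output for the original image
    logits_rot:  output of the same network for the rotated image
    lam:         hypothetical regularization weight
    """
    p = softmax(logits_orig)
    q = softmax(logits_rot)
    ce = cross_entropy(p, label) + cross_entropy(q, label)
    # Assumed regularizer: squared Euclidean distance between predictions.
    reg = sum((a - b) ** 2 for a, b in zip(p, q))
    return ce + lam * reg
```

When both branches produce the same prediction, the penalty term vanishes and the loss reduces to the two cross-entropy terms; the more the predictions for the original and rotated image diverge, the larger the penalty becomes.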

Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery

Remote Sensing ◽ 10.3390/rs71114680 ◽ 2015 ◽ Vol 7 (11) ◽ pp. 14680-14707 ◽ Cited By ~ 513

Author(s): Fan Hu ◽ Gui-Song Xia ◽ Jingwen Hu ◽ Liangpei Zhang

Keyword(s): Remote Sensing ◽ Neural Networks ◽ High Resolution ◽ Convolutional Neural Networks ◽ Scene Classification ◽ Deep Convolutional Neural Networks ◽ Remote Sensing Imagery

PulseNetOne: Fast Unsupervised Pruning of Convolutional Neural Networks for Remote Sensing

Remote Sensing ◽ 10.3390/rs12071092 ◽ 2020 ◽ Vol 12 (7) ◽ pp. 1092

Author(s): David Browne ◽ Michael Giering ◽ Steven Prestwich

Keyword(s): Remote Sensing ◽ Neural Networks ◽ Deep Learning ◽ Convolutional Neural Networks ◽ Data Augmentation ◽ Recognition Task ◽ Scene Recognition ◽ Training Data ◽ Learning Approach ◽ Scene Classification

Scene classification is an important aspect of image/video understanding and segmentation. However, remote-sensing scene classification is a challenging image recognition task, partly due to the limited training data, which causes deep-learning Convolutional Neural Networks (CNNs) to overfit. Another difficulty is that images often differ widely in scale and orientation (viewing angle). Yet another is that the resulting networks may be very large, again making them prone to overfitting and unsuitable for deployment on memory- and energy-limited devices. We propose an efficient deep-learning approach to tackle these problems. We use transfer learning to compensate for the lack of data, and data augmentation to tackle varying scale and orientation. To reduce network size, we use a novel unsupervised learning approach based on k-means clustering, applied to all parts of the network. By contrast, most network reduction methods use computationally expensive supervised learning and apply only to the convolutional layers or only to the fully connected layers, not to both. In experiments, we set new standards in classification accuracy on four remote-sensing and two scene-recognition image datasets.
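
The abstract does not say how k-means is applied to shrink the network, so the following is only a generic sketch of clustering-based pruning, not the PulseNetOne procedure. It assumes each filter is flattened to a weight vector, clusters the vectors with a plain k-means (deterministically initialized from the first `k` filters), and keeps the filter closest to each centroid. The names `kmeans` and `prune_filters` are hypothetical.

```python
def sqdist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, k, iters=20):
    """Plain k-means; centroids start at the first k points (an assumption
    made so that the sketch is deterministic)."""
    centroids = [list(p) for p in points[:k]]
    assign = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest centroid for every point.
        assign = [min(range(k), key=lambda j: sqdist(p, centroids[j]))
                  for p in points]
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, a in zip(points, assign) if a == j]
            if members:
                centroids[j] = [sum(col) / len(members)
                                for col in zip(*members)]
    return centroids, assign

def prune_filters(filters, k):
    """Return the indices of filters to keep: one representative per
    cluster, chosen as the member closest to the cluster centroid."""
    centroids, assign = kmeans(filters, k)
    keep = []
    for j, centroid in enumerate(centroids):
        members = [i for i, a in enumerate(assign) if a == j]
        if members:
            keep.append(min(members,
                            key=lambda i: sqdist(filters[i], centroid)))
    return sorted(keep)

# Four toy "filters": three nearly redundant ones and one distinct one.
filters = [[1.0, 0.0], [0.0, 1.0], [0.8, 0.0], [0.9, 0.0]]
kept = prune_filters(filters, k=2)
```

Because no labels are involved, a step of this kind is unsupervised, which is the property the abstract contrasts with supervised reduction methods.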

Assessment and Impact of Feature Extraction Methods for Hyperspectral Remote Sensing Image Classification Based on Deep Convolutional Neural Networks

10.1109/icirca51532.2021.9544558 ◽ 2021

Author(s): Venkata Gopi Mandoori ◽ Radhesyam Vaddi

Keyword(s): Remote Sensing ◽ Neural Networks ◽ Feature Extraction ◽ Image Classification ◽ Convolutional Neural Networks ◽ Remote Sensing Image ◽ Extraction Methods ◽ Deep Convolutional Neural Networks ◽ Remote Sensing Image Classification ◽ Hyperspectral Remote Sensing Image

Enhanced Interactive Remote Sensing Image Retrieval with Scene Classification Convolutional Neural Networks Model

IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium ◽ 10.1109/igarss.2018.8518388 ◽ 2018 ◽ Cited By ~ 2

Author(s): Yaakoub Boualleg ◽ Mohamed Farah

Keyword(s): Remote Sensing ◽ Neural Networks ◽ Image Retrieval ◽ Convolutional Neural Networks ◽ Remote Sensing Image ◽ Scene Classification

FusionCNN: a remote sensing image fusion algorithm based on deep convolutional neural networks

Multimedia Tools and Applications ◽ 10.1007/s11042-018-6850-3 ◽ 2018 ◽ Vol 78 (11) ◽ pp. 14683-14703 ◽ Cited By ~ 8

Author(s): Fajie Ye ◽ Xiongfei Li ◽ Xiaoli Zhang

Keyword(s): Remote Sensing ◽ Neural Networks ◽ Image Fusion ◽ Convolutional Neural Networks ◽ Remote Sensing Image ◽ Deep Convolutional Neural Networks ◽ Fusion Algorithm ◽ Remote Sensing Image Fusion

Remote Sensing Image Classification via Improved Cross-Entropy Loss and Transfer Learning Strategy Based on Deep Convolutional Neural Networks

IEEE Geoscience and Remote Sensing Letters ◽ 10.1109/lgrs.2019.2937872 ◽ 2020 ◽ Vol 17 (6) ◽ pp. 1087-1091 ◽ Cited By ~ 2

Author(s): Ali Bahri ◽ Sina Ghofrani Majelan ◽ Sina Mohammadi ◽ Mehrdad Noori ◽ Karim Mohammadi

Keyword(s): Remote Sensing ◽ Neural Networks ◽ Image Classification ◽ Transfer Learning ◽ Convolutional Neural Networks ◽ Learning Strategy ◽ Remote Sensing Image ◽ Cross Entropy ◽ Deep Convolutional Neural Networks ◽ Remote Sensing Image Classification

Vision Transformers for Remote Sensing Image Classification

Remote Sensing ◽ 10.3390/rs13030516 ◽ 2021 ◽ Vol 13 (3) ◽ pp. 516

Author(s): Yakoub Bazi ◽ Laila Bashmal ◽ Mohamad M. Al Rahhal ◽ Reham Al Dayil ◽ Naif Al Ajlan

Keyword(s): Remote Sensing ◽ Language Processing ◽ Additional Data ◽ Data Augmentation ◽ State Of The Art ◽ Remote Sensing Image ◽ Classification Performance ◽ Scene Classification ◽ Remote Sensing Image Classification ◽ Augmentation Strategies

In this paper, we propose a remote-sensing scene-classification method based on vision transformers. These networks, which are now recognized as state-of-the-art models in natural language processing, do not rely on convolution layers as standard convolutional neural networks (CNNs) do. Instead, they use multihead attention mechanisms as the main building block to derive long-range contextual relations between pixels in images. In a first step, the images under analysis are divided into patches, which are then converted to a sequence by flattening and embedding. To retain positional information, a position embedding is added to these patches. The resulting sequence is then fed to several multihead attention layers to generate the final representation. At the classification stage, the first token of the sequence is fed to a softmax classification layer. To boost the classification performance, we explore several data augmentation strategies to generate additional data for training. Moreover, we show experimentally that we can compress the network by pruning half of the layers while keeping competitive classification accuracies. Experimental results on different remote-sensing image datasets demonstrate the promising capability of the model compared to state-of-the-art methods. Specifically, Vision Transformer obtains an average classification accuracy of 98.49%, 95.86%, 95.56% and 93.83% on Merced, AID, Optimal31 and NWPU datasets, respectively. The compressed version, obtained by removing half of the multihead attention layers, yields 97.90%, 94.27%, 95.30% and 93.05%, respectively.
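
The tokenization steps described above (split into patches, flatten, embed, add position information, with the first token used for classification) can be sketched in a few lines. This is a simplified single-channel illustration under stated assumptions: the embedding matrix `weights`, the `cls_token`, and the `pos_embed` table would be learned parameters in a real model and are passed in here as plain lists, and the function names are hypothetical.

```python
def image_to_patches(image, patch):
    """Split an H x W image (list of rows) into non-overlapping
    patch x patch blocks, each flattened row by row."""
    h, w = len(image), len(image[0])
    assert h % patch == 0 and w % patch == 0, "image must tile evenly"
    patches = []
    for top in range(0, h, patch):
        for left in range(0, w, patch):
            patches.append([image[r][c]
                            for r in range(top, top + patch)
                            for c in range(left, left + patch)])
    return patches

def linear_embed(vec, weights):
    """Project a flattened patch with a d x len(vec) weight matrix."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]

def tokenize(image, patch, weights, cls_token, pos_embed):
    """Build the input sequence: [class token] + embedded patches,
    with one position embedding added to every token."""
    tokens = [list(cls_token)]
    tokens += [linear_embed(p, weights)
               for p in image_to_patches(image, patch)]
    assert len(tokens) == len(pos_embed), "one position vector per token"
    return [[t + e for t, e in zip(tok, pos)]
            for tok, pos in zip(tokens, pos_embed)]
```

For a 4 x 4 image with 2 x 2 patches, this yields five tokens: the class token followed by four patch embeddings. After the attention layers, the representation at index 0 is the one that would be passed to the softmax classifier.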
