Deep convolutional neural network structure design for remote sensing image scene classification based on transfer learning

Author(s):  
Xiaoxia Zhang ◽  
Yong Guo ◽  
Xia Zhang


2019 ◽
Vol 12 (1) ◽  
pp. 86 ◽  
Author(s):  
Rafael Pires de Lima ◽  
Kurt Marfurt

Remote-sensing image scene classification can provide significant value, ranging from forest fire monitoring to land-use and land-cover classification. From the first aerial photographs of the early 20th century to the satellite imagery of today, the amount of remote-sensing data has increased geometrically, with ever-higher resolution. The need to analyze these modern digital data has motivated research to accelerate remote-sensing image classification. Fortunately, great advances have been made by the computer vision community in classifying natural images, i.e., photographs taken with an ordinary camera. Natural-image datasets can range up to millions of samples and are, therefore, amenable to deep-learning techniques. Many fields of science, remote sensing included, have been able to exploit the success of natural-image classification by convolutional neural network models through a technique commonly called transfer learning. We provide a systematic review of the application of transfer learning to scene classification using different datasets and different deep-learning models. We evaluate how the specialization of convolutional neural network models affects the transfer-learning process by splitting the original models at different points. As expected, we find that the choice of hyperparameters used to train the model has a significant influence on its final performance. Curiously, we find that transfer learning from models trained on larger, more generic natural-image datasets outperformed transfer learning from models trained directly on smaller remotely sensed datasets. Nonetheless, the results show that transfer learning provides a powerful tool for remote-sensing scene classification.
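
As an illustration of splitting a pre-trained model at different points, the following minimal TensorFlow/Keras sketch truncates an ImageNet-trained VGG19 at an intermediate pooling layer and attaches a new classification head. The split layer ("block3_pool") and the 21-class head (sized for UC Merced) are illustrative assumptions, not the configuration used in the paper.

import tensorflow as tf

# Load a VGG19 pre-trained on ImageNet, without its original classifier.
full = tf.keras.applications.VGG19(
    include_top=False, weights="imagenet", input_shape=(224, 224, 3))

# "Split" the network at an intermediate layer; earlier splits keep more
# generic features, later splits keep more ImageNet-specialized ones.
split_point = "block3_pool"  # illustrative; vary this to study specialization
backbone = tf.keras.Model(full.input, full.get_layer(split_point).output)
backbone.trainable = False   # transfer the features frozen

# New classification head for the remote-sensing scene classes.
model = tf.keras.Sequential([
    backbone,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(21, activation="softmax"),  # e.g., 21 UC Merced classes
])

Moving the split point earlier transfers more generic low-level features; moving it later transfers features more specialized to the source dataset, which is the effect the review evaluates.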


2020 ◽  
Author(s):  
Wenmei Li ◽  
Juan Wang ◽  
Ziteng Wang ◽  
Yu Wang ◽  
Yan Jia ◽  
...  

The deep convolutional neural network (DeCNN) is considered one of the most promising techniques for classifying high spatial resolution remote sensing (HSRRS) scenes, due to its powerful feature extraction capabilities. It is well known that large, high-quality labeled datasets are required to achieve good classification performance and prevent over-fitting when training a DeCNN model. However, the lack of such datasets often limits the application of DeCNN. To solve this problem, this paper proposes an HSRRS image scene classification method that combines transfer learning and a DeCNN (TL-DeCNN) model for few-shot HSRRS scene samples. Specifically, the weights of the convolutional layers of three typical DeCNNs (VGG19, ResNet50, and InceptionV3), each trained on ImageNet2015, are transferred to the TL-DeCNN. The TL-DeCNN then only needs to fine-tune its classification module on the few-shot HSRRS scene samples for a few epochs. Experimental results indicate that the proposed TL-DeCNN method clearly outperforms VGG19, ResNet50, and InceptionV3 trained directly on the few-shot samples, and does so without over-fitting.
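
A minimal sketch of this transfer scheme, assuming a TensorFlow/Keras setup with VGG19 as the backbone; the head width, dropout rate, learning rate, and class count are illustrative assumptions, not values reported in the paper.

import tensorflow as tf

NUM_CLASSES = 10  # hypothetical number of HSRRS scene classes

# Transfer: convolutional weights come from ImageNet pre-training and stay frozen.
base = tf.keras.applications.VGG19(
    include_top=False, weights="imagenet", input_shape=(224, 224, 3))
base.trainable = False

# Only this classification module is fine-tuned on the few-shot samples.
model = tf.keras.Sequential([
    base,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(256, activation="relu"),
    tf.keras.layers.Dropout(0.5),
    tf.keras.layers.Dense(NUM_CLASSES, activation="softmax"),
])
model.compile(optimizer=tf.keras.optimizers.Adam(1e-3),
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
# model.fit(few_shot_images, few_shot_labels, epochs=5)  # a few epochs suffice

Because only the small classification module is trained, a few epochs on the few-shot samples are enough, and the frozen convolutional features guard against over-fitting.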


Sensors ◽  
2020 ◽  
Vol 20 (7) ◽  
pp. 1999 ◽  
Author(s):  
Donghang Yu ◽  
Qing Xu ◽  
Haitao Guo ◽  
Chuan Zhao ◽  
Yuzhun Lin ◽  
...  

Classifying remote sensing images is vital for interpreting image content. Presently, remote sensing image scene classification methods using convolutional neural networks (CNNs) have drawbacks, including excessive parameters and heavy computation costs. More efficient and lightweight CNNs have fewer parameters and lower computation costs, but their classification performance is generally weaker. We propose a more efficient and lightweight convolutional neural network method to improve classification accuracy with a small training dataset. Inspired by fine-grained visual recognition, this study introduces a bilinear convolutional neural network model for scene classification. First, the lightweight convolutional neural network MobileNetV2 is used to extract deep and abstract image features. Each feature is then transformed into two features with two different convolutional layers. The transformed features are combined with a Hadamard product operation to obtain an enhanced bilinear feature. Finally, the bilinear feature, after pooling and normalization, is used for classification. Experiments are performed on three widely used datasets: UC Merced, AID, and NWPU-RESISC45. Compared with other state-of-the-art methods, the proposed method has fewer parameters and computations while achieving higher accuracy. By fusing features with bilinear pooling, performance and accuracy for remote-sensing scene classification can improve greatly. This approach could be applied to any remote sensing image classification task.
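
A minimal sketch of the described bilinear pipeline, assuming TensorFlow/Keras; the 512-channel projections, signed square-root step, and 45-class head (sized for NWPU-RESISC45) are illustrative assumptions rather than the paper's exact specification.

import tensorflow as tf
from tensorflow.keras import layers

NUM_CLASSES = 45  # e.g., NWPU-RESISC45

# Lightweight backbone extracts a deep, abstract feature map.
backbone = tf.keras.applications.MobileNetV2(
    include_top=False, weights="imagenet", input_shape=(224, 224, 3))

inputs = tf.keras.Input(shape=(224, 224, 3))
feat = backbone(inputs)                          # (H', W', C) feature map
# Two different 1x1 convolutions transform the feature into two features.
fa = layers.Conv2D(512, 1, activation="relu")(feat)
fb = layers.Conv2D(512, 1, activation="relu")(feat)
# Hadamard (element-wise) product yields the enhanced bilinear feature.
bilinear = layers.Multiply()([fa, fb])
pooled = layers.GlobalAveragePooling2D()(bilinear)
# Signed square-root and L2 normalization, as is common for bilinear pooling.
signed_sqrt = layers.Lambda(
    lambda t: tf.sign(t) * tf.sqrt(tf.abs(t) + 1e-9))(pooled)
normalized = layers.Lambda(
    lambda t: tf.math.l2_normalize(t, axis=-1))(signed_sqrt)
outputs = layers.Dense(NUM_CLASSES, activation="softmax")(normalized)

model = tf.keras.Model(inputs, outputs)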


2020 ◽  
Vol 12 (23) ◽  
pp. 4003 ◽
Author(s):  
Yansheng Li ◽  
Ruixian Chen ◽  
Yongjun Zhang ◽  
Mi Zhang ◽  
Ling Chen

As one of the fundamental tasks in remote sensing (RS) image understanding, multi-label remote sensing image scene classification (MLRSSC) is attracting increasing research interest. Human beings can easily perform MLRSSC by examining the visual elements contained in the scene and the spatio-topological relationships among these elements. However, most existing methods only perceive the visual elements and disregard their spatio-topological relationships. With this in mind, this paper proposes a novel deep learning-based MLRSSC framework, termed MLRSSC-CNN-GNN, that combines a convolutional neural network (CNN) and a graph neural network (GNN). Specifically, the CNN is employed to learn to perceive the visual elements in the scene and to generate high-level appearance features. Based on the trained CNN, a scene graph is then constructed for each scene, where the nodes of the graph are represented by superpixel regions of the scene. To fully mine the spatio-topological relationships of the scene graph, a multi-layer-integration graph attention network (GAT) model, where GAT is one of the latest developments in GNNs, is proposed to address MLRSSC. Extensive experiments on two public MLRSSC datasets show that the proposed MLRSSC-CNN-GNN obtains superior performance compared with state-of-the-art methods.
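
To make the graph-attention step concrete, here is a minimal single-head graph attention layer in TensorFlow/Keras in the spirit of GAT (Velickovic et al., 2018), operating on superpixel node features such as those a CNN would produce; the sizes in the usage lines are hypothetical, and the paper's actual multi-layer-integration GAT is more elaborate.

import tensorflow as tf

class GraphAttention(tf.keras.layers.Layer):
    """Minimal single-head graph attention layer."""
    def __init__(self, out_dim):
        super().__init__()
        self.out_dim = out_dim

    def build(self, input_shape):
        in_dim = int(input_shape[0][-1])
        self.W = self.add_weight(name="W", shape=(in_dim, self.out_dim))
        self.a = self.add_weight(name="a", shape=(2 * self.out_dim, 1))

    def call(self, inputs):
        x, adj = inputs                   # x: (N, F) node features, adj: (N, N) 0/1
        h = tf.matmul(x, self.W)          # linear projection -> (N, F')
        n = tf.shape(h)[0]
        # Attention logits e_ij = LeakyReLU(a^T [h_i || h_j]) for every node pair.
        hi = tf.tile(tf.expand_dims(h, 1), [1, n, 1])      # (N, N, F')
        hj = tf.tile(tf.expand_dims(h, 0), [n, 1, 1])      # (N, N, F')
        e = tf.einsum("ijk,kl->ijl", tf.concat([hi, hj], -1), self.a)
        e = tf.nn.leaky_relu(tf.squeeze(e, -1), alpha=0.2)  # (N, N)
        # Mask non-adjacent pairs so softmax only covers graph neighbours.
        e = tf.where(adj > 0, e, tf.fill(tf.shape(e), -1e9))
        alpha = tf.nn.softmax(e, axis=-1)
        return tf.nn.elu(tf.matmul(alpha, h))               # aggregated features

# Hypothetical usage: 6 superpixel nodes with 128-dim CNN appearance features.
feats = tf.random.normal([6, 128])
adj = tf.ones([6, 6])                      # fully connected toy graph
out = GraphAttention(64)([feats, adj])     # -> shape (6, 64)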

