Research on Remote Sensing Image Target Recognition Based on Deep Convolution Neural Network

Author(s):  
Xiaofeng Han ◽  
Tao Jiang ◽  
Zifei Zhao ◽  
Zhongteng Lei

Target recognition is an important application in the time of high-resolution remote sensing images. However, the traditional target recognition method has the characteristics of artificial design, and the generalization ability is not strong, which makes it difficult to meet the requirement of the current mass data. Therefore, it is urgent to explore new methods for feature extraction and target recognition and location in remote sensing images. Convolutional neural network in deep learning can extract representative and discriminative multi-level features of typical features from images, so it can be used for multi-target recognition of remote sensing big data in complex scenes. In this study, NWPU VHR-10 data was selected, 50% was used for training, and the remainder was used for verification. The target recognition effects of two kinds of convolutional neural network models, Faster R-CNN and SSD, were studied and compared, and the mean average precision (mAP) was used for evaluation. The evaluation results show that the Faster R-CNN has three categories with an accuracy of more than 80%, and the SSD has seven categories with an accuracy of more than 80%, all of which show good results. The SSD model is particularly prominent in running time and recognition results, which proves convolutional neural networks have broad application prospects in the target recognition of remote sensing image data.

2019 ◽  
Vol 12 (1) ◽  
pp. 86 ◽  
Author(s):  
Rafael Pires de Lima ◽  
Kurt Marfurt

Remote-sensing image scene classification can provide significant value, ranging from forest fire monitoring to land-use and land-cover classification. Beginning with the first aerial photographs of the early 20th century to the satellite imagery of today, the amount of remote-sensing data has increased geometrically with a higher resolution. The need to analyze these modern digital data motivated research to accelerate remote-sensing image classification. Fortunately, great advances have been made by the computer vision community to classify natural images or photographs taken with an ordinary camera. Natural image datasets can range up to millions of samples and are, therefore, amenable to deep-learning techniques. Many fields of science, remote sensing included, were able to exploit the success of natural image classification by convolutional neural network models using a technique commonly called transfer learning. We provide a systematic review of transfer learning application for scene classification using different datasets and different deep-learning models. We evaluate how the specialization of convolutional neural network models affects the transfer learning process by splitting original models in different points. As expected, we find the choice of hyperparameters used to train the model has a significant influence on the final performance of the models. Curiously, we find transfer learning from models trained on larger, more generic natural images datasets outperformed transfer learning from models trained directly on smaller remotely sensed datasets. Nonetheless, results show that transfer learning provides a powerful tool for remote-sensing scene classification.


Agriculture ◽  
2021 ◽  
Vol 11 (11) ◽  
pp. 1111
Author(s):  
Abozar Nasirahmadi ◽  
Ulrike Wilczek ◽  
Oliver Hensel

Mechanical damages of sugar beet during harvesting affects the quality of the final products and sugar yield. The mechanical damage of sugar beet is assessed randomly by operators of harvesters and can depend on the subjective opinion and experience of the operator due to the complexity of the harvester machines. Thus, the main aim of this study was to determine whether a digital two-dimensional imaging system coupled with convolutional neural network (CNN) techniques could be utilized to detect visible mechanical damage in sugar beet during harvesting in a harvester machine. In this research, various detector models based on the CNN, including You Only Look Once (YOLO) v4, region-based fully convolutional network (R-FCN) and faster regions with convolutional neural network features (Faster R-CNN) were developed. Sugar beet image data during harvesting from a harvester in different farming conditions were used for training and validation of the proposed models. The experimental results showed that the YOLO v4 CSPDarknet53 method was able to detect damage in sugar beet with better performance (recall, precision and F1-score of about 92, 94 and 93%, respectively) and higher speed (around 29 frames per second) compared to the other developed CNNs. By means of a CNN-based vision system, it was possible to automatically detect sugar beet damage within the sugar beet harvester machine.


2021 ◽  
Author(s):  
Chenshuai Bai ◽  
Kaijun Wu ◽  
Dicong Wang ◽  
Hong Li ◽  
Mingjun Yan ◽  
...  

Abstract Because the detection effect of EfficientNet-YOLOv3 target detection algorithm is not very good, this paper proposes a small target detection research based on dynamic convolution neural network. Firstly, the dynamic convolutional neural network is introduced to replace the traditional convolutional neural network, which makes the algorithm model more robust; Secondly, in the training process, the optimization parameters are continuously adjusted to further strengthen the model structure; Finally, in order to prevent over fitting, the Learning Rate and Batch Size parameters are modified during the training process. remote sensing image The results of the proposed algorithm on RSOD remote sensing image data sets show that compared with the original EfficientNet-YOLOv3 algorithm, the (Average Precision, AP) value is increased by 1.93% and the (Log Average Miss Rate ,LAMR) value is reduced by 0.0500; The results of the proposed algorithm on TGRS-HRRSD remote sensing image data set show that compared with the original EfficientNet-YOLOv3 algorithm, the mAP value is increased by 0.07% and the mLAMR value is reduced by 0.0007.


2022 ◽  
Vol 2022 ◽  
pp. 1-14
Author(s):  
Xiu Zhang

Image has become one of the important carriers of visual information because of its large amount of information, easy to spread and store, and strong sense of sense. At the same time, the quality of image is also related to the completeness and accuracy of information transmission. This research mainly discusses the superresolution reconstruction of remote sensing images based on the middle layer supervised convolutional neural network. This paper designs a convolutional neural network with middle layer supervision. There are 16 layers in total, and the seventh layer is designed as an intermediate supervision layer. At present, there are many researches on traditional superresolution reconstruction algorithms and convolutional neural networks, but there are few researches that combine the two together. Convolutional neural network can obtain the high-frequency features of the image and strengthen the detailed information; so, it is necessary to study its application in image reconstruction. This article will separately describe the current research status of image superresolution reconstruction and convolutional neural networks. The middle supervision layer defines the error function of the supervision layer, which is used to optimize the error back propagation mechanism of the convolutional neural network to improve the disappearance of the gradient of the deep convolutional neural network. The algorithm training is mainly divided into four stages: the original remote sensing image preprocessing, the remote sensing image temporal feature extraction stage, the remote sensing image spatial feature extraction stage, and the remote sensing image reconstruction output layer. The last layer of the network draws on the single-frame remote sensing image SRCNN algorithm. The output layer overlaps and adds the remote sensing images of the previous layer, averages the overlapped blocks, eliminates the block effect, and finally obtains high-resolution remote sensing images, which is also equivalent to filter operation. In order to allow users to compare the superresolution effect of remote sensing images more clearly, this paper uses the Qt5 interface library to implement the user interface of the remote sensing image superresolution software platform and uses the intermediate layer convolutional neural network and the remote sensing image superresolution reconstruction algorithm proposed in this paper. When the training epoch reaches 35 times, the network has converged. At this time, the loss function converges to 0.017, and the cumulative time is about 8 hours. This research helps to improve the visual effects of remote sensing images.


Author(s):  
L. Xin

Utilizing high-resolution remote sensing images for earth observation has become the common method of land use monitoring. It requires great human participation when dealing with traditional image interpretation, which is inefficient and difficult to guarantee the accuracy. At present, the artificial intelligent method such as deep learning has a large number of advantages in the aspect of image recognition. By means of a large amount of remote sensing image samples and deep neural network models, we can rapidly decipher the objects of interest such as buildings, etc. Whether in terms of efficiency or accuracy, deep learning method is more preponderant. This paper explains the research of deep learning method by a great mount of remote sensing image samples and verifies the feasibility of building extraction via experiments.


Author(s):  
Xiaochuan Tang ◽  
Mingzhe Liu ◽  
Hao Zhong ◽  
Yuanzhen Ju ◽  
Weile Li ◽  
...  

Landslide recognition is widely used in natural disaster risk management. Traditional landslide recognition is mainly conducted by geologists, which is accurate but inefficient. This article introduces multiple instance learning (MIL) to perform automatic landslide recognition. An end-to-end deep convolutional neural network is proposed, referred to as Multiple Instance Learning–based Landslide classification (MILL). First, MILL uses a large-scale remote sensing image classification dataset to build pre-train networks for landslide feature extraction. Second, MILL extracts instances and assign instance labels without pixel-level annotations. Third, MILL uses a new channel attention–based MIL pooling function to map instance-level labels to bag-level label. We apply MIL to detect landslides in a loess area. Experimental results demonstrate that MILL is effective in identifying landslides in remote sensing images.


Sign in / Sign up

Export Citation Format

Share Document